Fast Value Iteration for Goal-Directed Markov Decision Processes
Abstract
Planning problems where effects of actions are non-deterministic can be modeled as Markov decision processes. Planning problems are usually goal-directed. This paper proposes several techniques for exploiting the goal-directedness to accelerate value iteration, a standard algorithm for solving Markov decision processes. Empirical studies have shown that the techniques can bring about significant speedups.
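For readers unfamiliar with the baseline, here is a minimal sketch of plain value iteration on a small goal-directed problem. It does not implement the acceleration techniques proposed in the paper, which the abstract does not describe. The three-state example, the unit step cost, and all names (`value_iteration`, `transitions`, `step_cost`) are assumptions made for illustration only.

```python
from typing import Dict, List, Tuple

# transitions[state][action] = list of (probability, next_state) pairs.
# This toy problem is hypothetical and not taken from the paper.
Transitions = Dict[str, Dict[str, List[Tuple[float, str]]]]


def value_iteration(
    transitions: Transitions,
    goal: str,
    step_cost: float = 1.0,
    tol: float = 1e-9,
    max_sweeps: int = 10_000,
) -> Dict[str, float]:
    """Estimate the minimal expected cost-to-goal for every state."""
    values = {state: 0.0 for state in transitions}
    values[goal] = 0.0  # the goal is absorbing and costs nothing

    for _ in range(max_sweeps):
        delta = 0.0
        for state, actions in transitions.items():
            if state == goal:
                continue
            # Bellman backup: pay one step, then the expected cost of the successor.
            best = min(
                step_cost + sum(p * values[nxt] for p, nxt in outcomes)
                for outcomes in actions.values()
            )
            delta = max(delta, abs(best - values[state]))
            values[state] = best  # in-place (Gauss-Seidel style) update
        if delta < tol:
            break
    return values


toy_mdp: Transitions = {
    "a": {
        "go": [(0.5, "b"), (0.5, "a")],        # may slip and stay in "a"
        "risky": [(0.1, "goal"), (0.9, "a")],  # rarely jumps straight to the goal
    },
    "b": {"go": [(1.0, "goal")]},
    "goal": {},
}

costs = value_iteration(toy_mdp, goal="goal")
```

In this toy problem, solving the Bellman equations by hand gives an expected cost of 1 for state `b` and 3 for state `a` (via `go`; the `risky` action alone would cost 10), and the sweep converges to those values. Each sweep backs up every non-goal state regardless of its relation to the goal, which is the kind of uniform work that goal-directed techniques aim to reduce.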
- Publication: arXiv e-prints
- Pub Date: February 2013
- DOI: 10.48550/arXiv.1302.1575
- arXiv: arXiv:1302.1575
- Bibcode: 2013arXiv1302.1575L
- Keywords: Computer Science - Artificial Intelligence
- E-Print: Appears in Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence (UAI 1997)