Fast Value Iteration for Goal-Directed Markov Decision Processes

doi:10.48550/arXiv.1302.1575

Fast Value Iteration for Goal-Directed Markov Decision Processes

Planning problems where effects of actions are non-deterministic can be modeled as Markov decision processes. Planning problems are usually goal-directed. This paper proposes several techniques for exploiting the goal-directedness to accelerate value iteration, a standard algorithm for solving Markov decision processes. Empirical studies have shown that the techniques can bring about significant speedups.

Publication:

arXiv e-prints

Pub Date:

February 2013

DOI:

10.48550/arXiv.1302.1575

arXiv:

arXiv:1302.1575

Bibcode:

2013arXiv1302.1575L

Keywords:

Computer Science - Artificial Intelligence

E-Print:

Appears in Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence (UAI1997)

NASA/ADS

Fast Value Iteration for Goal-Directed Markov Decision Processes

Abstract