Killed Markov Decision Processes on Finite Time Interval for Countable Models
Abstract
We consider killed Markov decision processes for countable models on a finite time interval. We prove the existence of a uniform $\varepsilon$-optimal policy and establish the validity of the fundamental equation. The optimal control problem is reduced to a similar problem for the derived model. We obtain an optimality equation and a method for constructing simple optimal policies, and we prove that simple policies are sufficient for countable models. We also verify the Markovian property and consider a dynamic programming principle.
- Publication: arXiv e-prints
- Pub Date: April 2013
- DOI: 10.48550/arXiv.1304.2495
- arXiv: arXiv:1304.2495
- Bibcode: 2013arXiv1304.2495P
- Keywords: Mathematics - Optimization and Control; Mathematics - Probability; 90C40
- E-Print: Revised and corrected version