On Strategic Measures and Optimality Properties in Discrete-Time Stochastic Control with Universally Measurable Policies
Abstract
This paper concerns discrete-time infinite-horizon stochastic control systems with Borel state and action spaces and universally measurable policies. We study optimization problems on strategic measures induced by the policies in these systems. The results are then applied to risk-neutral and risk-sensitive Markov decision processes, as well as their partially observable counterparts, to establish the measurability of the optimal value functions and the existence of universally measurable, randomized or nonrandomized, $\epsilon$-optimal policies, for a variety of average cost criteria and risk criteria. We also extend our analysis to a class of minimax control problems and establish similar optimality results under the axiom of analytic determinacy.
- Publication:
-
arXiv e-prints
- Pub Date:
- June 2022
- DOI:
- arXiv:
- arXiv:2206.06492
- Bibcode:
- 2022arXiv220606492Y
- Keywords:
-
- Mathematics - Optimization and Control;
- 90C40;
- 93E20;
- 60J05
- E-Print:
- 37 pages. This version corrects minor typos. Some of the results in this work are improvements of the author's earlier results given in Section 3.1 of arXiv:2104.00181v1. A shorter version of this paper is to be published in the journal Mathematics of Operations Research