Occupation measures arising in finite stochastic games
Abstract
Shapley (1953) introduced two-player zero-sum discounted stochastic games, henceforth stochastic games, a model where a state variable follows a two-controlled Markov chain, the players receive rewards at each stage which add up to $0$, and each maximizes the normalized $\lambda$-discounted sum of stage rewards, for some fixed discount rate $\lambda\in(0,1]$. In this paper, we study asymptotic occupation measures arising in these games, as the discount rate goes to $0$.
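For concreteness, the two objects named above can be written out as follows. This is a sketch in standard notation that is assumed here and does not appear in the abstract: $k_m$ is the state at stage $m$, $g_m$ the stage-$m$ reward to the first player, $k_1$ the initial state, and $(\sigma,\tau)$ a pair of strategies.

$$
\gamma_\lambda(k_1,\sigma,\tau) = \mathbb{E}^{k_1}_{\sigma,\tau}\Big[\sum_{m\ge 1}\lambda(1-\lambda)^{m-1} g_m\Big],
\qquad
\pi_\lambda(k) = \mathbb{E}^{k_1}_{\sigma,\tau}\Big[\sum_{m\ge 1}\lambda(1-\lambda)^{m-1}\,\mathbf{1}_{\{k_m=k\}}\Big].
$$

Since $\sum_{m\ge 1}\lambda(1-\lambda)^{m-1}=1$ for every $\lambda\in(0,1]$, the weights form a probability distribution over stages, which is why the sum is called normalized and why $\pi_\lambda$ is a probability measure over states. Under this reading, the asymptotic question concerns the behavior of such measures as $\lambda\to 0$.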
- Publication: arXiv e-prints
- Pub Date: March 2020
- arXiv: arXiv:2003.02487
- Bibcode: 2020arXiv200302487J
- Keywords: Mathematics - Optimization and Control; Mathematics - Probability; 91A06; 91A15