Occupation measures arising in finite stochastic games

doi:10.48550/arXiv.2003.02487

Shapley (1953) introduced two-player zero-sum discounted stochastic games, henceforth stochastic games. In this model a state variable follows a Markov chain controlled by both players, the players receive stage rewards that sum to $0$, and each player maximizes the normalized $\lambda$-discounted sum of stage rewards, for some fixed discount rate $\lambda\in(0,1]$. In this paper, we study the asymptotic occupation measures arising in these games as the discount rate goes to $0$.
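
To make the objective concrete, the normalized discounted sum can be written out under the usual convention for a discount rate $\lambda$. The symbols $g_m$ (reward at stage $m$) and $k_m$ (state at stage $m$) are notation assumed here for illustration and do not appear in the abstract:

$$
\sum_{m\ge 1} \lambda(1-\lambda)^{m-1}\, g_m ,
\qquad\text{where}\qquad
\sum_{m\ge 1} \lambda(1-\lambda)^{m-1} = 1 \quad\text{for } \lambda\in(0,1].
$$

Because the weights sum to $1$, the same weights define a probability distribution over states. One natural reading of a discounted occupation measure, again as an assumption rather than the paper's stated definition, assigns to each state $k$ the mass $\sum_{m\ge 1}\lambda(1-\lambda)^{m-1}\,\mathbb{P}(k_m = k)$; the asymptotic regime then concerns the behavior of such measures as $\lambda \to 0$.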

Publication: arXiv e-prints

Pub Date: March 2020

DOI: 10.48550/arXiv.2003.02487

arXiv: arXiv:2003.02487

Bibcode: 2020arXiv200302487J

Keywords: Mathematics - Optimization and Control; Mathematics - Probability; 91A06; 91A15
