Dynamic Programming for Pure-Strategy Subgame Perfection in an Arbitrary Game
Abstract
This paper uses value functions to characterize the pure-strategy subgame-perfect equilibria of an arbitrary, possibly infinite-horizon game. It specifies the game's extensive form as a pentaform (Streufert 2023p, arXiv:2107.10801v4), which is a set of quintuples formalizing the abstract relationships between nodes, actions, players, and situations (situations generalize information sets). Because a pentaform is a set, this paper can explicitly partition the game form into piece forms, each of which starts at a (Selten) subroot and contains all subsequent nodes except those that follow a subsequent subroot. Then the set of subroots becomes the domain of a value function, and the piece-form partition becomes the framework for a value recursion which generalizes the Bellman equation from dynamic programming. The main results connect the value recursion with the subgame-perfect equilibria of the original game, under the assumptions of upper- and lower-convergence. Finally, a corollary characterizes subgame perfection as the absence of an improving one-piece deviation.
- Publication:
-
arXiv e-prints
- Pub Date:
- February 2023
- DOI:
- arXiv:
- arXiv:2302.03855
- Bibcode:
- 2023arXiv230203855S
- Keywords:
-
- Economics - Theoretical Economics;
- Computer Science - Computer Science and Game Theory;
- Mathematics - Optimization and Control;
- 90C39;
- 91A18
- E-Print:
- 56 pages, 7 figures. Version 3 provides better non-convergent examples in Section 2.4