Foolproof Cooperative Learning

doi:10.48550/arXiv.1906.09831

Foolproof Cooperative Learning

This paper extends the notion of learning equilibrium in game theory from matrix games to stochastic games. We introduce Foolproof Cooperative Learning (FCL), an algorithm that converges to a Tit-for-Tat behavior. It allows cooperative strategies when played against itself while being not exploitable by selfish players. We prove that in repeated symmetric games, this algorithm is a learning equilibrium. We illustrate the behavior of FCL on symmetric matrix and grid games, and its robustness to selfish learners.

Publication:

arXiv e-prints

Pub Date:

June 2019

DOI:

10.48550/arXiv.1906.09831

arXiv:

arXiv:1906.09831

Bibcode:

2019arXiv190609831J

Keywords:

Computer Science - Computer Science and Game Theory;
Computer Science - Artificial Intelligence;
Computer Science - Machine Learning

E-Print:

Proceedings of The 12th Asian Conference on Machine Learning, PMLR 129:401-416, 2020

NASA/ADS

Foolproof Cooperative Learning

Abstract