Dynamic social learning under graph constraints
Abstract
We introduce a model of graph-constrained dynamic choice with reinforcement modeled by positively $\alpha$-homogeneous rewards. We show that its empirical process, which can be written as a stochastic approximation recursion with Markov noise, has the same probability law as a certain vertex reinforced random walk. We use this equivalence to show that, for $\alpha > 0$, the asymptotic outcome concentrates around the optimum in a certain limiting sense when "annealed" by letting $\alpha \uparrow \infty$ slowly.
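To make the notion of a vertex reinforced random walk concrete, here is a minimal Python sketch of a generic one on a small graph. It is an illustration under stated assumptions, not the paper's model: the function name `vrrw_empirical`, the adjacency-dict representation, the initial count of 1 per vertex, and the choice of reinforcement weight (visit count raised to the power $\alpha$) are all hypothetical, and the annealing of $\alpha$ is not modeled.

```python
import random

def vrrw_empirical(adj, alpha, steps, seed=0):
    """Simulate a generic vertex reinforced random walk (illustrative only).

    adj   : dict mapping each vertex to a list of its neighbours (assumed format)
    alpha : reinforcement exponent; weight of a vertex is (visit count) ** alpha
            (an assumed weight, not taken from the paper)
    steps : number of moves to simulate
    Returns the empirical visit frequencies as a dict vertex -> fraction.
    """
    rng = random.Random(seed)
    nodes = sorted(adj)
    # Assumption: every vertex starts with count 1 so all weights are positive.
    counts = {v: 1 for v in nodes}
    current = nodes[0]
    for _ in range(steps):
        nbrs = adj[current]
        # The graph constraint: only neighbours of the current vertex are eligible.
        weights = [counts[v] ** alpha for v in nbrs]
        current = rng.choices(nbrs, weights=weights, k=1)[0]
        counts[current] += 1
    total = sum(counts.values())
    return {v: counts[v] / total for v in nodes}

# Example: a triangle graph with a fixed exponent.
triangle = {0: [1, 2], 1: [0, 2], 2: [0, 1]}
frequencies = vrrw_empirical(triangle, alpha=2.0, steps=1000, seed=1)
```

The returned frequencies play the role of an empirical process in this toy setting; larger values of `alpha` make the walk favour frequently visited neighbours more strongly.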
- Publication: arXiv e-prints
- Pub Date: July 2020
- DOI: 10.48550/arXiv.2007.03983
- arXiv: arXiv:2007.03983
- Bibcode: 2020arXiv200703983A
- Keywords: Mathematics - Optimization and Control; Computer Science - Machine Learning; Mathematics - Probability