Algorithmic models of human behavior and stochastic optimization
Abstract
In this article we describe the solutions of three problems posed at different time by Yurii Nesterov. First problem is 'Mage vs Experts': Assume that we live in some enviroment that characterized by unobservable convex function $f(x)$. Each day we can observe only the stochastic gradient of $f(x)$ (with heavy tails). We want to know where is a minimum of $f(x)$. What is better to live one long live or to live in parallel many short lives? The second problem is a variant of the previous one (we'd like to find the minimum of convex function $f(x)$) but we can observe only the (noise) value of the function at each day. The additional difficulties here is $f(x)$ isn't necessarily smooth and $x$ belongs to unit simplex. The third problem is to interprete equillibrium in Stable dynamic traffic assignment model (Nesterov-dePalma, 2000) as atractor of the best response dynamic. All these problems required proper generalizations of mirror descent (MD) algorithm. In this paper we collect different special versions of MD to solve the described problems.
- Publication:
-
arXiv e-prints
- Pub Date:
- November 2018
- DOI:
- arXiv:
- arXiv:1811.04905
- Bibcode:
- 2018arXiv181104905B
- Keywords:
-
- Mathematics - Optimization and Control
- E-Print:
- in Russian, 23 pages