Multi-scale exploration of convex functions and bandit convex optimization

doi:10.48550/arXiv.1507.06580

Multi-scale exploration of convex functions and bandit convex optimization

We construct a new map from a convex function to a distribution on its domain, with the property that this distribution is a multi-scale exploration of the function. We use this map to solve a decade-old open problem in adversarial bandit convex optimization by showing that the minimax regret for this problem is $\tilde{O}(\mathrm{poly}(n) \sqrt{T})$, where $n$ is the dimension and $T$ the number of rounds. This bound is obtained by studying the dual Bayesian maximin regret via the information ratio analysis of Russo and Van Roy, and then using the multi-scale exploration to solve the Bayesian problem.

Publication:

arXiv e-prints

Pub Date:

July 2015

DOI:

10.48550/arXiv.1507.06580

arXiv:

arXiv:1507.06580

Bibcode:

2015arXiv150706580B

Keywords:

Mathematics - Metric Geometry;
Computer Science - Machine Learning;
Mathematics - Optimization and Control;
Mathematics - Probability;
Statistics - Machine Learning

E-Print:

Preliminary version

NASA/ADS

Multi-scale exploration of convex functions and bandit convex optimization

Abstract