Derivative-Free & Order-Robust Optimisation

doi:10.48550/arXiv.1910.04034

In this paper, we formalise order-robust optimisation as an instance of online learning minimising simple regret, and propose Vroom, a zeroth-order optimisation algorithm capable of achieving vanishing regret in non-stationary environments while recovering favourable rates under stochastic reward-generating processes. Our results are the first to target simple regret definitions in adversarial scenarios, unveiling a challenge that has rarely been considered in prior work.

Publication:

arXiv e-prints

Pub Date:

October 2019

DOI:

10.48550/arXiv.1910.04034

arXiv:

arXiv:1910.04034

Bibcode:

2019arXiv191004034G

Keywords:

Computer Science - Machine Learning;
Statistics - Machine Learning
