Bandit optimisation of functions in the Matérn kernel RKHS

doi:10.48550/arXiv.2001.10396

Bandit optimisation of functions in the Matérn kernel RKHS

We consider the problem of optimising functions in the reproducing kernel Hilbert space (RKHS) of a Matérn kernel with smoothness parameter $\nu$ over the domain $[0,1]^d$ under noisy bandit feedback. Our contribution, the $\pi$-GP-UCB algorithm, is the first practical approach with guaranteed sublinear regret for all $\nu>1$ and $d \geq 1$. Empirical validation suggests better performance and drastically improved computational scalablity compared with its predecessor, Improved GP-UCB.

Publication:

arXiv e-prints

Pub Date:

January 2020

DOI:

10.48550/arXiv.2001.10396

arXiv:

arXiv:2001.10396

Bibcode:

2020arXiv200110396J

Keywords:

Computer Science - Machine Learning;
Statistics - Machine Learning

E-Print:

Included an errata highlighting an omission in the proof of lemma 1 and pointing to a fix in the author's thesis

NASA/ADS

Bandit optimisation of functions in the Matérn kernel RKHS

Abstract