Orthogonal Random Features: Explicit Forms and Sharp Inequalities
Abstract
Random features have been introduced to scale up kernel methods via randomization techniques. In particular, random Fourier features and orthogonal random features were used to approximate the popular Gaussian kernel. Random Fourier features are built in this case using a random Gaussian matrix. In this work, we analyze the bias and the variance of the kernel approximation based on orthogonal random features which makes use of Haar orthogonal matrices. We provide explicit expressions for these quantities using normalized Bessel functions, showing that orthogonal random features does not approximate the Gaussian kernel but a Bessel kernel. We also derive sharp exponential bounds supporting the view that orthogonal random features are less dispersed than random Fourier features.
- Publication:
-
arXiv e-prints
- Pub Date:
- October 2023
- DOI:
- arXiv:
- arXiv:2310.07370
- Bibcode:
- 2023arXiv231007370D
- Keywords:
-
- Computer Science - Machine Learning;
- Mathematics - Probability;
- Statistics - Machine Learning
- E-Print:
- Published in Transactions on Machine Learning Research (09/2024)