Spectral convergence of diffusion maps: improved error bounds and an alternative normalisation
Abstract
Diffusion maps is a manifold learning algorithm widely used for dimensionality reduction. Using a sample from a distribution, it approximates the eigenvalues and eigenfunctions of associated Laplace-Beltrami operators. Theoretical bounds on the approximation error are however generally much weaker than the rates that are seen in practice. This paper uses new approaches to improve the error bounds in the model case where the distribution is supported on a hypertorus. For the data sampling (variance) component of the error we make spatially localised compact embedding estimates on certain Hardy spaces; we study the deterministic (bias) component as a perturbation of the Laplace-Beltrami operator's associated PDE, and apply relevant spectral stability results. Using these approaches, we match long-standing pointwise error bounds for both the spectral data and the norm convergence of the operator discretisation. We also introduce an alternative normalisation for diffusion maps based on Sinkhorn weights. This normalisation approximates a Langevin diffusion on the sample and yields a symmetric operator approximation. We prove that it has better convergence compared with the standard normalisation on flat domains, and present a highly efficient algorithm to compute the Sinkhorn weights.
- Publication:
-
arXiv e-prints
- Pub Date:
- June 2020
- DOI:
- arXiv:
- arXiv:2006.02037
- Bibcode:
- 2020arXiv200602037W
- Keywords:
-
- Mathematics - Statistics Theory;
- Computer Science - Machine Learning;
- Mathematics - Numerical Analysis;
- Mathematics - Probability;
- Statistics - Machine Learning;
- 35P15;
- 60J60;
- 62M05;
- 65D99
- E-Print:
- Electronic copy of the final peer-reviewed manuscript accepted for publication