Statistical Analysis and Parameter Selection for Mapper
Abstract
In this article, we study the question of the statistical convergence of the 1-dimensional Mapper to its continuous analogue, the Reeb graph. We show that the Mapper is an optimal estimator of the Reeb graph, which gives, as a byproduct, a method to automatically tune its parameters and compute confidence regions on its topological features, such as its loops and flares. This allows to circumvent the issue of testing a large grid of parameters and keeping the most stable ones in the brute-force setting, which is widely used in visualization, clustering and feature selection with the Mapper.
- Publication:
-
arXiv e-prints
- Pub Date:
- June 2017
- DOI:
- 10.48550/arXiv.1706.00204
- arXiv:
- arXiv:1706.00204
- Bibcode:
- 2017arXiv170600204C
- Keywords:
-
- Computer Science - Computational Geometry;
- Mathematics - Algebraic Topology;
- Statistics - Methodology
- E-Print:
- Minor modifications