Persistence paths and signature features in topological data analysis
Abstract
We introduce a new feature map for barcodes that arise in persistent homology computation. The main idea is to first realize each barcode as a path in a convenient vector space and then to compute its path signature, which takes values in the tensor algebra of that vector space. The composition of these two operations (barcode to path, path to tensor series) results in a feature map that has several desirable properties for statistical learning, such as universality and characteristicness, and achieves state-of-the-art results on common classification benchmarks.
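The two-step construction (barcode to path, path to tensor series) can be illustrated with a minimal sketch. The embedding here is a hypothetical one chosen only for simplicity and is not claimed to be one of the paper's: it assumes finite bars and joins the (birth, death) points, sorted by birth, into a piecewise-linear path in the plane starting at the origin. The signature is truncated at tensor level 2, which is also an assumption of this sketch. The function names `barcode_to_path`, `signature_level2`, and `feature_vector` are illustrative.

```python
import numpy as np


def barcode_to_path(barcode):
    """Hypothetical embedding (an assumption, not the paper's construction):
    a piecewise-linear path in R^2 that starts at the origin and visits
    the (birth, death) points in sorted order. Bars must be finite."""
    bars = sorted(barcode)
    return np.array([(0.0, 0.0)] + [(b, d) for b, d in bars], dtype=float)


def signature_level2(path):
    """Signature of a piecewise-linear path, truncated at level 2.

    Each linear segment with increment `delta` has level-1 term `delta`
    and level-2 term outer(delta, delta) / 2. Segments are concatenated
    with Chen's identity.
    """
    dim = path.shape[1]
    s1 = np.zeros(dim)           # level 1: total increment so far
    s2 = np.zeros((dim, dim))    # level 2: iterated integrals so far
    for delta in np.diff(path, axis=0):
        # Chen's identity: cross term with the path so far, plus the
        # level-2 term of the new segment.
        s2 += np.outer(s1, delta) + 0.5 * np.outer(delta, delta)
        s1 += delta
    return s1, s2


def feature_vector(barcode):
    """Flatten the level-1 and level-2 signature terms into one vector."""
    s1, s2 = signature_level2(barcode_to_path(barcode))
    return np.concatenate([s1, s2.ravel()])


# Toy barcode with two finite bars, given as (birth, death) pairs.
features = feature_vector([(0.0, 1.0), (0.5, 2.0)])
```

For a path in R^2 the truncated feature vector has 2 + 4 = 6 entries. Higher truncation levels add further tensor terms and grow exponentially in the level.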
- Publication: arXiv e-prints
- Pub Date: June 2018
- DOI: 10.48550/arXiv.1806.00381
- arXiv: arXiv:1806.00381
- Bibcode: 2018arXiv180600381C
- Keywords: Statistics - Machine Learning; Computer Science - Machine Learning; Mathematics - Probability; Mathematics - Statistics Theory
- E-Print: Additional experiment and further details. To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence