A Divide-and-Conquer Approach to Persistent Homology

doi:10.48550/arXiv.2410.01839

Persistent homology is a tool from topological data analysis that has been used in a variety of settings to characterize holes of different dimensions in data. However, persistent homology computations can be memory intensive, and their computational complexity scales poorly as the data size grows. In this work, we propose a divide-and-conquer (DaC) method to mitigate these issues. The proposed algorithm partitions the data into sub-regions and uses a Vietoris-Rips filtration to efficiently find small, medium, and large-scale holes. Furthermore, we provide theoretical results that quantify the bottleneck distance between the DaC and true persistence diagrams, as well as the recovery probability of holes in the data. We empirically verify that the observed rate coincides with our theoretical rate, and find that DaC outperforms, in both memory and computational complexity, an alternative method that relies on a clustering preprocessing step to reduce the cost of the persistent homology computations. Finally, we test our algorithm on spatial data of the locations of lakes in Wisconsin, for which classical persistent homology is computationally infeasible.

Publication:

arXiv e-prints

Pub Date:

September 2024

DOI:

10.48550/arXiv.2410.01839

arXiv:

arXiv:2410.01839

Bibcode:

2024arXiv241001839L

Keywords:

Mathematics - Algebraic Topology;
Statistics - Methodology
