Predicting the generalization gap in neural networks using topological data analysis

doi:10.48550/arXiv.2203.12330

Understanding how neural networks generalize to unseen data is crucial for designing more robust and reliable models. In this paper, we study the generalization gap of neural networks using methods from topological data analysis. For this purpose, we compute homological persistence diagrams of weighted graphs constructed from neuron activation correlations after a training phase, aiming to capture patterns that are linked to the generalization capacity of the network. We compare the usefulness of different numerical summaries of persistence diagrams and show that a combination of some of them can accurately predict and partially explain the generalization gap without the need for a test set. Evaluation on two computer vision recognition tasks (CIFAR10 and SVHN) shows competitive generalization gap prediction when compared against state-of-the-art methods.
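
The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the edge weighting or the homology dimensions used, so the sketch assumes an edge weight of 1 - |Pearson correlation| between neuron activations and computes only 0-dimensional persistence (connected components), whose finite death times coincide with the edge weights of a minimum spanning tree. The names `pearson`, `h0_persistence`, and `summaries` are hypothetical, as are the two summaries chosen.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / math.sqrt(vx * vy)


def h0_persistence(activations):
    """Finite death times of 0-dimensional classes in the graph filtration.

    activations: one sequence of activation values per neuron.
    Assumed edge weight (not taken from the paper): 1 - |Pearson correlation|.
    Every vertex is born at 0; a component dies when an edge merges it
    into another one (Kruskal's algorithm with union-find).
    """
    n = len(activations)
    edges = sorted(
        (1.0 - abs(pearson(activations[i], activations[j])), i, j)
        for i, j in combinations(range(n), 2)
    )
    parent = list(range(n))

    def find(v):
        # Follow parent pointers to the root, halving the path on the way.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    deaths = []
    for weight, i, j in edges:
        ri, rj = find(i), find(j)
        if ri != rj:
            parent[ri] = rj
            deaths.append(weight)
    return deaths  # n - 1 finite bars; one component never dies


def summaries(deaths):
    """Two simple summaries of the finite bars (all births are 0)."""
    return {"average_life": sum(deaths) / len(deaths), "max_life": max(deaths)}


# Toy input: four "neurons" observed on four samples (made-up values).
toy_activations = [[0, 1, 2, 3], [0, 2, 4, 6], [3, 2, 1, 0], [1, 0, 1, 0]]
toy_deaths = h0_persistence(toy_activations)
toy_summary = summaries(toy_deaths)
```

In a setting like the one the abstract describes, summaries of this kind would be computed per trained network and passed to a regressor that predicts the generalization gap; that step is omitted here because the abstract does not specify it.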

Publication: arXiv e-prints

Pub Date: March 2022

DOI: 10.48550/arXiv.2203.12330

arXiv: arXiv:2203.12330

Bibcode: 2022arXiv220312330B

Keywords: Computer Science - Machine Learning; Mathematics - Algebraic Topology; 55N31; 68T07; I.2.6

E-Print: 24 pages, 7 figures. The Related Work section has been updated and the experiments have been executed anew including a 5x2-fold cross-validation scheme. Figure 4.3 has been crucially improved thanks to the discovery that the clusters of neural networks that appear in that figure correspond to different depths of the corresponding architectures

NASA/ADS
