Star-Shaped Space of Solutions of the Spherical Negative Perceptron
Abstract
Empirical studies on the landscape of neural networks have shown that low-energy configurations are often found in complex connected structures, where zero-energy paths between pairs of distant solutions can be constructed. Here, we consider the spherical negative perceptron, a prototypical nonconvex neural network model framed as a continuous constraint satisfaction problem. We introduce a general analytical method for computing energy barriers in the simplex with vertex configurations sampled from the equilibrium. We find that in the overparametrized regime the solution manifold displays simple connectivity properties. There exists a large geodesically convex component that is attractive for a wide range of optimization dynamics. Inside this region we identify a subset of atypical high-margin solutions that are geodesically connected with most other solutions, giving rise to a star-shaped geometry. We analytically characterize the organization of the connected space of solutions and show numerical evidence of a transition, at larger constraint densities, where the aforementioned simple geodesic connectivity breaks down.
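As a rough illustration of what "geodesically connected" means here, the sketch below counts violated constraints along the great-circle arc between two configurations. The abstract does not define the model, so the code assumes the usual convention: weights w on the sphere |w|^2 = N, P Gaussian patterns ξ^μ, and constraints ξ^μ · w / √N ≥ κ with κ < 0. All function names and parameter values are hypothetical, and `find_solution` is a plain projected gradient descent standing in for an optimizer, not one of the dynamics studied in the paper.

```python
import numpy as np


def margins(w, xi):
    """Stabilities xi^mu . w / sqrt(N), one per pattern (assumed convention)."""
    return xi @ w / np.sqrt(w.size)


def energy(w, xi, kappa):
    """Number of violated constraints, i.e. margins strictly below kappa."""
    return int(np.sum(margins(w, xi) < kappa))


def project(w):
    """Rescale w onto the sphere |w|^2 = N (undefined for the zero vector)."""
    return w * np.sqrt(w.size) / np.linalg.norm(w)


def geodesic_point(w1, w2, gamma):
    """Point on the great-circle arc: gamma=0 gives w2, gamma=1 gives w1.

    Normalizing the straight-line interpolation traces the geodesic,
    provided w1 and w2 are not antipodal.
    """
    return project(gamma * w1 + (1.0 - gamma) * w2)


def max_energy_on_geodesic(w1, w2, xi, kappa, n_steps=101):
    """Largest energy found on a grid of points along the arc."""
    gammas = np.linspace(0.0, 1.0, n_steps)
    return max(energy(geodesic_point(w1, w2, g), xi, kappa) for g in gammas)


def find_solution(xi, kappa, rng, lr=0.1, extra=0.1, max_iter=10_000):
    """Hypothetical solver: projected gradient descent on a squared hinge loss.

    Trains toward the slightly larger margin kappa + extra so that the
    constraint at kappa is met in finitely many steps.
    Returns None if no zero-energy configuration is reached.
    """
    n = xi.shape[1]
    w = project(rng.standard_normal(n))
    for _ in range(max_iter):
        if energy(w, xi, kappa) == 0:
            return w
        gap = np.maximum(kappa + extra - margins(w, xi), 0.0)
        w = project(w + lr * (gap @ xi) / np.sqrt(n))
    return None


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n, p, kappa = 200, 100, -0.5  # illustrative values, not from the paper
    xi = rng.standard_normal((p, n))
    w_a = find_solution(xi, kappa, rng)
    w_b = find_solution(xi, kappa, rng)
    if w_a is not None and w_b is not None:
        barrier = max_energy_on_geodesic(w_a, w_b, xi, kappa)
        print("max violated constraints on the geodesic:", barrier)
```

A value of zero on the grid suggests the two solutions are geodesically connected at this resolution; a finite grid can miss narrow barriers, so this is a numerical check rather than a proof.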
- Publication: Physical Review Letters
- Pub Date: December 2023
- DOI: 10.1103/PhysRevLett.131.227301
- arXiv: arXiv:2305.10623
- Bibcode: 2023PhRvL.131v7301A
- Keywords: Condensed Matter - Disordered Systems and Neural Networks; Computer Science - Machine Learning; Mathematics - Probability; Mathematics - Statistics Theory
- E-Print: 27 pages, 16 figures, comments are welcome