Phase transitions in soft-committee machines
Abstract
Equilibrium statistical physics is applied to the off-line training of layered neural networks with differentiable activation functions. A first analysis of soft-committee machines with an arbitrary number (K) of hidden units and continuous weights learning a perfectly matching rule is performed. Our results are exact in the limit of high training temperatures (β → 0). For K = 2 we find a second-order phase transition from unspecialized to specialized student configurations at a critical size P of the training set, whereas for K >= 3 the transition is first order. The limit K → ∞ can be performed analytically, the transition occurs after presenting on the order of NK/β examples. However, an unspecialized metastable state persists up to P propto NK2/β.
- Publication:
-
EPL (Europhysics Letters)
- Pub Date:
- October 1998
- DOI:
- arXiv:
- arXiv:cond-mat/9805182
- Bibcode:
- 1998EL.....44..261B
- Keywords:
-
- 87.10.+e;
- 07.05.Mh;
- 05.90.+m;
- General theory and mathematical aspects;
- Neural networks fuzzy logic artificial intelligence;
- Other topics in statistical physics thermodynamics and nonlinear dynamical systems;
- Condensed Matter - Disordered Systems and Neural Networks;
- Condensed Matter - Statistical Mechanics
- E-Print:
- 8 pages, 4 figures