Bounds on learning in polynomial time
Abstract
The performance of large neural networks can be judged not only by their storage capacity but also by the time required for learning. A polynomial learning algorithm with learning time $\sim N^2$ in a network with $N$ units might be practical whereas a learning time $\sim e^N$ would allow rather small networks only. The question of absolute storage capacity $\alpha_c$ and capacity for polynomial learning rules $\alpha_p$ is discussed for several feed-forward architectures, the perceptron, the binary perceptron, the committee machine and a perceptron with fixed weights in the first layer and adaptive weights in the second layer. The analysis is based partially on dynamic mean field theory which is valid for $N\to\infty$. Especially for the committee machine a value $\alpha_p$ considerably lower than the capacity predicted by replica theory or simulations is found. This discrepancy is resolved by new simulations investigating the learning time dependence and revealing subtleties in the definition of the capacity.
- Publication:
-
Philosophical Magazine, Part B
- Pub Date:
- May 1998
- DOI:
- 10.1080/13642819808205041
- arXiv:
- arXiv:cond-mat/9705259
- Bibcode:
- 1998PMagB..77.1495H
- Keywords:
-
- Condensed Matter - Disordered Systems and Neural Networks
- E-Print:
- 12 pages Latex with 8 eps figures