Generalization error in high-dimensional perceptrons: Approaching Bayes error with convex optimization

doi:10.48550/arXiv.2006.06560

We consider a commonly studied supervised classification task on a synthetic dataset whose labels are generated by feeding random iid inputs to a one-layer neural network. We study the generalization performance of standard classifiers in the high-dimensional regime where $\alpha=n/d$ is kept finite as both the dimension $d$ and the number of samples $n$ grow large. Our contribution is three-fold. First, we prove a formula for the generalization error achieved by $\ell_2$-regularized classifiers that minimize a convex loss. This formula was first obtained by the heuristic replica method of statistical physics. Second, focusing on commonly used loss functions and optimizing the $\ell_2$ regularization strength, we observe that while ridge regression performs poorly, logistic and hinge regression are surprisingly able to approach the Bayes-optimal generalization error extremely closely. As $\alpha \to \infty$ they achieve Bayes-optimal rates, a fact that does not follow from predictions of margin-based generalization error bounds. Third, we design an optimal loss and regularizer that provably lead to Bayes-optimal generalization error.
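
The setting described in the abstract can be illustrated with a small numerical sketch. It is not the authors' code, and several details are assumptions made only for illustration: the inputs and teacher weights are drawn as standard Gaussians, the one-layer network is taken to be a sign perceptron, and the names `make_teacher_dataset`, `ridge_fit`, `logistic_fit` and `generalization_error` are hypothetical. The logistic fit uses plain gradient descent with an arbitrary, untuned regularization strength, so the numbers it produces say nothing about the optimally regularized curves studied in the paper. Under the Gaussian-input assumption, the probability that a student $w$ and a teacher $w^\star$ disagree on a fresh sample is $\arccos(\rho)/\pi$, where $\rho$ is the cosine of the angle between them.

```python
import numpy as np


def make_teacher_dataset(n, d, rng):
    """Sample n Gaussian inputs in dimension d and label them with a sign teacher.

    Assumption: Gaussian inputs and Gaussian teacher weights (illustrative choice).
    """
    w_star = rng.standard_normal(d)
    X = rng.standard_normal((n, d)) / np.sqrt(d)  # rows have norm of order 1
    y = np.sign(X @ w_star)
    return X, y, w_star


def ridge_fit(X, y, lam):
    """Closed-form minimizer of 0.5*||y - Xw||^2 + 0.5*lam*||w||^2."""
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)


def logistic_fit(X, y, lam, steps=2000, lr=0.2):
    """Gradient descent on sum_i log(1 + exp(-y_i x_i.w)) + 0.5*lam*||w||^2."""
    w = np.zeros(X.shape[1])
    for _ in range(steps):
        margins = y * (X @ w)
        # sigmoid(-m) written with tanh to avoid overflow in exp
        sig_neg = 0.5 * (1.0 - np.tanh(0.5 * margins))
        grad = -(X.T @ (y * sig_neg)) + lam * w
        w -= lr * grad
    return w


def generalization_error(w, w_star):
    """Disagreement probability of sign(x.w) and sign(x.w_star) for Gaussian x."""
    rho = w @ w_star / (np.linalg.norm(w) * np.linalg.norm(w_star))
    return np.arccos(np.clip(rho, -1.0, 1.0)) / np.pi


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    d, alpha = 200, 3.0          # alpha = n / d is the sample ratio
    n = int(alpha * d)
    X, y, w_star = make_teacher_dataset(n, d, rng)
    lam = 0.1                    # arbitrary, not optimized
    print("ridge   :", generalization_error(ridge_fit(X, y, lam), w_star))
    print("logistic:", generalization_error(logistic_fit(X, y, lam), w_star))
```

To mimic the comparison described in the abstract, one would repeat this over a grid of `lam` values for each `alpha` and keep the best error per loss; the finite size `d = 200` means the results fluctuate around the high-dimensional predictions.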

Publication:

arXiv e-prints

Pub Date:

June 2020

DOI:

10.48550/arXiv.2006.06560

arXiv:

arXiv:2006.06560

Bibcode:

2020arXiv200606560A

Keywords:

Statistics - Machine Learning;
Condensed Matter - Disordered Systems and Neural Networks;
Computer Science - Machine Learning;
Mathematics - Statistics Theory

E-Print:

11 pages + 45 pages Supplementary Material / 5 figures, v2 revised and accepted at NeurIPS
