Simultaneous approximation of a smooth function and its derivatives by deep neural networks with piecewise-polynomial activations
Abstract
This paper investigates the approximation properties of deep neural networks with piecewise-polynomial activation functions. We derive the required depth, width, and sparsity of a deep neural network to approximate any Hölder smooth function up to a given approximation error in Hölder norms in such a way that all weights of this neural network are bounded by $1$. The latter feature is essential to control generalization errors in many statistical and machine learning applications.
- Publication:
-
arXiv e-prints
- Pub Date:
- June 2022
- DOI:
- arXiv:
- arXiv:2206.09527
- Bibcode:
- 2022arXiv220609527B
- Keywords:
-
- Mathematics - Numerical Analysis;
- Mathematics - Statistics Theory;
- Statistics - Machine Learning;
- 41A25;
- 41A15;
- 41A28;
- 68T07
- E-Print:
- 28 pages