Optimal Convergence Rates for Neural Operators

We introduce the neural tangent kernel (NTK) regime for two-layer neural operators and analyze their generalization properties. For early-stopped gradient descent (GD), we derive fast convergence rates that are known to be minimax optimal within the framework of non-parametric regression in reproducing kernel Hilbert spaces (RKHS). We provide bounds on the number of hidden neurons and the number of second-stage samples necessary for generalization. To justify our NTK regime, we additionally show that any operator approximable by a neural operator can also be approximated by an operator from the RKHS. A key application of neural operators is learning surrogate maps for the solution operators of partial differential equations (PDEs). We consider the standard Poisson equation to illustrate our theoretical findings with simulations.

Publication: arXiv e-prints

Pub Date: December 2024

DOI: 10.48550/arXiv.2412.17518

arXiv: arXiv:2412.17518

Bibcode: 2024arXiv241217518N

Keywords: Statistics - Machine Learning; Computer Science - Machine Learning
