Convergence Rate Analysis for Deep Ritz Method

doi:10.4208/cicp.OA-2021-0195

Convergence Rate Analysis for Deep Ritz Method

Using deep neural networks to solve PDEs has attracted a lot of attentions recently. However, why the deep learning method works is falling far behind its empirical success. In this paper, we provide a rigorous numerical analysis on deep Ritz method (DRM) \cite{wan11} for second order elliptic equations with Neumann boundary conditions. We establish the first nonasymptotic convergence rate in $H^1$ norm for DRM using deep networks with $\mathrm{ReLU}^2$ activation functions. In addition to providing a theoretical justification of DRM, our study also shed light on how to set the hyper-parameter of depth and width to achieve the desired convergence rate in terms of number of training samples. Technically, we derive bounds on the approximation error of deep $\mathrm{ReLU}^2$ network in $H^1$ norm and on the Rademacher complexity of the non-Lipschitz composition of gradient norm and $\mathrm{ReLU}^2$ network, both of which are of independent interest.

Publication:

Communications in Computational Physics

Pub Date:

June 2022

DOI:

10.4208/cicp.OA-2021-0195

arXiv:

arXiv:2103.13330

Bibcode:

2022CCoPh..31.1020D

Keywords:

Mathematics - Numerical Analysis

E-Print:

doi:10.4208/cicp.OA-2021-0195

NASA/ADS

Convergence Rate Analysis for Deep Ritz Method

Abstract