Metric Entropy Limits on Recurrent Neural Network Learning of Linear Dynamical Systems

doi:10.48550/arXiv.2105.02556

One of the most influential results in neural network theory is the universal approximation theorem [1, 2, 3], which states that continuous functions can be approximated to arbitrary accuracy by single-hidden-layer feedforward neural networks. This paper establishes a result in the same spirit for the approximation of general discrete-time linear dynamical systems, including time-varying systems, by recurrent neural networks (RNNs). For the subclass of linear time-invariant (LTI) systems, we devise a quantitative version of this statement. Specifically, measuring the complexity of the considered class of LTI systems through metric entropy according to [4], we show that RNNs can optimally learn (or, in system-theory parlance, identify) stable LTI systems. For LTI systems whose input-output relation is characterized by a difference equation, this means that RNNs can learn the difference equation from input-output traces in a metric-entropy-optimal manner.
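To make the link between a difference equation and a recurrence concrete, the following minimal sketch simulates a stable LTI system in two ways: directly from its difference equation, and as a recurrent cell whose hidden state stores recent outputs and inputs. It is an illustration only and not the construction used in the paper: the cell uses an identity activation, the coefficient convention y[t] = sum_k a[k] y[t-1-k] + sum_k b[k] x[t-k] is an assumption, and both function names are hypothetical.

```python
def simulate_difference_equation(a, b, x):
    """Evaluate y[t] = sum_k a[k]*y[t-1-k] + sum_k b[k]*x[t-k]
    with zero initial conditions (assumed convention)."""
    y = []
    for t in range(len(x)):
        acc = 0.0
        for k, ak in enumerate(a):
            if t - 1 - k >= 0:
                acc += ak * y[t - 1 - k]
        for k, bk in enumerate(b):
            if t - k >= 0:
                acc += bk * x[t - k]
        y.append(acc)
    return y


def simulate_linear_recurrence(a, b, x):
    """Compute the same input-output map with a recurrent update.

    The hidden state consists of the last len(a) outputs and the last
    len(b) - 1 inputs; the activation is the identity (an assumption
    made for this sketch).
    """
    past_y = [0.0] * len(a)          # y[t-1], y[t-2], ...
    past_x = [0.0] * (len(b) - 1)    # x[t-1], x[t-2], ...
    out = []
    for xt in x:
        window = [xt] + past_x       # x[t], x[t-1], ...
        yt = sum(ak * yk for ak, yk in zip(a, past_y))
        yt += sum(bk * xk for bk, xk in zip(b, window))
        out.append(yt)
        # Shift the state: newest values enter at the front.
        past_y = ([yt] + past_y)[:len(a)]
        past_x = window[:len(b) - 1]
    return out
```

For the first-order system y[t] = 0.5 y[t-1] + x[t], both functions return the geometrically decaying impulse response, which is what stability of the system implies.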

Publication: arXiv e-prints

Pub Date: May 2021

DOI: 10.48550/arXiv.2105.02556

arXiv: arXiv:2105.02556

Bibcode: 2021arXiv210502556H

Keywords: Computer Science - Machine Learning; Computer Science - Information Theory; Mathematics - Dynamical Systems

E-Print: 28 pages
