Training Input-Output Recurrent Neural Networks through Spectral Methods

doi:10.48550/arXiv.1603.00954

Training Input-Output Recurrent Neural Networks through Spectral Methods

We consider the problem of training input-output recurrent neural networks (RNN) for sequence labeling tasks. We propose a novel spectral approach for learning the network parameters. It is based on decomposition of the cross-moment tensor between the output and a non-linear transformation of the input, based on score functions. We guarantee consistent learning with polynomial sample and computational complexity under transparent conditions such as non-degeneracy of model parameters, polynomial activations for the neurons, and a Markovian evolution of the input sequence. We also extend our results to Bidirectional RNN which uses both previous and future information to output the label at each time point, and is employed in many NLP tasks such as POS tagging.

Publication:

arXiv e-prints

Pub Date:

March 2016

DOI:

10.48550/arXiv.1603.00954

arXiv:

arXiv:1603.00954

Bibcode:

2016arXiv160300954S

Keywords:

Computer Science - Machine Learning;
Computer Science - Neural and Evolutionary Computing;
Statistics - Machine Learning

NASA/ADS

Training Input-Output Recurrent Neural Networks through Spectral Methods

Abstract