Learning to Transduce with Unbounded Memory
Abstract
Recently, strong results have been demonstrated by Deep Recurrent Neural Networks on natural language transduction problems. In this paper we explore the representational power of these models using synthetic grammars designed to exhibit phenomena similar to those found in real transduction problems such as machine translation. These experiments lead us to propose new memory-based recurrent networks that implement continuously differentiable analogues of traditional data structures such as Stacks, Queues, and DeQues. We show that these architectures exhibit superior generalisation performance to Deep RNNs and are often able to learn the underlying generating algorithms in our transduction experiments.
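To make "continuously differentiable analogue of a stack" concrete, here is a minimal pure-Python sketch of one update step. It is an illustration under stated assumptions, not the authors' implementation. Each stored vector carries a strength in [0, 1], a pop removes strength from the top downwards, and a read blends the topmost vectors until one unit of strength is used. The name `stack_step` and its arguments are hypothetical. In a trained model the pushed vector and the push and pop signals would be produced by a recurrent controller and computed with differentiable tensor operations.

```python
def stack_step(values, strengths, v, push, pop):
    """One update of a continuous stack (illustrative sketch).

    values    : list of previously pushed vectors (lists of floats), bottom first
    strengths : list of floats in [0, 1], one per stored vector
    v         : vector to push (list of floats)
    push, pop : scalars in [0, 1]
    Returns (new_values, new_strengths, read_vector).
    """
    # Pop: remove up to `pop` units of strength, starting at the top.
    new_strengths = []
    remaining = pop
    for s in reversed(strengths):
        new_strengths.append(max(0.0, s - max(0.0, remaining)))
        remaining -= s  # strength above the next entry absorbs part of the pop
    new_strengths.reverse()

    # Push: append the new vector with strength `push`.
    new_values = values + [v]
    new_strengths.append(push)

    # Read: blend vectors from the top down until one unit of strength is used.
    read = [0.0] * len(v)
    budget = 1.0
    for vec, s in zip(reversed(new_values), reversed(new_strengths)):
        w = min(s, max(0.0, budget))
        read = [r + w * x for r, x in zip(read, vec)]
        budget -= s
    return new_values, new_strengths, read


# Usage: push [1, 0] fully, push [0, 1] at half strength, then pop one unit.
V, s, r1 = stack_step([], [], [1.0, 0.0], push=1.0, pop=0.0)
V, s, r2 = stack_step(V, s, [0.0, 1.0], push=0.5, pop=0.0)
V, s, r3 = stack_step(V, s, [0.0, 0.0], push=0.0, pop=1.0)
```

With fractional push and pop values the read is a weighted mixture of stored vectors rather than a single element. This is what lets gradients flow through the data structure. A queue or deque analogue would read from, or remove strength at, the other end in the same manner.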
- Publication: arXiv e-prints
- Pub Date: June 2015
- DOI: 10.48550/arXiv.1506.02516
- arXiv: arXiv:1506.02516
- Bibcode: 2015arXiv150602516G
- Keywords: Computer Science - Neural and Evolutionary Computing; Computer Science - Computation and Language; Computer Science - Machine Learning; 68T05; I.5.1; I.2.6; I.2.7
- E-Print: 14 pages, 4 figures, NIPS 2015