Tree-to-Sequence Attentional Neural Machine Translation
Abstract
Most of the existing Neural Machine Translation (NMT) models focus on the conversion of sequential data and do not directly use syntactic information. We propose a novel end-to-end syntactic NMT model, extending a sequence-to-sequence model with the source-side phrase structure. Our model has an attention mechanism that enables the decoder to generate a translated word while softly aligning it with phrases as well as words of the source sentence. Experimental results on the WAT'15 English-to-Japanese dataset demonstrate that our proposed model considerably outperforms sequence-to-sequence attentional NMT models and compares favorably with the state-of-the-art tree-to-string SMT system.
- Publication:
-
arXiv e-prints
- Pub Date:
- March 2016
- DOI:
- 10.48550/arXiv.1603.06075
- arXiv:
- arXiv:1603.06075
- Bibcode:
- 2016arXiv160306075E
- Keywords:
-
- Computer Science - Computation and Language
- E-Print:
- Accepted as a full paper at the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016)