Tree-to-Sequence Attentional Neural Machine Translation

Most of the existing Neural Machine Translation (NMT) models focus on the conversion of sequential data and do not directly use syntactic information. We propose a novel end-to-end syntactic NMT model, extending a sequence-to-sequence model with the source-side phrase structure. Our model has an attention mechanism that enables the decoder to generate a translated word while softly aligning it with phrases as well as words of the source sentence. Experimental results on the WAT'15 English-to-Japanese dataset demonstrate that our proposed model considerably outperforms sequence-to-sequence attentional NMT models and compares favorably with the state-of-the-art tree-to-string SMT system.

Publication: arXiv e-prints

Pub Date: March 2016

DOI: 10.48550/arXiv.1603.06075

arXiv: arXiv:1603.06075

Bibcode: 2016arXiv160306075E

Keywords: Computer Science - Computation and Language

E-Print: Accepted as a full paper at the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016)
