Neural Networks Classifier for Data Selection in Statistical Machine Translation

doi:10.48550/arXiv.1612.05555

We address the data selection problem in statistical machine translation (SMT) as a classification task. The proposed data selection method is based on a neural network classifier. We describe the method and present empirical results showing that it yields better translation quality than a state-of-the-art method (cross-entropy-based selection). Moreover, the reported results are consistent across different language pairs.

Publication: arXiv e-prints

Pub Date: December 2016

DOI: 10.48550/arXiv.1612.05555

arXiv: arXiv:1612.05555

Bibcode: 2016arXiv161205555P

Keywords: Computer Science - Computation and Language

E-Print: Submitted to EACL'17
