Neural Networks Classifier for Data Selection in Statistical Machine Translation
Abstract
We cast the data selection problem in statistical machine translation (SMT) as a classification task and propose a selection method based on a neural network classifier. We describe the method and report empirical results showing that it yields better translation quality than a state-of-the-art method (i.e., cross-entropy). Moreover, the reported results are consistent across different language pairs.
- Publication: arXiv e-prints
- Pub Date: December 2016
- DOI: 10.48550/arXiv.1612.05555
- arXiv: arXiv:1612.05555
- Bibcode: 2016arXiv161205555P
- Keywords: Computer Science - Computation and Language
- E-Print: Submitted to EACL'17