Monotonic Simultaneous Translation with Chunk-wise Reordering and Refinement

doi:10.48550/arXiv.2110.09646

Recent simultaneous machine translation models are often trained on conventional full-sentence translation corpora. When the word orders of a language pair differ significantly, this leads to either excessive latency or the need to anticipate words that have not yet arrived. Human simultaneous interpreters, by contrast, produce largely monotonic translations at the expense of the grammaticality of the translated sentence. In this paper, we therefore propose an algorithm that reorders and refines the target side of a full-sentence translation corpus, using word alignment and non-autoregressive neural machine translation, so that words and phrases in the source and target sentences are aligned largely monotonically. We then train a widely used wait-k simultaneous translation model on this reordered-and-refined corpus. The proposed approach improves BLEU scores, and the resulting translations are more monotonic with respect to the source sentences.
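As a point of reference for the wait-k model mentioned above, the following is a minimal sketch of the standard wait-k read/write schedule, not the paper's reordering algorithm. It assumes the usual definition in which the decoder reads k source tokens before writing the first target token and then alternates one read with one write until the source is exhausted. The function name `wait_k_schedule` and its parameters are hypothetical, chosen for illustration only.

```python
def wait_k_schedule(source_len: int, target_len: int, k: int) -> list[int]:
    """Number of source tokens read before each target token is written.

    Entry t-1 of the result corresponds to target position t (1-indexed):
    the decoder has read min(k + t - 1, source_len) source tokens.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    # Once the whole source has been read, the count saturates at source_len.
    return [min(k + t - 1, source_len) for t in range(1, target_len + 1)]


if __name__ == "__main__":
    # A 5-token source, a 6-token target, and k = 2.
    print(wait_k_schedule(source_len=5, target_len=6, k=2))
```

Under this schedule a small k keeps latency low but forces the model to write target words before the corresponding source words arrive whenever the reference translation is not monotonic, which is the mismatch the reordered-and-refined corpus is meant to reduce.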

Publication: arXiv e-prints
Pub Date: October 2021
DOI: 10.48550/arXiv.2110.09646
arXiv: arXiv:2110.09646
Bibcode: 2021arXiv211009646H
Keywords: Computer Science - Computation and Language; Computer Science - Artificial Intelligence; Computer Science - Machine Learning
E-Print: To be published in WMT2021
