Automatic Detection of Omissions in Translations
Abstract
ADOMIT is an algorithm for Automatic Detection of OMIssions in Translations. The algorithm relies solely on geometric analysis of bitext maps and uses no linguistic information. This property allows it to deal equally well with omissions that do not correspond to linguistic units, such as might result from word-processing mishaps. ADOMIT has proven itself by discovering many errors in a hand-constructed gold standard for evaluating bitext mapping algorithms. Quantitative evaluation on simulated omissions showed that, even with today's poor bitext mapping technology, ADOMIT is a valuable quality control tool for translators and translation bureaus.
- Publication:
-
arXiv e-prints
- Pub Date:
- September 1996
- DOI:
- 10.48550/arXiv.cmp-lg/9609010
- arXiv:
- arXiv:cmp-lg/9609010
- Bibcode:
- 1996cmp.lg....9010M
- Keywords:
-
- Computer Science - Computation and Language
- E-Print:
- 6 pages, minor revisions on Sept. 30, 1996