Neural Sign Language Translation by Learning Tokenization
Abstract
Sign Language Translation has attained considerable success recently, raising hopes for improved communication with the Deaf. A pre-processing step called tokenization improves the success of translations. Tokens can be learned from sign videos if supervised data is available. However, data annotation at the gloss level is costly, and annotated data is scarce. The paper utilizes Adversarial, Multitask, and Transfer Learning to search for semi-supervised tokenization approaches without the burden of additional labeling. Extensive experiments compare all the methods in different settings and provide a deeper analysis. In the case of no additional target annotation besides sentences, the proposed methodology attains 13.25 BLEU-4 and 36.28 ROUGE scores, improving the current state-of-the-art by 4 points in BLEU-4 and 5 points in ROUGE.
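For context on the reported numbers, BLEU-4 and ROUGE are standard corpus-level translation metrics. The sketch below shows one common way to compute them, assuming the `sacrebleu` and `rouge-score` Python packages and placeholder hypothesis/reference sentences; it illustrates the metrics only and is not the paper's evaluation code.

```python
# Minimal sketch: scoring translation hypotheses with BLEU-4 and ROUGE-L.
# Assumes the `sacrebleu` and `rouge-score` packages; the sentences below
# are placeholders, not data from the paper.
import sacrebleu
from rouge_score import rouge_scorer

hypotheses = ["the weather will be sunny tomorrow"]   # model outputs
references = ["tomorrow the weather will be sunny"]   # ground-truth sentences

# Corpus-level BLEU (up to 4-gram precision by default, i.e. BLEU-4).
bleu = sacrebleu.corpus_bleu(hypotheses, [references])
print(f"BLEU-4: {bleu.score:.2f}")

# Sentence-level ROUGE-L F-measure, averaged over the corpus.
scorer = rouge_scorer.RougeScorer(["rougeL"], use_stemmer=True)
rouge_l = sum(
    scorer.score(ref, hyp)["rougeL"].fmeasure
    for ref, hyp in zip(references, hypotheses)
) / len(hypotheses)
print(f"ROUGE-L F1: {100 * rouge_l:.2f}")
```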
- Publication:
- arXiv e-prints
- Pub Date:
- February 2020
- DOI:
- 10.48550/arXiv.2002.00479
- arXiv:
- arXiv:2002.00479
- Bibcode:
- 2020arXiv200200479O
- Keywords:
- Computer Science - Computer Vision and Pattern Recognition
- E-Print:
- 8 pages, 2 figures, FG 2020