Improving Text Auto-Completion with Next Phrase Prediction

doi:10.48550/arXiv.2109.07067

Improving Text Auto-Completion with Next Phrase Prediction

Language models such as GPT-2 have performed well on constructing syntactically sound sentences for text auto-completion task. However, such models often require considerable training effort to adapt to specific writing domains (e.g., medical). In this paper, we propose an intermediate training strategy to enhance pre-trained language models' performance in the text auto-completion task and fastly adapt them to specific domains. Our strategy includes a novel self-supervised training objective called Next Phrase Prediction (NPP), which encourages a language model to complete the partial query with enriched phrases and eventually improve the model's text auto-completion performance. Preliminary experiments have shown that our approach is able to outperform the baselines in auto-completion for email and academic writing domains.

Publication:

arXiv e-prints

Pub Date:

September 2021

DOI:

10.48550/arXiv.2109.07067

arXiv:

arXiv:2109.07067

Bibcode:

2021arXiv210907067L

Keywords:

Computer Science - Computation and Language

E-Print:

4 pages, 2 figures, 4 tables, Accepted in EMNLP 2021-Findings

NASA/ADS

Improving Text Auto-Completion with Next Phrase Prediction

Abstract