Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences

doi:10.48550/arXiv.2009.11795

Domain adaptation or transfer learning using pre-trained language models such as BERT has proven to be an effective approach for many natural language processing tasks. In this work, we propose to formulate word sense disambiguation (WSD) as a relevance ranking task, and fine-tune BERT on a sequence-pair ranking task to select the most probable sense definition given a context sentence and a list of candidate sense definitions. We also introduce a data augmentation technique for WSD using existing example sentences from WordNet. Using the proposed training objective and data augmentation technique, our models achieve state-of-the-art results on the English all-words benchmark datasets.
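
To make the ranking formulation concrete, the following is a minimal sketch of gloss selection: a context sentence is paired with each candidate sense definition, every pair receives a relevance score, and the scores are normalized across the candidate list. Everything named here is an assumption for illustration: the toy sense inventory `CANDIDATE_GLOSSES` stands in for WordNet, `toy_score` (word overlap) stands in for a fine-tuned BERT sequence-pair scorer, and the softmax cross-entropy in `gloss_selection_loss` is one common way to train a ranker, not necessarily the exact loss used in the paper.

```python
import math

# Hypothetical toy sense inventory; a real system would read glosses from WordNet.
CANDIDATE_GLOSSES = {
    "bank": [
        "sloping land beside a body of water",
        "a financial institution that accepts deposits",
    ],
}


def build_pairs(context, target):
    """Pair the context sentence with every candidate gloss of the target word."""
    return [(context, gloss) for gloss in CANDIDATE_GLOSSES[target]]


def toy_score(context, gloss):
    """Placeholder relevance score (shared-word count), NOT a BERT model."""
    return float(len(set(context.lower().split()) & set(gloss.lower().split())))


def softmax(scores):
    """Normalize scores across the whole candidate list."""
    highest = max(scores)
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def gloss_selection_loss(scores, gold_index):
    """Cross-entropy of the gold gloss under a softmax over all candidates."""
    return -math.log(softmax(scores)[gold_index])


def predict(context, target):
    """Return the index of the highest-scoring gloss and the raw scores."""
    pairs = build_pairs(context, target)
    scores = [toy_score(c, g) for c, g in pairs]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores


best, scores = predict("he deposits cash at the financial bank", "bank")
print(best, softmax(scores), gloss_selection_loss(scores, gold_index=1))
```

The point of the sketch is that the candidates compete with one another: the loss depends on how the gold gloss scores relative to the other glosses of the same word, rather than on an independent yes/no decision per context-gloss pair.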

Publication:

arXiv e-prints

Pub Date:

September 2020

DOI:

10.48550/arXiv.2009.11795

arXiv:

arXiv:2009.11795

Bibcode:

2020arXiv200911795Y

Keywords:

Computer Science - Computation and Language;
Computer Science - Machine Learning

E-Print:

Accepted to appear in Findings of EMNLP 2020
