AutoExtend: Extending Word Embeddings to Embeddings for Synsets and Lexemes
Abstract
We present \textit{AutoExtend}, a system to learn embeddings for synsets and lexemes. It is flexible in that it can take any word embeddings as input and does not need an additional training corpus. The synset/lexeme embeddings obtained live in the same vector space as the word embeddings. A sparse tensor formalization guarantees efficiency and parallelizability. We use WordNet as a lexical resource, but AutoExtend can be easily applied to other resources like Freebase. AutoExtend achieves state-of-the-art performance on word similarity and word sense disambiguation tasks.
- Publication:
-
arXiv e-prints
- Pub Date:
- July 2015
- DOI:
- 10.48550/arXiv.1507.01127
- arXiv:
- arXiv:1507.01127
- Bibcode:
- 2015arXiv150701127R
- Keywords:
-
- Computer Science - Computation and Language
- E-Print:
- doi:10.3115/v1/P15-1173