Word Embeddings for Chemical Patent Natural Language Processing
Abstract
We evaluate chemical patent word embeddings against known biomedical embeddings and show that they outperform the latter extrinsically and intrinsically. We also show that using contextualized embeddings can induce predictive models of reasonable performance for this domain over a relatively small gold standard.
- Publication:
-
arXiv e-prints
- Pub Date:
- October 2020
- DOI:
- 10.48550/arXiv.2010.12912
- arXiv:
- arXiv:2010.12912
- Bibcode:
- 2020arXiv201012912T
- Keywords:
-
- Computer Science - Computation and Language;
- Computer Science - Machine Learning;
- 68T50;
- I.2.7;
- J.3;
- C.4
- E-Print:
- Extended version of an extended abstract presented (and reviewed) at the Latinx Workshop at ICML 2020