Word Embeddings for Chemical Patent Natural Language Processing

doi:10.48550/arXiv.2010.12912

Word Embeddings for Chemical Patent Natural Language Processing

We evaluate chemical patent word embeddings against known biomedical embeddings and show that they outperform the latter extrinsically and intrinsically. We also show that using contextualized embeddings can induce predictive models of reasonable performance for this domain over a relatively small gold standard.

Publication:

arXiv e-prints

Pub Date:

October 2020

DOI:

10.48550/arXiv.2010.12912

arXiv:

arXiv:2010.12912

Bibcode:

2020arXiv201012912T

Keywords:

Computer Science - Computation and Language;
Computer Science - Machine Learning;
68T50;
I.2.7;
J.3;
C.4

E-Print:

Extended version of an extended abstract presented (and reviewed) at the Latinx Workshop at ICML 2020

NASA/ADS

Word Embeddings for Chemical Patent Natural Language Processing

Abstract