Automatic Extraction of Medication Names in Tweets as Named Entity Recognition

doi:10.48550/arXiv.2111.15641

Social media posts contain potentially valuable information about medical conditions and health-related behavior. BioCreative VII Task 3 focuses on mining this information by recognizing mentions of medications and dietary supplements in tweets. We approach this task by fine-tuning multiple BERT-style language models to perform token-level classification and combining them into ensembles to generate the final predictions. Our best system is an ensemble of five Megatron-BERT-345M models and achieves a strict F1 score of 0.764 on unseen test data.
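The abstract does not say how the individual models are combined, so the following is only a minimal sketch of one common option: averaging each model's per-token label probabilities, taking the most likely tag per token, and scoring exact span matches. The BIO tag set `LABELS`, the helper names, and the toy probabilities are all hypothetical and do not come from the paper.

```python
from typing import List, Tuple

# Hypothetical BIO tag set for medication mentions (an assumption, not from the paper).
LABELS = ["O", "B-DRUG", "I-DRUG"]

Probs = List[List[float]]  # one row per token, one column per label


def average_probs(model_probs: List[Probs]) -> Probs:
    """Average per-token label probabilities across models (assumed ensembling rule)."""
    n_models = len(model_probs)
    n_tokens = len(model_probs[0])
    return [
        [sum(m[t][k] for m in model_probs) / n_models for k in range(len(LABELS))]
        for t in range(n_tokens)
    ]


def decode_tags(probs: Probs) -> List[str]:
    """Pick the highest-probability label for every token."""
    return [LABELS[max(range(len(row)), key=row.__getitem__)] for row in probs]


def tags_to_spans(tags: List[str]) -> List[Tuple[int, int]]:
    """Convert BIO tags to (start, end) token spans, with end exclusive."""
    spans, start = [], None
    for i, tag in enumerate(tags):
        if tag == "B-DRUG":
            if start is not None:
                spans.append((start, i))
            start = i
        elif tag == "I-DRUG":
            if start is None:  # tolerate an I- tag without a preceding B- tag
                start = i
        else:
            if start is not None:
                spans.append((start, i))
                start = None
    if start is not None:
        spans.append((start, len(tags)))
    return spans


def strict_f1(gold: List[Tuple[int, int]], pred: List[Tuple[int, int]]) -> float:
    """F1 where a predicted span counts only if both boundaries match exactly."""
    gold_set, pred_set = set(gold), set(pred)
    tp = len(gold_set & pred_set)
    if tp == 0:
        return 0.0
    precision, recall = tp / len(pred_set), tp / len(gold_set)
    return 2 * precision * recall / (precision + recall)


# Toy example: tokens of "took vitamin d today", two hypothetical models.
model_a = [[0.9, 0.1, 0.0], [0.2, 0.7, 0.1], [0.6, 0.1, 0.3], [0.9, 0.05, 0.05]]
model_b = [[0.8, 0.1, 0.1], [0.1, 0.8, 0.1], [0.2, 0.1, 0.7], [0.9, 0.05, 0.05]]
gold_spans = [(1, 3)]  # "vitamin d"

ensemble_tags = decode_tags(average_probs([model_a, model_b]))
ensemble_spans = tags_to_spans(ensemble_tags)
single_spans = tags_to_spans(decode_tags(model_a))
```

In this toy case the first model alone truncates the mention to "vitamin", which earns no credit under strict matching, while the averaged prediction recovers the full span. Majority voting over per-model tags would be an equally plausible combination rule.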

Publication:

arXiv e-prints

Pub Date:

November 2021

DOI:

10.48550/arXiv.2111.15641

arXiv:

arXiv:2111.15641

Bibcode:

2021arXiv211115641A

Keywords:

Computer Science - Computation and Language

E-Print:

Submission to the BioCreative VII challenge - Track-3
