Hate Speech Detection from Code-mixed Hindi-English Tweets Using Deep Learning Models
Abstract
This paper reports an improvement over the state of the art in hate speech detection for English-Hindi code-mixed tweets. We compare three typical deep learning models using domain-specific embeddings. In experiments on a benchmark dataset of English-Hindi code-mixed tweets, we observe that domain-specific embeddings yield an improved representation of target groups and an improved F-score.
- Publication: arXiv e-prints
- Pub Date: November 2018
- DOI: 10.48550/arXiv.1811.05145
- arXiv: arXiv:1811.05145
- Bibcode: 2018arXiv181105145K
- Keywords: Computer Science - Computation and Language
- E-Print: This paper will appear at the 15th International Conference on Natural Language Processing (ICON-2018) in India in December 2018. ICON is a premier NLP conference in India.