Finding Social Media Trolls: Dynamic Keyword Selection Methods for Rapidly-Evolving Online Debates
Abstract
Online harassment is a significant social problem. Prevention of online harassment requires rapid detection of harassing, offensive, and negative social media posts. In this paper, we propose the use of word embedding models to identify offensive and harassing social media messages in two aspects: detecting fast-changing topics for more effective data collection and representing word semantics in different domains. We demonstrate with preliminary results that using the GloVe (Global Vectors for Word Representation) model facilitates the discovery of new and relevant keywords to use for data collection and trolling detection. Our paper concludes with a discussion of a research agenda to further develop and test word embedding models for identification of social media harassment and trolling.
- Publication:
-
arXiv e-prints
- Pub Date:
- November 2019
- DOI:
- 10.48550/arXiv.1911.05332
- arXiv:
- arXiv:1911.05332
- Bibcode:
- 2019arXiv191105332L
- Keywords:
-
- Computer Science - Machine Learning;
- Computer Science - Computers and Society;
- Computer Science - Social and Information Networks;
- Statistics - Machine Learning
- E-Print:
- AI for Social Good workshop at NeurIPS (2019)