Finding Social Media Trolls: Dynamic Keyword Selection Methods for Rapidly-Evolving Online Debates

doi:10.48550/arXiv.1911.05332

Online harassment is a significant social problem. Preventing it requires rapid detection of harassing, offensive, and negative social media posts. In this paper, we propose using word embedding models to identify offensive and harassing social media messages in two ways: detecting fast-changing topics for more effective data collection, and representing word semantics in different domains. Preliminary results show that the GloVe (Global Vectors for Word Representation) model facilitates the discovery of new and relevant keywords for data collection and trolling detection. The paper concludes with a research agenda for further developing and testing word embedding models to identify social media harassment and trolling.
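The keyword-discovery idea in the abstract can be illustrated with a nearest-neighbor search in embedding space. The following is a minimal sketch only, not the authors' method: the vectors are made-up three-dimensional stand-ins for pretrained GloVe embeddings, and the names `TOY_VECTORS`, `cosine`, and `expand_keywords` are hypothetical. It assumes that candidate keywords are ranked by cosine similarity to the average of the seed keyword vectors.

```python
import math

# Made-up 3-dimensional vectors standing in for pretrained GloVe embeddings
# (an assumption for illustration; real GloVe vectors are much larger).
TOY_VECTORS = {
    "troll": [0.9, 0.1, 0.0],
    "harass": [0.8, 0.2, 0.1],
    "bully": [0.85, 0.15, 0.05],
    "insult": [0.7, 0.3, 0.0],
    "weather": [0.0, 0.1, 0.9],
    "recipe": [0.1, 0.0, 0.8],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def expand_keywords(seeds, vectors, top_k=2):
    """Return the top_k non-seed words closest to the mean seed vector."""
    known = [vectors[s] for s in seeds if s in vectors]
    if not known:
        return []
    dim = len(known[0])
    # Average the seed vectors to get one query point in embedding space.
    centroid = [sum(v[i] for v in known) / len(known) for i in range(dim)]
    scored = [
        (word, cosine(centroid, vec))
        for word, vec in vectors.items()
        if word not in seeds
    ]
    # Highest similarity first.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [word for word, _ in scored[:top_k]]


# Candidate keywords to add to the data-collection query.
new_keywords = expand_keywords(["troll", "harass"], TOY_VECTORS)
print(new_keywords)
```

With these toy vectors the two harassment-related words rank above the unrelated ones. In a setting with fast-changing topics, the same ranking would presumably be rerun as embeddings are retrained on newly collected posts, so that the keyword list tracks the debate.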

Publication: arXiv e-prints

Pub Date: November 2019

DOI: 10.48550/arXiv.1911.05332

arXiv: arXiv:1911.05332

Bibcode: 2019arXiv191105332L

Keywords: Computer Science - Machine Learning; Computer Science - Computers and Society; Computer Science - Social and Information Networks; Statistics - Machine Learning

E-Print: AI for Social Good workshop at NeurIPS (2019)

