Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media

doi:10.48550/arXiv.2307.09312

Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media

We present the Multi-Modal Discussion Transformer (mDT), a novel methodfor detecting hate speech in online social networks such as Reddit discussions. In contrast to traditional comment-only methods, our approach to labelling a comment as hate speech involves a holistic analysis of text and images grounded in the discussion context. This is done by leveraging graph transformers to capture the contextual relationships in the discussion surrounding a comment and grounding the interwoven fusion layers that combine text and image embeddings instead of processing modalities separately. To evaluate our work, we present a new dataset, HatefulDiscussions, comprising complete multi-modal discussions from multiple online communities on Reddit. We compare the performance of our model to baselines that only process individual comments and conduct extensive ablation studies.

Publication:

arXiv e-prints

Pub Date:

July 2023

DOI:

10.48550/arXiv.2307.09312

arXiv:

arXiv:2307.09312

Bibcode:

2023arXiv230709312H

Keywords:

Computer Science - Computation and Language;
Computer Science - Machine Learning;
Computer Science - Multimedia;
Computer Science - Social and Information Networks

E-Print:

Accepted to AAAI 2024 (AI for Social Impact Track)

NASA/ADS

Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media

Abstract