Detection and Discovery of Misinformation Sources using Attributed Webgraphs
Abstract
Website reliability labels underpin almost all research in misinformation detection. However, misinformation sources often exhibit transient behavior, which makes many such labeled lists obsolete over time. We demonstrate that Search Engine Optimization (SEO) attributes provide strong signals for predicting news site reliability. We introduce a novel attributed webgraph dataset with labeled news domains and their connections to outlinking and backlinking domains. We demonstrate the success of graph neural networks in detecting news site reliability using these attributed webgraphs, and show that our baseline news site reliability classifier outperforms current SoTA methods on the PoliticalNews dataset, achieving an F1 score of 0.96. Finally, we introduce and evaluate a novel graph-based algorithm for discovering previously unknown misinformation news sources.
- Publication:
-
arXiv e-prints
- Pub Date:
- January 2024
- DOI:
- 10.48550/arXiv.2401.02379
- arXiv:
- arXiv:2401.02379
- Bibcode:
- 2024arXiv240102379C
- Keywords:
-
- Computer Science - Social and Information Networks;
- Computer Science - Computers and Society
- E-Print:
- doi:10.1609/icwsm.v18i1.31309