Accurate estimation of influenza epidemics using Google search data via ARGO
Abstract
Big data generated from the Internet have great potential in tracking and predicting massive social activities. In this article, we focus on tracking influenza epidemics. We propose a model that utilizes publicly available Google search data to estimate current influenza-like illness activity level. Our model outperforms all available Google-search-based real-time tracking models for influenza epidemics at the national level of the United States, including Google Flu Trends. Our model is flexible, self-correcting, robust, and scalable, making it a potentially powerful tool that can be used for estimation and prediction at multiple temporal and spatial resolutions for other social events.
- Publication:
-
Proceedings of the National Academy of Science
- Pub Date:
- November 2015
- DOI:
- 10.1073/pnas.1515373112
- arXiv:
- arXiv:1505.00864
- Bibcode:
- 2015PNAS..11214473Y
- Keywords:
-
- Statistics - Applications;
- Computer Science - Social and Information Networks;
- Statistics - Machine Learning
- E-Print:
- 23 pages, 2 figures, Proceedings of the National Academy of Sciences (2015)