Topical interests and the mitigation of search engine bias
Abstract
Search engines have become key media for our scientific, economic, and social activities by enabling people to access information on the web despite its size and complexity. On the down side, search engines bias the traffic of users according to their page ranking strategies, and it has been argued that they create a vicious cycle that amplifies the dominance of established and already popular sites. This bias could lead to a dangerous monopoly of information. We show that, contrary to intuition, empirical data do not support this conclusion; popular sites receive far less traffic than predicted. We discuss a model that accurately predicts traffic data patterns by taking into consideration the topical interests of users and their searching behavior in addition to the way search engines rank pages. The heterogeneity of user interests explains the observed mitigation of search engines' popularity bias.
- Publication:
-
Proceedings of the National Academy of Science
- Pub Date:
- August 2006
- DOI:
- 10.1073/pnas.0605525103
- arXiv:
- arXiv:cs/0511005
- Bibcode:
- 2006PNAS..10312684F
- Keywords:
-
- PHYSICAL SCIENCES / COMPUTER SCIENCES;
- Computer Science - Computers and Society;
- Computer Science - Information Retrieval;
- Physics - Physics and Society;
- H.3.3;
- H.3.4;
- H.3.5;
- H.5.4;
- K.4.m
- E-Print:
- 9 pages, 8 figures, 2 appendices. The final version of this e-print has been published on the Proc. Natl. Acad. Sci. USA 103(34), 12684-12689 (2006), http://www.pnas.org/cgi/content/abstract/103/34/12684