Toward two-dimensional search engines
Abstract
We study the statistical properties of various directed networks by ranking their nodes with the dominant vectors of the Google matrix, known as PageRank and CheiRank. On average, PageRank orders nodes proportionally to the number of ingoing links, while CheiRank orders nodes proportionally to the number of outgoing links. The ranking of nodes thus becomes two-dimensional, which paves the way for the development of two-dimensional search engines of a new type. Statistical properties of information flow on the PageRank-CheiRank plane are analyzed for networks of British, French and Italian universities, Wikipedia, the Linux Kernel, gene regulation and other networks. Special emphasis is placed on the networks of British universities, using the large database publicly available in the UK. Methods for controlling spam links are also analyzed.
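As a rough illustration of the two rankings described above, the following minimal sketch computes PageRank by power iteration on a toy directed graph and obtains CheiRank by running the same procedure on the graph with all link directions inverted. The function names, the toy graph, the damping factor of 0.85 and the uniform treatment of dangling nodes are assumptions made for this example; the abstract does not specify them.

```python
def google_rank(links, n, alpha=0.85, iters=200):
    """Power iteration for the dominant vector of a Google matrix.

    links: list of (source, target) pairs; n: number of nodes.
    alpha is the damping factor (0.85 is an assumed, conventional value).
    """
    out = [[] for _ in range(n)]
    for s, t in links:
        out[s].append(t)
    p = [1.0 / n] * n
    for _ in range(iters):
        new = [(1.0 - alpha) / n] * n  # teleportation term
        for s in range(n):
            if out[s]:
                share = alpha * p[s] / len(out[s])
                for t in out[s]:
                    new[t] += share
            else:
                # Dangling node (no outgoing links): spread its weight
                # uniformly over all nodes (an assumed convention).
                share = alpha * p[s] / n
                for t in range(n):
                    new[t] += share
        p = new
    return p


def rank_index(prob):
    """Return rank indices, 1 for the node with the largest probability."""
    order = sorted(range(len(prob)), key=lambda i: -prob[i])
    ranks = [0] * len(prob)
    for k, i in enumerate(order, start=1):
        ranks[i] = k
    return ranks


# Hypothetical toy network: node 2 has the most ingoing links,
# node 0 has the most outgoing links.
links = [(0, 1), (0, 2), (0, 3), (1, 2), (3, 2), (2, 0)]
n = 4

pagerank = google_rank(links, n)
# CheiRank: the same computation with every link direction inverted.
cheirank = google_rank([(t, s) for s, t in links], n)

K = rank_index(pagerank)       # PageRank index of each node
K_star = rank_index(cheirank)  # CheiRank index of each node
points = list(zip(K, K_star))  # node positions on the two-dimensional plane
```

Each node then has a pair of rank indices, one from PageRank and one from CheiRank, which is the sense in which the ranking becomes two-dimensional. In this toy graph the node with the most ingoing links tops the PageRank ordering and the node with the most outgoing links tops the CheiRank ordering.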
- Publication:
- Journal of Physics A Mathematical General
- Pub Date:
- July 2012
- DOI:
- 10.1088/1751-8113/45/27/275101
- arXiv:
- arXiv:1106.6215
- Bibcode:
- 2012JPhA...45A5101E
- Keywords:
- Computer Science - Information Retrieval;
- Condensed Matter - Statistical Mechanics
- E-Print:
- 22 pages, 16 figures. Additional data available at http://www.quantware.ups-tlse.fr/QWLIB/dvvadi/