Towards real-time community detection in large networks
Abstract
The recent boom of large-scale online social networks (OSNs) both enables and necessitates the use of parallelizable and scalable computational techniques for their analysis. We examine the problem of real-time community detection and a recently proposed linear time— O(m) on a network with m edges—label propagation, or “epidemic” community detection algorithm. We identify characteristics and drawbacks of the algorithm and extend it by incorporating different heuristics to facilitate reliable and multifunctional real-time community detection. With limited computational resources, we employ the algorithm on OSN data with 1×106 nodes and about 58×106 directed edges. Experiments and benchmarks reveal that the extended algorithm is not only faster but its community detection accuracy compares favorably over popular modularity-gain optimization algorithms known to suffer from their resolution limits.
- Publication:
-
Physical Review E
- Pub Date:
- June 2009
- DOI:
- 10.1103/PhysRevE.79.066107
- arXiv:
- arXiv:0808.2633
- Bibcode:
- 2009PhRvE..79f6107L
- Keywords:
-
- 89.75.Hc;
- 87.23.Ge;
- 89.20.Hh;
- 05.10.-a;
- Networks and genealogical trees;
- Dynamics of social systems;
- World Wide Web Internet;
- Computational methods in statistical physics and nonlinear dynamics;
- Physics - Physics and Society
- E-Print:
- 10 pages, 11 figures