Effects of high-order correlations on personalized recommendations for bipartite networks
Abstract
In this paper, we introduce a modified collaborative filtering (MCF) algorithm, which has remarkably higher accuracy than the standard collaborative filtering. In the MCF, instead of the cosine similarity index, the user-user correlations are obtained by a diffusion process. Furthermore, by considering the second-order correlations, we design an effective algorithm that depresses the influence of mainstream preferences. Simulation results show that the algorithmic accuracy, measured by the average ranking score, is further improved by 20.45% and 33.25% in the optimal cases of MovieLens and Netflix data. More importantly, the optimal value λ depends approximately monotonously on the sparsity of the training set. Given a real system, we could estimate the optimal parameter according to the data sparsity, which makes this algorithm easy to be applied. In addition, two significant criteria of algorithmic performance, diversity and popularity, are also taken into account. Numerical results show that as the sparsity increases, the algorithm considering the second-order correlation can outperform the MCF simultaneously in all three criteria.
- Publication:
-
Physica A Statistical Mechanics and its Applications
- Pub Date:
- February 2010
- DOI:
- 10.1016/j.physa.2009.10.027
- arXiv:
- arXiv:0808.3726
- Bibcode:
- 2010PhyA..389..881L
- Keywords:
-
- Physics - Data Analysis;
- Statistics and Probability;
- Computer Science - Information Retrieval
- E-Print:
- Physica A 389, 881-886 (2010)