Predicting the future relevance of research institutions - The winning solution of the KDD Cup 2016
Abstract
The world's collective knowledge is evolving through research and new scientific discoveries. It is becoming increasingly difficult to objectively rank the impact research institutes have on global advancements. However, since the funding, governmental support, staff and students quality all mirror the projected quality of the institution, it becomes essential to measure the affiliation's rating in a transparent and widely accepted way. We propose and investigate several methods to rank affiliations based on the number of their accepted papers at future academic conferences. We carry out our investigation using publicly available datasets such as the Microsoft Academic Graph, a heterogeneous graph which contains various information about academic papers. We analyze several models, starting with a simple probabilities-based method and then gradually expand our training dataset, engineer many more features and use mixed models and gradient boosted decision trees models to improve our predictions.
- Publication:
-
arXiv e-prints
- Pub Date:
- September 2016
- DOI:
- 10.48550/arXiv.1609.02728
- arXiv:
- arXiv:1609.02728
- Bibcode:
- 2016arXiv160902728S
- Keywords:
-
- Computer Science - Machine Learning;
- Computer Science - Digital Libraries;
- Computer Science - Social and Information Networks;
- Physics - Physics and Society
- E-Print:
- 6 pages, KDD 2016, KDD Cup 2016