Mining Scientific Papers for Bibliometrics: a (very) Brief Survey of Methods and Tools
Abstract
The Open Access movement in scientific publishing and search engines like Google Scholar have made scientific articles more broadly accessible. During the last decade, the availability of scientific papers in full text has become more and more widespread thanks to the growing number of publications on online platforms such as ArXiv and CiteSeer. The efforts to provide articles in machine-readable formats and the rise of Open Access publishing have resulted in a number of standardized formats for scientific papers (such as NLM-JATS, TEI, DocBook). Our aim is to stimulate research at the intersection of Bibliometrics and Computational Linguistics in order to study the ways Bibliometrics can benefit from large-scale text analytics and sense mining of scientific papers, thus exploring the interdisciplinarity of Bibliometrics and Natural Language Processing.
- Publication:
-
arXiv e-prints
- Pub Date:
- May 2015
- DOI:
- arXiv:
- arXiv:1505.01393
- Bibcode:
- 2015arXiv150501393A
- Keywords:
-
- Computer Science - Digital Libraries;
- Computer Science - Computation and Language
- E-Print:
- 2 pages, paper accepted for the 15th International Society of Scientometrics and Informetrics Conference (ISSI)