A step beyond simple keyword searches: Services enabled by a full content digital journal archive
Abstract
The problems of managing and searching large archives of scientific journal articles can potentially be addressed through data mining and statistical techniques matured primarily for quantitative scientific data analysis. A journal paper could be represented by a multivariate descriptor, e.g., the occurrence counts of a number key technical terms or phrases (keywords), perhaps derived from a controlled vocabulary (e.g., the American Meteorological Society's Glossary of Meteorology) or bootstrapped from the journal archive itself. With this technique, conventional statistical classification tools can be leveraged to address challenges faced by both scientists and professional societies in knowledge management. For example, cluster analyses can be used to find bundles of "most-related" papers, and address the issue of journal bifurcation (when is a new journal necessary, and what topics should it encompass). Similarly, neural networks can be trained to predict the optimal journal (within a society's collection) in which a newly submitted paper should be published. Comparable techniques could enable very powerful end-user tools, all premised on the view of a paper as a data point in a multidimensional descriptor space, e.g.: "find papers most similar to the one I am reading", "build a personalized subscription service, based on the content of the papers I am interested in, rather than preselected keywords", "find suitable reviewers, based on the content of their own published works", etc. Such services may represent the next "quantum leap" beyond the rudimentary search interfaces currently provided to end-users, as well as a compelling value-added component needed to help bridge the print-to-digital-medium gap, and help stabilize professional societies' revenue stream during the print-to-digital transition.
- Publication:
-
AGU Fall Meeting Abstracts
- Pub Date:
- December 2003
- Bibcode:
- 2003AGUFMED32A1191B
- Keywords:
-
- 1700 HISTORY OF GEOPHYSICS;
- 1799 General or miscellaneous;
- 6605 Education;
- 6699 General or miscellaneous