The NASA Astrophysics Data System (ADS) has become the standard tool for searching the literature in the astronomy and astrophysics community. Within the ADS we are now consolidating the article reference catalog. Reference sources come in a variety of data formats. OCR'ed scanned articles (HTML, LaTeX, XML...) from a large number of different publications. We present in this paper new developments allowing the automation of the reference digester through a set of highly configurable, object-oriented, Python/XML applications and tools. We expect the use of these tools to ease the burden of incorporating new publications to the reference databases.
Astronomical Data Analysis Software and Systems XI
- Pub Date: