Global Unique Identification of Geoscience Samples: The International Geo Sample Number (IGSN) and the System for Earth Sample Registration (SESAR)
Abstract
Data on samples represent a primary foundation of Geoscience research across disciplines, ranging from the study of climate change, to biogeochemical cycles, to mantle and continental dynamics and are key to our knowledge of the Earth's dynamical systems and evolution. Different data types are generated for individual samples by different research groups, published in different papers, and stored in different databases on a global scale. The utility of these data is critically dependent on their integration. Such integration can be achieved within a Geoscience Cyberinfrastructure, but requires unambiguous identification of samples. Currently, naming of samples is arbitrary and inconsistent and therefore severely limits our ability to share, link, and integrate sample-based data. Major problems include name duplication, and changing of names as a sample is passed along over many years to different investigators. SESAR, the System for Earth Sample Registration (http://www.geosamples.org), addresses this problem by building a registry that generates and administers globally unique identifiers for Geoscience samples: the International Geo Sample Number (IGSN). Implementation of the IGSN in data publication and digital data management will dramatically advance interoperability among information systems for sample-based data, opening an extensive range of new opportunities for discovery and for interdisciplinary approaches in research. The IGSN will also facilitate the ability of investigators to build on previously collected data on samples as new measurements are made or new techniques are developed. With potentially broad application to all types of Geoscience samples, SESAR is global in scope. It is a web-based system that can be easily accessed by individual users through an interactive web interface and by distributed client systems via standard web services. Samples can be registered individually or in batches and at various levels of granularity from entire cores or dredges or sample suites to individual samples to sub-samples such as splits and separates. Relationships between `parent' and `child' samples are tracked. The system generates bar codes that users can download as images for labeling purposes. SESAR released a beta version of the registry in April 2005 that allows users to register a limited range of sample types. Identifiers generated by the beta version will remain valid when SESAR moves into its operational stage. Since then more than 3700 samples have been registered in SESAR. Registration of samples at a central clearinghouse will automatically build a global catalog of Geoscience samples, which will become a hugely valuable resource for the Geoscience community that allows more efficient planning of field and laboratory projects and facilitates sharing of samples, which will help build more comprehensive data sets for individual samples. The SESAR catalog will provide links to sample profiles on external systems that hold data about samples, thereby enabling users to easily obtain complete information about samples.
- Publication:
-
AGU Fall Meeting Abstracts
- Pub Date:
- December 2005
- Bibcode:
- 2005AGUFMIN43C..05L
- Keywords:
-
- 3640 Igneous petrology;
- 3675 Sedimentary petrology;
- 6620 Science policy (0485);
- 9805 Instruments useful in three or more fields;
- 9810 New fields (not classifiable under other headings)