An Open Source Tool For Test and Evaluation of Schema.org Dataset Publishing
Abstract
Publishing structured metadata with schema.org is a widely adopted way to advertise web content in a form that machines can readily process. The approach is also gaining popularity for advertising scientific data collections, their constituent datasets, and related information such as organizations and identities. Schema.org offers a rich, flexible, and extensible vocabulary for describing web-accessible resources, and that flexibility gives content providers many options for describing their resources. The resulting variety can in turn create challenges for aggregators such as DataONE, which provide services such as content indexing across the multiple repositories participating in a federation. Federation of data resources is facilitated by adopting community-developed guidelines and best practices for schema.org implementation and extension, such as those emerging through Science-on-schema.org, bioschemas.org, and geoschemas.org. To help content providers implement federation-friendly schema.org publishing, DataONE has developed a tool, run from the command line or through a web interface, that evaluates and reports on the web publishing functionality of a repository. Tools such as this facilitate compatible implementations that leverage the flexible, lightweight content publishing approach embodied by schema.org.
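As a rough illustration of the kind of check such an evaluation might perform, the following Python sketch extracts embedded JSON-LD from a landing page and reports which properties a schema.org Dataset description lacks. It is not the DataONE tool: the names `JsonLdExtractor`, `evaluate_page`, and `EXPECTED_PROPERTIES` are hypothetical, the property list is an assumption chosen for illustration, and the sketch does no JSON-LD context expansion, so it only recognizes the literal type name `Dataset`.

```python
import json
from html.parser import HTMLParser

# Assumed, illustrative property list; not the rule set of any real tool.
EXPECTED_PROPERTIES = ("name", "description", "url", "identifier")


class JsonLdExtractor(HTMLParser):
    """Collect the text of every <script type="application/ld+json"> element."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buffer))
            self._in_jsonld = False


def is_dataset(node):
    """True if the node's @type is, or includes, the literal string 'Dataset'."""
    types = node.get("@type", [])
    if isinstance(types, str):
        types = [types]
    return "Dataset" in types


def evaluate_page(html):
    """Return one report per Dataset node (or per unparseable JSON-LD block)."""
    parser = JsonLdExtractor()
    parser.feed(html)
    reports = []
    for raw in parser.blocks:
        try:
            doc = json.loads(raw)
        except json.JSONDecodeError as exc:
            reports.append({"error": f"invalid JSON-LD: {exc.msg}"})
            continue
        # A block may hold a single node, a list of nodes, or an @graph wrapper.
        if isinstance(doc, list):
            nodes = doc
        elif isinstance(doc, dict):
            graph = doc.get("@graph")
            nodes = graph if isinstance(graph, list) else [doc]
        else:
            nodes = []
        for node in nodes:
            if isinstance(node, dict) and is_dataset(node):
                missing = [p for p in EXPECTED_PROPERTIES if p not in node]
                reports.append({"name": node.get("name"), "missing": missing})
    return reports


if __name__ == "__main__":
    # Hypothetical landing page used only to exercise the sketch.
    sample = (
        '<html><head><script type="application/ld+json">'
        '{"@context": "https://schema.org/", "@type": "Dataset", '
        '"name": "Example", "url": "https://example.org/d/1"}'
        "</script></head></html>"
    )
    for report in evaluate_page(sample):
        print(report)
```

A fuller evaluation would presumably also fetch pages over HTTP, follow sitemaps, and validate against community guidelines; those steps are omitted here because the abstract does not describe them.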
- Publication: AGU Fall Meeting Abstracts
- Pub Date: December 2019
- Bibcode: 2019AGUFMIN22B..04V
- Keywords: 1916 Data and information discovery; INFORMATICS; 1936 Interoperability; INFORMATICS; 1946 Metadata; INFORMATICS; 1970 Semantic web and semantic integration; INFORMATICS