The Vera C. Rubin Observatory Data Butler and pipeline execution system
Abstract
The Rubin Observatory's Data Butler is designed to allow data file location and file formats to be abstracted away from the people writing the science pipeline algorithms. The Butler works in conjunction with the workflow graph builder to allow pipelines to be constructed from the algorithmic tasks. These pipelines can be executed at scale using object stores and multi-node clusters, or on a laptop using a local file system. The Butler and pipeline system are now in daily use during Rubin construction and early operations.
- Publication:
-
Software and Cyberinfrastructure for Astronomy VII
- Pub Date:
- August 2022
- DOI:
- arXiv:
- arXiv:2206.14941
- Bibcode:
- 2022SPIE12189E..11J
- Keywords:
-
- Astrophysics - Instrumentation and Methods for Astrophysics;
- Computer Science - Distributed;
- Parallel;
- and Cluster Computing
- E-Print:
- 14 pages, 3 figures, submitted to Proc SPIE 12189, "Software and Cyberinfrastructure for Astronomy VII", Montreal, CA, July 2022