SpecPT (Spectroscopy Pre-trained Transformer) Model for Extragalactic Spectroscopy: I. Architecture and Automated Redshift Measurement
Abstract
We introduce the Spectroscopy Pre-trained Transformer (SpecPT), a transformer-based model designed to analyze spectroscopic data, with applications in spectrum reconstruction and redshift measurement. Using the Early Data Release (EDR) of the DESI survey, we evaluate SpecPT's performance on two distinct datasets: the Bright Galaxy Survey (BGS) and Emission Line Galaxy (ELG) samples. SpecPT successfully reconstructs spectra, accurately capturing emission lines, absorption features, and continuum shapes while effectively reducing noise. For redshift prediction, SpecPT achieves competitive accuracy, with Normalized Median Absolute Deviation (NMAD) values of 0.0006 and 0.0008, and catastrophic outlier fractions of 0.20% and 0.80% for BGS and ELG, respectively. Notably, SpecPT performs consistently well across the full redshift range ($0 < z < 1.6$), demonstrating its versatility and robustness. By leveraging its learned latent representations, SpecPT lays the groundwork for a foundational spectroscopic model, with potential applications in outlier detection, interstellar medium (ISM) property estimation, and transfer learning to other datasets. This work represents a first step in building a generalized framework for spectroscopic analysis, capable of scaling to the full DESI dataset and beyond.
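The two redshift metrics quoted above can be illustrated with a short sketch. It assumes the conventional definitions, which the abstract itself does not spell out: the normalized residual is $\Delta z = (z_{\rm pred} - z_{\rm true})/(1 + z_{\rm true})$, NMAD is $1.4826 \times \mathrm{median}(|\Delta z - \mathrm{median}(\Delta z)|)$, and a catastrophic outlier is any object with $|\Delta z|$ above a chosen threshold. The function name `redshift_metrics` and the `outlier_threshold` parameter are hypothetical. The abstract does not state the threshold the authors use, so the caller must supply one.

```python
from statistics import median


def redshift_metrics(z_true, z_pred, outlier_threshold):
    """Return (NMAD, catastrophic outlier fraction) for predicted redshifts.

    Assumed conventions (not taken from the abstract):
      dz   = (z_pred - z_true) / (1 + z_true)
      NMAD = 1.4826 * median(|dz - median(dz)|)
      an object is an outlier when |dz| > outlier_threshold
    """
    if len(z_true) != len(z_pred) or len(z_true) == 0:
        raise ValueError("z_true and z_pred must be non-empty and equal in length")

    # Normalized residual for every object.
    dz = [(zp - zt) / (1.0 + zt) for zt, zp in zip(z_true, z_pred)]

    # Robust scatter: scaled median absolute deviation around the median.
    center = median(dz)
    nmad = 1.4826 * median(abs(d - center) for d in dz)

    # Fraction of objects whose normalized residual exceeds the threshold.
    n_outliers = sum(1 for d in dz if abs(d) > outlier_threshold)
    return nmad, n_outliers / len(dz)


if __name__ == "__main__":
    # Toy values for illustration only; these are not DESI measurements.
    truth = [0.10, 0.50, 1.00, 1.50]
    preds = [0.10, 0.50, 1.00, 2.00]
    nmad, frac = redshift_metrics(truth, preds, outlier_threshold=0.15)
    print(f"NMAD = {nmad:.4f}, outlier fraction = {frac:.2%}")
```

The factor 1.4826 scales the median absolute deviation so that it matches the standard deviation for Gaussian-distributed residuals, which is why NMAD is quoted as a scatter measure that is insensitive to the outliers counted separately.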
- Publication: arXiv e-prints
- Pub Date: January 2025
- arXiv: arXiv:2501.01070
- Bibcode: 2025arXiv250101070P
- Keywords: Astrophysics - Instrumentation and Methods for Astrophysics; Astrophysics - Astrophysics of Galaxies
- E-Print: 17 pages, 13 figures. Submitted to ApJ