STACCATO: a novel solution to supernova photometric classification with biased training sets
Abstract
We present a new solution to the problem of classifying Type Ia supernovae from their light curves alone given a spectroscopically confirmed but biased training set, circumventing the need to obtain an observationally expensive unbiased training set. We use Gaussian processes (GPs) to model the supernovae's (SN's) light curves, and demonstrate that the choice of covariance function has only a small influence on the GPs ability to accurately classify SNe. We extend and improve the approach of Richards et al. - a diffusion map combined with a random forest classifier - to deal specifically with the case of biased training sets. We propose a novel method called Synthetically Augmented Light Curve Classification (STACCATO) that synthetically augments a biased training set by generating additional training data from the fitted GPs. Key to the success of the method is the partitioning of the observations into subgroups based on their propensity score of being included in the training set. Using simulated light curve data, we show that STACCATO increases performance, as measured by the area under the Receiver Operating Characteristic curve (AUC), from 0.93 to 0.96, close to the AUC of 0.977 obtained using the 'gold standard' of an unbiased training set and significantly improving on the previous best result of 0.88. STACCATO also increases the true positive rate for SNIa classification by up to a factor of 50 for high-redshift/low-brightness SNe.
- Publication:
-
Monthly Notices of the Royal Astronomical Society
- Pub Date:
- January 2018
- DOI:
- 10.1093/mnras/stx2570
- arXiv:
- arXiv:1706.03811
- Bibcode:
- 2018MNRAS.473.3969R
- Keywords:
-
- methods: statistical;
- supernovae: general;
- cosmological parameters;
- distance scale;
- cosmology: observations;
- Astrophysics - Instrumentation and Methods for Astrophysics
- E-Print:
- Matches version accepted by MNRAS, v3 only changes metadata to point to zenodo repository for the code. The STACCATO code is available from https://doi.org/10.5281/zenodo.3701464