Deep-learning real/bogus classification for the Tomo-e Gozen transient survey
Abstract
We present a deep neural network real/bogus classifier that improves classification performance in the Tomo-e Gozen transient survey by handling label errors in the training data. In the wide-field, high-frequency transient survey with Tomo-e Gozen, conventional convolutional neural network classifiers do not perform well enough, because about 10^6 bogus detections appear every night. To obtain a better classifier, we have developed a new two-stage training method. In the first stage, label errors in the training data are detected with a classifier trained by ordinary supervised learning. In the second stage, the labels of the flagged samples are removed and the samples are used as unlabeled data in semi-supervised training. On actual observed data, the classifier trained with this method achieves an area under the curve (AUC) of 0.9998 and a false positive rate (FPR) of 0.0002 at a true positive rate (TPR) of 0.9. The method removes the need for humans to relabel the data and works better on training data with a high fraction of label errors. Implementing the classifier in the Tomo-e Gozen pipeline reduced the number of transient candidates to ~40 objects per night, ~1/130 of the number produced by the previous version, while maintaining the recovery rate of real transients. This enables more efficient selection of targets for follow-up observations.
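As a rough illustration of the two ideas in the abstract, the plain-Python sketch below shows one possible way to strip labels from suspected label errors after a first supervised pass, and one way to compute an FPR at a fixed TPR. The function names, the `margin` threshold, and the disagreement rule are assumptions made for illustration; they are not taken from the paper or its source code.

```python
import math


def unlabel_suspected_errors(labels, prob_real, margin=0.9):
    """Return a copy of `labels` with suspected label errors set to None.

    labels    : list of 1 (real) or 0 (bogus), as given in the training set
    prob_real : stage-one classifier output, P(real), one value per sample
    margin    : hypothetical confidence level; a sample is flagged when the
                classifier contradicts its label at least this confidently
    """
    cleaned = []
    for label, p in zip(labels, prob_real):
        contradicts_real = label == 1 and p < 1.0 - margin
        contradicts_bogus = label == 0 and p > margin
        # Flagged samples lose their label and can be passed to a
        # semi-supervised learner as unlabeled data.
        cleaned.append(None if (contradicts_real or contradicts_bogus) else label)
    return cleaned


def fpr_at_tpr(scores, labels, target_tpr=0.9):
    """False positive rate at the highest threshold reaching `target_tpr`."""
    real = sorted((s for s, y in zip(scores, labels) if y == 1), reverse=True)
    bogus = [s for s, y in zip(scores, labels) if y == 0]
    if not real or not bogus:
        raise ValueError("need at least one real and one bogus sample")
    # Number of real samples that must pass the threshold (at least one).
    k = max(1, math.ceil(target_tpr * len(real) - 1e-9))
    threshold = real[k - 1]
    # Samples scoring at or above the threshold are classified as real.
    false_positives = sum(1 for s in bogus if s >= threshold)
    return false_positives / len(bogus)
```

For example, `unlabel_suspected_errors([1, 1, 0], [0.95, 0.02, 0.03])` keeps the first and third labels and removes the second, because the stage-one classifier assigns that "real" sample a very low probability of being real.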
- Publication: Publications of the Astronomical Society of Japan
- Pub Date: August 2022
- DOI: 10.1093/pasj/psac047
- arXiv: arXiv:2206.12478
- Bibcode: 2022PASJ...74..946T
- Keywords: methods: statistical; supernovae: general; surveys; Astrophysics - Instrumentation and Methods for Astrophysics; Astrophysics - High Energy Astrophysical Phenomena
- E-Print: 14 pages, 17 figures, 2 tables. Published in PASJ. The source code is available at https://github.com/ichiro-takahashi/tomoe-realbogus