VEXAS: VISTA EXtension to Auxiliary Surveys. Data Release 2: Machine-learning based classification of sources in the Southern Hemisphere
Abstract
Context. We present the second public data release of the VISTA EXtension to Auxiliary Surveys (VEXAS), where we classify objects into stars, galaxies, and quasars based on an ensemble of machine learning algorithms.
Aims: The aim of VEXAS is to build the widest multi-wavelength catalogue, providing reference magnitudes, colours, and morphological information for a large number of scientific uses.
Methods: We applied an ensemble of thirty-two different machine learning models, based on three different algorithms and on different magnitude sets, training samples, and classification problems (two or three classes) on the three VEXAS Data Release 1 (DR1) optical and infrared (IR) tables. The tables were created in DR1 cross-matching VISTA near-infrared data with Wide-field Infrared Survey Explorer far-infrared data and with optical magnitudes from the Dark Energy Survey (VEXAS-DESW), the Sky Mapper Survey (VEXAS-SMW), and the Panoramic Survey Telescope and Rapid Response System Survey (VEXAS-PSW). We assembled a large table of spectroscopically confirmed objects (VEXAS-SPEC-GOOD, 415 628 unique objects), based on the combination of six different spectroscopic surveys that we used for training. We developed feature imputation to also classify objects for which magnitudes in one or more bands are missing.
Results: We classify in total ≈90 × 106 objects in the Southern Hemisphere. Among these, ≈62.9 × 106 (≈52.6 × 106) are classified as `high confidence' (`secure') stars, ≈920 000 (≈750 000) as `high confidence' (`secure') quasars, and ≈34.8 (≈34.1) million as `high confidence' (`secure') galaxies, with pclass ≥ 0.7 (pclass ≥ 0.9). The DR2 tables update the DR1 with the addition of imputed magnitudes and membership probabilities to each of the three classes.
Conclusions: The density of high-confidence extragalactic objects varies strongly with the survey depth: at pclass > 0.7, there are 11 deg−2 quasars in the VEXAS-DESW footprint and 103 deg−2 in the VEXAS-PSW footprint, while only 10.7 deg−2 in the VEXAS-SM footprint. Improved depth in the mid-infrared and coverage in the optical and near-infrared are needed for the SM footprint that is not already covered by DESW and PSW.
- Publication:
-
Astronomy and Astrophysics
- Pub Date:
- July 2021
- DOI:
- 10.1051/0004-6361/202040131
- arXiv:
- arXiv:2103.09257
- Bibcode:
- 2021A&A...651A..69K
- Keywords:
-
- astronomical databases: miscellaneous;
- catalogs;
- surveys;
- methods: data analysis;
- virtual observatory tools;
- Astrophysics - Astrophysics of Galaxies;
- Astrophysics - Instrumentation and Methods for Astrophysics
- E-Print:
- 25 pages, 18 figures, 8 tables. Accepted for publication on A&