Exoplanet validation with machine learning: 50 new validated Kepler planets

doi:10.1093/mnras/staa2498

Exoplanet validation with machine learning: 50 new validated Kepler planets

Over 30 per cent of the $\sim$4000 known exoplanets to date have been discovered using 'validation', where the statistical likelihood of a transit arising from a false positive (FP), non-planetary scenario is calculated. For the large majority of these validated planets calculations were performed using the VESPA algorithm. Regardless of the strengths and weaknesses of VESPA, it is highly desirable for the catalogue of known planets not to be dependent on a single method. We demonstrate the use of machine learning algorithms, specifically a Gaussian process classifier (GPC) reinforced by other models, to perform probabilistic planet validation incorporating prior probabilities for possible FP scenarios. The GPC can attain a mean log-loss per sample of 0.54 when separating confirmed planets from FPs in the Kepler Threshold-Crossing Event (TCE) catalogue. Our models can validate thousands of unseen candidates in seconds once applicable vetting metrics are calculated, and can be adapted to work with the active Transiting Exoplanet Survey Satellite (TESS) mission, where the large number of observed targets necessitate the use of automated algorithms. We discuss the limitations and caveats of this methodology, and after accounting for possible failure modes newly validate 50 Kepler candidates as planets, sanity checking the validations by confirming them with VESPA using up to date stellar information. Concerning discrepancies with VESPA arise for many other candidates, which typically resolve in favour of our models. Given such issues, we caution against using single-method planet validation with either method until the discrepancies are fully understood.

Publication:

Monthly Notices of the Royal Astronomical Society

Pub Date:

July 2021

DOI:

10.1093/mnras/staa2498

10.48550/arXiv.2008.10516

arXiv: