Machine Learning Techniques for Stellar Light Curve Classification
Abstract
We apply machine learning techniques in an attempt to predict and classify stellar properties from noisy and sparse time-series data. We preprocessed over 94 GB of Kepler light curves from the Mikulski Archive for Space Telescopes (MAST) to classify according to 10 distinct physical properties using both representation learning and feature engineering approaches. Studies using machine learning in the field have been primarily done on simulated data, making our study one of the first to use real light-curve data for machine learning approaches. We tuned our data using previous work with simulated data as a template and achieved mixed results between the two approaches. Representation learning using a long short-term memory recurrent neural network produced no successful predictions, but our work with feature engineering was successful for both classification and regression. In particular, we were able to achieve values for stellar density, stellar radius, and effective temperature with low error (∼2%-4%) and good accuracy (∼75%) for classifying the number of transits for a given star. The results show promise for improvement for both approaches upon using larger data sets with a larger minority class. This work has the potential to provide a foundation for future tools and techniques to aid in the analysis of astrophysical data.
- Publication:
-
The Astronomical Journal
- Pub Date:
- July 2018
- DOI:
- arXiv:
- arXiv:1710.06804
- Bibcode:
- 2018AJ....156....7H
- Keywords:
-
- methods: data analysis;
- planetary systems;
- planets and satellites: detection;
- stars: general;
- techniques: image processing;
- Astrophysics - Instrumentation and Methods for Astrophysics;
- 85
- E-Print:
- Accepted to The Astronomical Journal