Machine Learning Pipeline for Pulsar Star Dataset
Abstract
This work brings together some of the most common machine learning (ML) algorithms and compares the results they obtain on an unbalanced dataset. The dataset (HTRU2) comprises almost 17 thousand observations of astronomical objects, collected to identify pulsars. The methodological proposal is based on evaluating the accuracy of these models on the same dataset treated with two different strategies for handling unbalanced data. The results show that, despite the noise and class imbalance present in this type of data, standard ML algorithms can be applied to it and achieve promising accuracy.
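As a rough illustration of what treating the same data with two strategies for class imbalance can involve, the sketch below rebalances a small synthetic label set in two ways. The abstract does not name the strategies used; random undersampling and random oversampling are assumed here purely for illustration, and the function names, the 1000:100 class ratio, and the toy feature values are hypothetical stand-ins rather than HTRU2 data. The sketch also shows why raw accuracy deserves care on unbalanced data: a model that always predicts the majority class already scores about 91% on this toy set.

```python
import random
from collections import Counter


def _group_by_class(rows, labels):
    """Collect rows into a dict keyed by class label."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(label, []).append(row)
    return groups


def _flatten(groups, rng):
    """Turn {label: rows} back into shuffled parallel lists."""
    pairs = [(row, label) for label, members in groups.items() for row in members]
    rng.shuffle(pairs)
    return [row for row, _ in pairs], [label for _, label in pairs]


def random_undersample(rows, labels, seed=0):
    """Assumed strategy 1: drop majority rows at random until classes match."""
    rng = random.Random(seed)
    groups = _group_by_class(rows, labels)
    target = min(len(members) for members in groups.values())
    kept = {label: rng.sample(members, target) for label, members in groups.items()}
    return _flatten(kept, rng)


def random_oversample(rows, labels, seed=0):
    """Assumed strategy 2: duplicate minority rows at random until classes match."""
    rng = random.Random(seed)
    groups = _group_by_class(rows, labels)
    target = max(len(members) for members in groups.values())
    grown = {
        label: members + [rng.choice(members) for _ in range(target - len(members))]
        for label, members in groups.items()
    }
    return _flatten(grown, rng)


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical toy data: 1000 negatives (label 0) and 100 positives (label 1).
labels = [0] * 1000 + [1] * 100
rows = [[float(i)] for i in range(len(labels))]

# Always predicting the majority class looks accurate but finds no positives.
baseline = accuracy(labels, [0] * len(labels))

under_rows, under_labels = random_undersample(rows, labels)
over_rows, over_labels = random_oversample(rows, labels)

print(f"majority-class baseline accuracy: {baseline:.3f}")
print("after undersampling:", Counter(under_labels))
print("after oversampling:", Counter(over_labels))
```

In a real comparison, resampling would normally be applied to the training split only, so that the held-out test set keeps the original class ratio.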
- Publication: arXiv e-prints
- Pub Date: May 2020
- DOI: 10.48550/arXiv.2005.01208
- arXiv: arXiv:2005.01208
- Bibcode: 2020arXiv200501208Y
- Keywords: Astrophysics - Instrumentation and Methods for Astrophysics; Computer Science - Machine Learning