A review of unsupervised learning in astronomy
Abstract
This review summarises popular unsupervised learning methods, and gives an overview of their past, current, and future uses in astronomy. Unsupervised learning aims to organise the information content of a dataset, in such a way that knowledge can be extracted. Traditionally this has been achieved through dimensionality reduction techniques that aid the ranking of a dataset, for example through principal component analysis or by using auto-encoders, or simpler visualisation of a high dimensional space, for example through the use of a self organising map. Other desirable properties of unsupervised learning include the identification of clusters, i.e. groups of similar objects, which has traditionally been achieved by the k-means algorithm and more recently through density-based clustering such as HDBSCAN. More recently, complex frameworks have emerged, that chain together dimensionality reduction and clustering methods. However, no dataset is fully unknown. Thus, nowadays a lot of research has been directed towards self-supervised and semi-supervised methods that stand to gain from both supervised and unsupervised learning.
- Publication:
-
Astronomy and Computing
- Pub Date:
- July 2024
- DOI:
- arXiv:
- arXiv:2406.17316
- Bibcode:
- 2024A&C....4800851F
- Keywords:
-
- Unsupervised learning;
- Machine learning;
- Data intensive astronomy;
- Extragalactic astronomy;
- Astrophysics - Instrumentation and Methods for Astrophysics;
- Computer Science - Machine Learning
- E-Print:
- 30 pages, 6 figures. Invited contribution to special issue in Astronomy &