A Simple Approach to Sparse Clustering
Abstract
Consider the problem of sparse clustering, where it is assumed that only a subset of the features are useful for clustering purposes. In the framework of the COSA method of Friedman and Meulman, subsequently improved in the form of the Sparse K-means method of Witten and Tibshirani, a natural and simpler hill-climbing approach is introduced. The new method is shown to be competitive with these two methods and others.
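To make the idea of sparse clustering by a hill-climbing-style search concrete, here is a minimal sketch that alternates between clustering on a selected feature subset and re-selecting the features that the current clustering explains best. It is an illustration under stated assumptions, not necessarily the algorithm of the paper: the function names (`kmeans`, `feature_score`, `sparse_hill_climb`), the use of plain Lloyd's K-means as the clustering step, the fixed subset size `s`, and the normalized within-cluster score are all hypothetical choices made for this example.

```python
import random


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's K-means on a list of rows; returns one label per row."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign every point to its nearest center (squared Euclidean distance).
        labels = [
            min(range(k),
                key=lambda c: sum((x - m) ** 2 for x, m in zip(p, centers[c])))
            for p in points
        ]
        # Move each center to the mean of its members; keep it if the cluster is empty.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def feature_score(values, labels, k):
    """Within-cluster sum of squares of one feature divided by its total sum
    of squares (assumed criterion for this sketch; lower means more useful)."""
    mean = sum(values) / len(values)
    total = sum((x - mean) ** 2 for x in values)
    if total == 0:
        return 1.0  # a constant feature carries no clustering information
    within = 0.0
    for c in range(k):
        group = [x for x, lab in zip(values, labels) if lab == c]
        if group:
            g_mean = sum(group) / len(group)
            within += sum((x - g_mean) ** 2 for x in group)
    return within / total


def sparse_hill_climb(data, k, s, rounds=10, seed=0):
    """Alternate between (1) clustering on the selected features and
    (2) keeping the s features with the lowest score, until the selected
    subset stops changing or `rounds` is reached."""
    p = len(data[0])
    columns = list(zip(*data))
    features = list(range(p))            # start from all features
    labels = kmeans(data, k, seed=seed)
    for _ in range(rounds):
        scores = [feature_score(columns[j], labels, k) for j in range(p)]
        chosen = sorted(sorted(range(p), key=lambda j: scores[j])[:s])
        if chosen == features:
            break                         # no change: a local fixed point
        features = chosen
        reduced = [[row[j] for j in features] for row in data]
        labels = kmeans(reduced, k, seed=seed)
    return features, labels
```

Calling `sparse_hill_climb(data, k=2, s=2)` on a list of equal-length numeric rows returns the indices of the selected features together with the cluster labels. Like any local search, the result depends on the initial clustering, so in practice one would likely try several seeds.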
- Publication: arXiv e-prints
- Pub Date: February 2016
- DOI: 10.48550/arXiv.1602.07277
- arXiv: arXiv:1602.07277
- Bibcode: 2016arXiv160207277A
- Keywords: Statistics - Machine Learning
- E-Print: Computational Statistics &