$K$-Means and Gaussian Mixture Modeling with a Separation Constraint
Abstract
We consider the problem of clustering with $K$-means and Gaussian mixture models with a constraint on the separation between the centers in the context of real-valued data. We first propose a dynamic programming approach to solving the $K$-means problem with a separation constraint on the centers, building on (Wang and Song, 2011). In the context of fitting a Gaussian mixture model, we then propose an EM algorithm that incorporates such a constraint. A separation constraint can help regularize the output of a clustering algorithm, and we provide both simulated and real data examples to illustrate this point.
- Publication:
-
arXiv e-prints
- Pub Date:
- July 2020
- DOI:
- 10.48550/arXiv.2007.04586
- arXiv:
- arXiv:2007.04586
- Bibcode:
- 2020arXiv200704586J
- Keywords:
-
- Statistics - Computation
- E-Print:
- 16 pages, 6 tables, 1 figure with 3 subfigures