Toric ideals with linear components: an algebraic interpretation of clustering the cells of a contingency table
Abstract
In this paper we show that the agglomeration of rows or columns of a contingency table with a hierarchical clustering algorithm yields statistical models defined through toric ideals. In particular, starting from the classical independence model, the agglomeration process adds a linear part to the toric ideal generated by the $2 \times 2$ minors.
- Publication:
-
arXiv e-prints
- Pub Date:
- September 2013
- DOI:
- 10.48550/arXiv.1309.7622
- arXiv:
- arXiv:1309.7622
- Bibcode:
- 2013arXiv1309.7622C
- Keywords:
-
- Mathematics - Statistics Theory;
- Mathematics - Commutative Algebra;
- Statistics - Methodology;
- 62H17;
- 62H30;
- 14M25;
- 15B36
- E-Print:
- 17 pages, 1 figure