Info-Clustering: A Mathematical Theory for Data Clustering
Abstract
We formulate an info-clustering paradigm based on a multivariate information measure, called multivariate mutual information, that naturally extends Shannon's mutual information between two random variables to the multivariate case involving more than two random variables. With proper model reductions, we show that the paradigm can be applied to study the human genome and connectome in a more meaningful way than the conventional algorithmic approach. Not only can info-clustering provide justifications and refinements to some existing techniques, but it also inspires new computationally feasible solutions.
- Publication:
-
arXiv e-prints
- Pub Date:
- May 2016
- DOI:
- 10.48550/arXiv.1605.01233
- arXiv:
- arXiv:1605.01233
- Bibcode:
- 2016arXiv160501233C
- Keywords:
-
- Computer Science - Information Theory;
- Quantitative Biology - Genomics;
- Quantitative Biology - Neurons and Cognition
- E-Print:
- In celebration of Claude Shannon's Centenary