On the Theory and Algorithm for rigorous discretization in applications of Information Theory
Abstract
We identify fundamental issues with discretization when estimating information-theoretic quantities in the analysis of data. These difficulties are theoretical in nature and arise with discrete datasets carrying significant implications for the corresponding claims and results. Here we describe the origins of the methodological problems, and provide a clear illustration of their impact with the example of biological network reconstruction. We propose an algorithm (shared information metric) that corrects for the biases and the resulting improved performance of the algorithm demonstrates the need to take due consideration of this issue in different contexts.
- Publication:
-
arXiv e-prints
- Pub Date:
- June 2014
- DOI:
- 10.48550/arXiv.1406.5104
- arXiv:
- arXiv:1406.5104
- Bibcode:
- 2014arXiv1406.5104K
- Keywords:
-
- Quantitative Biology - Quantitative Methods