Entropy of finite random binary sequences with weak long-range correlations
Abstract
We study the N -step binary stationary ergodic Markov chain and analyze its differential entropy. Supposing that the correlations are weak we express the conditional probability function of the chain through the pair correlation function and represent the entropy as a functional of the pair correlator. Since the model uses the two-point correlators instead of the block probability, it makes it possible to calculate the entropy of strings at much longer distances than using standard methods. A fluctuation contribution to the entropy due to finiteness of random chains is examined. This contribution can be of the same order as its regular part even at the relatively short lengths of subsequences. A self-similar structure of entropy with respect to the decimation transformations is revealed for some specific forms of the pair correlation function. Application of the theory to the DNA sequence of the R3 chromosome of Drosophila melanogaster is presented.
- Publication:
-
Physical Review E
- Pub Date:
- November 2014
- DOI:
- arXiv:
- arXiv:1502.07363
- Bibcode:
- 2014PhRvE..90e2106M
- Keywords:
-
- 05.40.-a;
- 02.50.Ga;
- Fluctuation phenomena random processes noise and Brownian motion;
- Markov processes;
- Condensed Matter - Statistical Mechanics;
- Condensed Matter - Disordered Systems and Neural Networks;
- Computer Science - Information Theory;
- Physics - Data Analysis;
- Statistics and Probability
- E-Print:
- 9 pages, 4 figures. arXiv admin note: substantial text overlap with arXiv:1411.2761, arXiv:1412.3692