Statistical distributions and entropy considerations in gene codes
Abstract
We consider selected linguistic features of genomes in order to study the statistics of gene codes. We present an information-theoretic argument from which it follows that a system described by distributions of hyperbolic type admits the possibility of entropy loss and stability. We show that the histograms of gene lengths are similar to those of words in a language. We show the correspondence between the presented theory and the results for the number of replicated genes and replicated gene fragments in the genomes of Borrelia burgdorferi, Escherichia coli and Saccharomyces cerevisiae S288c.
- Publication: arXiv e-prints
- Pub Date: July 2014
- arXiv: arXiv:1407.2269
- Bibcode: 2014arXiv1407.2269L
- Keywords: Quantitative Biology - Genomics; Physics - Biological Physics
- E-Print: 11 pages, 18 figures. arXiv admin note: text overlap with arXiv:1401.4561