Information weights of nucleotides in DNA sequences
Abstract
The coding sequence in DNA molecule is considered as a message to be transferred to receiver, the proteins, through a noisy information channel and each nucleotide is assigned a respective information weight. With the help of the nucleotide substitution matrix we estimated the lower bound of the amount of information carried out by nucleotides which is not subject of mutations. We used the calculated weights to reconstruct k-oligomers of genes from the Borrelia burgdorferi genome. We showed, that to this aim there is sufficient a simple rule, that the number of bits of the carried information cannot exceed some threshold value. The method introduced by us is general and applies to every genome.
- Publication:
-
arXiv e-prints
- Pub Date:
- January 2003
- DOI:
- arXiv:
- arXiv:cond-mat/0301371
- Bibcode:
- 2003cond.mat..1371D
- Keywords:
-
- Condensed Matter - Soft Condensed Matter;
- Quantitative Biology
- E-Print:
- 8 pages, 7 figures, submitted for publication