A Probabilistic Model For Sequence Analysis
Abstract
This paper presents a probabilistic approach for DNA sequence analysis. A DNA sequence consists of an arrangement of the four nucleotides A, C, T and G and different representation schemes are presented according to a probability measure associated with them. There are different ways that probability can be associated with the DNA sequence: one way is when the probability of an occurrence of a letter does not depend on the previous one (termed as unsuccessive probability) and in another scheme the probability of occurrence of a letter depends on its previous letter (termed as successive probability). Further, based on these probability measures graphical representations of the schemes are also presented. Using the diagram probability measure one can easily calculate an associated probability measure which can serve as a parameter to check how close is a new sequence to already existing ones.
- Publication:
-
arXiv e-prints
- Pub Date:
- February 2010
- DOI:
- 10.48550/arXiv.1002.2412
- arXiv:
- arXiv:1002.2412
- Bibcode:
- 2010arXiv1002.2412P
- Keywords:
-
- Quantitative Biology - Quantitative Methods;
- Computer Science - Computational Engineering;
- Finance;
- and Science
- E-Print:
- IEEE format, International Journal of Computer Science and Information Security, IJCSIS January 2010, ISSN 1947 5500, http://sites.google.com/site/ijcsis/