Estimating seed sensitivity on homogeneous alignments
Abstract
We address the problem of estimating the sensitivity of seed-based similarity search algorithms. In contrast to approaches based on Markov models [18, 6, 3, 4, 10], we base the estimation on homogeneous alignments. We describe an algorithm for counting and randomly generating such alignments, and an algorithm for exactly computing the sensitivity of a broad class of seed strategies. We provide experimental results demonstrating the bias introduced by ignoring the homogeneity condition.
- Publication: arXiv e-prints
- Pub Date: March 2006
- DOI: 10.48550/arXiv.cs/0603106
- arXiv: arXiv:cs/0603106
- Bibcode: 2006cs........3106K
- Keywords: Computer Science - Other Computer Science
- E-Print: Proceedings of the Fourth IEEE Symposium on Bioinformatics and Bioengineering (BIBE), 387-394, 2004