Subtree power analysis finds optimal species for comparative genomics
Abstract
Sequence comparison across multiple organisms aids in the detection of regions under selection. However, resource limitations require a prioritization of genomes to be sequenced. This prioritization should be grounded in two considerations: the lineal scope encompassing the biological phenomena of interest, and the optimal species within that scope for detecting functional elements. We introduce a statistical framework for optimal species subset selection, based on maximizing power to detect conserved sites. In a study of vertebrate species, we show that the optimal species subset is not in general the most evolutionarily diverged subset. Our results suggest that marsupials are prime sequencing candidates.
- Publication:
-
arXiv e-prints
- Pub Date:
- December 2004
- DOI:
- 10.48550/arXiv.q-bio/0412012
- arXiv:
- arXiv:q-bio/0412012
- Bibcode:
- 2004q.bio....12012M
- Keywords:
-
- Genomics;
- Quantitative Methods
- E-Print:
- 16 pages, 3 figures, 3 tables