On the Coverage Required for Diploid Genome Assembly
Abstract
We investigate the information-theoretic conditions to achieve the complete reconstruction of a diploid genome. We also analyze the standard greedy and de-Bruijn graph-based algorithms and compare the coverage depth and read length requirements with the information-theoretic lower bound. Our results show that the gap between the two is considerable because both algorithms require the double repeats in the genome to be bridged.
- Publication:
-
arXiv e-prints
- Pub Date:
- May 2024
- DOI:
- 10.48550/arXiv.2405.05734
- arXiv:
- arXiv:2405.05734
- Bibcode:
- 2024arXiv240505734M
- Keywords:
-
- Computer Science - Information Theory;
- Quantitative Biology - Genomics
- E-Print:
- Accepted at ISIT'24