Algorithmic methods to infer the evolutionary trajectories in cancer progression
Abstract
A causality-based machine learning Pipeline for Cancer Inference (PiCnIc) is introduced to infer the underlying somatic evolution of ensembles of tumors from next-generation sequencing data. PiCnIc combines techniques for sample stratification, driver selection, and identification of fitness-equivalent exclusive alterations to exploit an algorithm based on Suppes' probabilistic causation. The accuracy and translational significance of the results are studied in detail, with an application to colorectal cancer. The PiCnIc pipeline has been made publicly accessible for reproducibility, interoperability, and future enhancements.
- Publication:
-
Proceedings of the National Academy of Science
- Pub Date:
- July 2016
- DOI:
- 10.1073/pnas.1520213113
- arXiv:
- arXiv:1509.07918
- Bibcode:
- 2016PNAS..113E4025C
- Keywords:
-
- Quantitative Biology - Genomics
- E-Print:
- doi:10.1073/pnas.1520213113