Efficient and accurate causal inference with hidden confounders from genome-transcriptome variation data
Abstract
Mapping gene expression as a quantitative trait using whole genome-sequencing and transcriptome analysis allows to discover the functional consequences of genetic variation. We developed a novel method and ultra-fast software Findr for higly accurate causal inference between gene expression traits using cis-regulatory DNA variations as causal anchors, which improves current methods by taking into account hidden confounders and weak regulations. Findr outperformed existing methods on the DREAM5 Systems Genetics challenge and on the prediction of microRNA and transcription factor targets in human lymphoblastoid cells, while being nearly a million times faster. Findr is publicly available at https://github.com/lingfeiwang/findr
- Publication:
-
PLoS Computational Biology
- Pub Date:
- August 2017
- DOI:
- 10.1371/journal.pcbi.1005703
- arXiv:
- arXiv:1611.01114
- Bibcode:
- 2017PLSCB..13E5703W
- Keywords:
-
- Quantitative Biology - Genomics;
- Quantitative Biology - Molecular Networks;
- Quantitative Biology - Quantitative Methods
- E-Print:
- New result and method sections added. 38 pages, 4 figures, 1 table. Supplementary: 20 pages, 10 figures, 2 tables