Causal inference in genetic trio studies
Abstract
The goal of genome-wide association studies is to identify meaningful relationships between genotypes and outcomes of interest. One challenge in the analysis of genetic data is that not all true statistical associations represent relevant biological activity; irrelevant but true associations can arise from the confounding effect of environmental conditions or other factors. We propose a method to analyze such data that is immune to this problem because it uses the variation in inheritance as a randomized experiment. The method can leverage any machine-learning algorithm as well as findings from other studies.
- Publication:
-
Proceedings of the National Academy of Science
- Pub Date:
- September 2020
- DOI:
- 10.1073/pnas.2007743117
- arXiv:
- arXiv:2002.09644
- Bibcode:
- 2020PNAS..11724117B
- Keywords:
-
- Statistics - Methodology;
- Statistics - Applications
- E-Print:
- Proc. Natl. Acad. Sci. U.S.A. 177 (2020) 24117-24126