Network-based identification of disease genes in expression data: the GeneSurrounder method
Abstract
The advent of high--throughput transcription profiling technologies has enabled identification of genes and pathways associated with disease, providing new avenues for precision medicine. A key challenge is to analyze this data in the context of the regulatory networks and pathways that control cellular processes, while still obtaining insights that can be used to design new diagnostic and therapeutic interventions. While classical differential expression analysis provides specific and hence targetable gene-level insights, it does not include any systems-level information. On the other hand, pathway analyses integrate systems-level information with expression data, but are often limited in their ability to indicate specific molecular targets. We introduce GeneSurrounder, an analysis method that takes into account the complex structure of interaction networks to identify specific genes that disrupt pathway activity in a disease-specific manner. GeneSurrounder integrates transcriptomic data and pathway network information in a novel two-step procedure to detect genes that (i) appear to influence the expression of other genes local to it in the network and (ii) are part of a subnetwork of differentially expressed genes. Combined, this evidence can be used to pinpoint specific genes that have a mechanistic role in the phenotype of interest. Applying GeneSurrounder to three distinct ovarian cancer studies using a global KEGG network, we show that our method is able to identify biologically relevant genes and genes missed by single-gene association tests, integrate pathway and expression data, and yield more consistent results across multiple studies of the same phenotype than competing methods.
- Publication:
-
arXiv e-prints
- Pub Date:
- May 2017
- DOI:
- 10.48550/arXiv.1705.10922
- arXiv:
- arXiv:1705.10922
- Bibcode:
- 2017arXiv170510922S
- Keywords:
-
- Quantitative Biology - Quantitative Methods;
- Quantitative Biology - Genomics;
- Quantitative Biology - Molecular Networks;
- Statistics - Applications;
- Statistics - Computation
- E-Print:
- We have extended the application and evaluation of our GeneSurrounder method to a second disease (gene expression data from bladder cancer) and added additional analyses of GeneSurrounder's ability to identify known cancer-associated genes