Predicting the Biological Classification of Cell-Cycle Regulated Genes of Saccharomyces cerevisiae using Community Detection Algorithms on Gene Co-expression Networks
Abstract
The conventional approach to analyzing gene expression data relies on clustering algorithms. Cluster analysis partitions the set of genes and predicts their biological classification from their similarity in n-dimensional space. In this study, we investigate whether network analysis offers an advantage over this traditional approach. We identify the advantages and disadvantages of value-based and rank-based construction when building a graph representation of the original time-series gene expression data. We tested four community detection algorithms, namely the Clauset-Newman-Moore (greedy), Louvain, Leiden, and Girvan-Newman algorithms, on predicting the five functional groups of genes. We used the Adjusted Rand Index (ARI) to assess the quality of the predicted communities with respect to the biological classifications. We show that Girvan-Newman outperforms the three modularity-based algorithms on both value-based and rank-based graphs. Moreover, we show that Girvan-Newman obtains a higher ARI than conventional clustering algorithms such as K-means, spectral, BIRCH, and agglomerative clustering. This study also provides a tool for graph construction, visualization, and community detection for further analysis of gene expression data.
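To make the evaluation step concrete, the following is a minimal sketch of how an Adjusted Rand Index can be computed from two labelings of the same genes, using the standard pair-counting formula. It is not the authors' code: the function name `adjusted_rand_index` and the toy labels are illustrative assumptions, and only the Python standard library is used.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_true, labels_pred):
    """Pair-counting ARI between two labelings of the same items.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(labels_true) != len(labels_pred):
        raise ValueError("both labelings must cover the same items")
    n = len(labels_true)
    if n < 2:
        # No pairs to compare; treat the labelings as trivially identical.
        return 1.0

    # Contingency counts: how many items share each (true, predicted) pair.
    cells = Counter(zip(labels_true, labels_pred))
    rows = Counter(labels_true)   # sizes of the reference classes
    cols = Counter(labels_pred)   # sizes of the predicted communities

    sum_cells = sum(comb(c, 2) for c in cells.values())
    sum_rows = sum(comb(c, 2) for c in rows.values())
    sum_cols = sum(comb(c, 2) for c in cols.values())

    expected = sum_rows * sum_cols / comb(n, 2)  # agreement expected by chance
    max_index = 0.5 * (sum_rows + sum_cols)
    if max_index == expected:
        # Degenerate case, e.g. both labelings put everything in one group.
        return 1.0
    return (sum_cells - expected) / (max_index - expected)


# Toy data (assumed, not from the study): four genes in two reference classes.
reference = ["A", "A", "B", "B"]
relabeled = [1, 1, 0, 0]   # same partition, different community ids
crossed = [0, 1, 0, 1]     # every community mixes both classes

score_relabeled = adjusted_rand_index(reference, relabeled)
score_crossed = adjusted_rand_index(reference, crossed)
print(score_relabeled, score_crossed)
```

Because the index depends only on which items are grouped together, the relabeled partition scores 1.0 even though its community ids differ from the reference, while the crossed partition scores below zero, meaning worse agreement than expected by chance.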
- Publication: arXiv e-prints
- Pub Date: August 2022
- DOI: 10.48550/arXiv.2208.10119
- arXiv: arXiv:2208.10119
- Bibcode: 2022arXiv220810119C
- Keywords: Quantitative Biology - Molecular Networks; 05C85; F.2.0
- E-Print: 11 pages, Philippine Computing Journal Vol 16 No. 1