iTARGET: Interpretable Tailored Age Regression for Grouped Epigenetic Traits
Abstract
Accurately predicting chronological age from DNA methylation patterns is crucial for advancing biological age estimation. However, this task is made challenging by Epigenetic Correlation Drift (ECD) and Heterogeneity Among CpGs (HAC), which reflect the dynamic relationship between methylation and age across different life stages. To address these issues, we propose a novel two-phase algorithm. The first phase employs similarity searching to cluster methylation profiles by age group, while the second phase uses Explainable Boosting Machines (EBM) for precise, group-specific prediction. Our method not only improves prediction accuracy but also reveals key age-related CpG sites, detects age-specific changes in aging rates, and identifies pairwise interactions between CpG sites. Experimental results show that our approach outperforms traditional epigenetic clocks and machine learning models, offering a more accurate and interpretable solution for biological age estimation with significant implications for aging research.
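The two-phase idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`assign_group`, `fit_group_models`, `predict_age`), the two-CpG toy data, and the age-group boundaries are all hypothetical. Phase one is approximated here by a k-nearest-neighbour vote over Euclidean distance, and a per-group one-variable least-squares line stands in for the Explainable Boosting Machine purely to keep the example dependency-free.

```python
import math
import random
from collections import Counter


def assign_group(profile, train_X, train_groups, k=3):
    """Phase 1 (sketch): majority vote among the k nearest training profiles."""
    nearest = sorted(range(len(train_X)),
                     key=lambda i: math.dist(profile, train_X[i]))[:k]
    return Counter(train_groups[i] for i in nearest).most_common(1)[0][0]


def fit_line(xs, ys):
    """Closed-form least squares for y = slope * x + intercept."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


def make_toy_data(n_per_group=30, seed=0):
    """Synthetic profiles [cpg0, cpg1]; cpg0 drifts with age at a
    group-specific rate, cpg1 separates the groups (assumed toy setup)."""
    rng = random.Random(seed)
    X, ages, groups = [], [], []
    for group, (lo, hi), (base, rate), marker in [
        ("young", (0, 20), (0.2, 0.02), 0.1),    # fast methylation change
        ("old", (40, 80), (0.6, 0.002), 0.9),    # slow methylation change
    ]:
        for _ in range(n_per_group):
            age = rng.uniform(lo, hi)
            X.append([base + rate * age, marker + rng.uniform(-0.05, 0.05)])
            ages.append(age)
            groups.append(group)
    return X, ages, groups


def fit_group_models(X, ages, groups):
    """Phase 2 (sketch): one regressor per age group, fitted on cpg0 only.
    A linear fit is a stand-in for the EBM named in the abstract."""
    models = {}
    for g in set(groups):
        idx = [i for i, gg in enumerate(groups) if gg == g]
        models[g] = fit_line([X[i][0] for i in idx], [ages[i] for i in idx])
    return models


def predict_age(profile, X, groups, models, k=3):
    """Route the profile to its age group, then apply that group's model."""
    g = assign_group(profile, X, groups, k)
    slope, intercept = models[g]
    return g, slope * profile[0] + intercept


X, ages, groups = make_toy_data()
models = fit_group_models(X, ages, groups)
print(predict_age([0.40, 0.1], X, groups, models))  # routed to the "young" model
print(predict_age([0.72, 0.9], X, groups, models))  # routed to the "old" model
```

Because the same change in `cpg0` corresponds to very different age differences in the two toy groups, a single global regressor would fit both poorly; routing each profile to a group-specific model is the point the sketch is meant to convey.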
- Publication:
- arXiv e-prints
- Pub Date:
- January 2025
- DOI:
- arXiv:
- arXiv:2501.02401
- Bibcode:
- 2025arXiv250102401W
- Keywords:
- Quantitative Biology - Genomics;
- Computer Science - Artificial Intelligence;
- 62P10;
- 92D20;
- 92D10;
- I.5.4;
- J.3;
- I.2.6
- E-Print:
- To be published in IEEE BIBM 2024. The manuscript includes a comprehensive description of the methodology and a comparison with traditional epigenetic clocks and machine learning models. Submitted to arXiv as part of ongoing research in epigenetics and aging studies.