Measuring the Stability of EHR- and EKG-based Predictive Models

doi:10.48550/arXiv.1812.00210

Measuring the Stability of EHR- and EKG-based Predictive Models

Databases of electronic health records (EHRs) are increasingly used to inform clinical decisions. Machine learning methods can find patterns in EHRs that are predictive of future adverse outcomes. However, statistical models may be built upon patterns of health-seeking behavior that vary across patient subpopulations, leading to poor predictive performance when training on one patient population and predicting on another. This note proposes two tests to better measure and understand model generalization. We use these tests to compare models derived from two data sources: (i) historical medical records, and (ii) electrocardiogram (EKG) waveforms. In a predictive task, we show that EKG-based models can be more stable than EHR-based models across different patient populations.

Publication:

arXiv e-prints

Pub Date:

December 2018

DOI:

10.48550/arXiv.1812.00210

arXiv:

arXiv:1812.00210

Bibcode:

2018arXiv181200210M

Keywords:

Statistics - Machine Learning;
Computer Science - Machine Learning

E-Print:

Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:cs/0101200

NASA/ADS

Measuring the Stability of EHR- and EKG-based Predictive Models

Abstract