Random survival forests
Abstract
We introduce random survival forests, a random forests method for the analysis of right-censored survival data. New survival splitting rules for growing survival trees are introduced, as is a new missing data algorithm for imputing missing data. A conservation-of-events principle for survival forests is introduced and used to define ensemble mortality, a simple interpretable measure of mortality that can be used as a predicted outcome. Several illustrative examples are given, including a case study of the prognostic implications of body mass for individuals with coronary artery disease. Computations for all examples were implemented using the freely available R-software package, randomSurvivalForest.
- Publication:
-
arXiv e-prints
- Pub Date:
- November 2008
- DOI:
- 10.48550/arXiv.0811.1645
- arXiv:
- arXiv:0811.1645
- Bibcode:
- 2008arXiv0811.1645I
- Keywords:
-
- Statistics - Applications
- E-Print:
- Published in at http://dx.doi.org/10.1214/08-AOAS169 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)