Learning to rank for censored survival data
Abstract
Survival analysis is a type of semisupervised ranking task where the target output (the survival time) is often rightcensored. Utilizing this information is a challenge because it is not obvious how to correctly incorporate these censored examples into a model. We study how three categories of loss functions, namely partial likelihood methods, rank methods, and our classification method based on a Wasserstein metric (WM) and the nonparametric Kaplan Meier estimate of the probability density to impute the labels of censored examples, can take advantage of this information. The proposed method allows us to have a model that predict the probability distribution of an event. If a clinician had access to the detailed probability of an event over time this would help in treatment planning. For example, determining if the risk of kidney graft rejection is constant or peaked after some time. Also, we demonstrate that this approach directly optimizes the expected Cindex which is the most common evaluation metric for ranking survival models.
 Publication:

arXiv eprints
 Pub Date:
 June 2018
 DOI:
 10.48550/arXiv.1806.01984
 arXiv:
 arXiv:1806.01984
 Bibcode:
 2018arXiv180601984L
 Keywords:

 Computer Science  Machine Learning;
 Computer Science  Artificial Intelligence;
 Statistics  Machine Learning