4.7 Article

Predicting Survival Outcomes in the Presence of Unlabeled Data

期刊

MACHINE LEARNING
卷 111, 期 11, 页码 4139-4157

出版社

SPRINGER
DOI: 10.1007/s10994-022-06257-x

关键词

Survival analysis; Semi-supervised learning; Random survival forest; Self-training

资金

  1. KU Leuven Internal Funds [3M180314]
  2. Flemish Government (AI Research Program)

向作者/读者索取更多资源

This article explores the potential of including unlabeled data instances in survival analysis, proposing three approaches to handle this novel scenario and empirically demonstrating that all approaches can significantly improve predictive performance for survival times.
Many clinical studies require the follow-up of patients over time. This is challenging: apart from frequently observed drop-out, there are often also organizational and financial challenges, which can lead to reduced data collection and, in turn, can complicate subsequent analyses. In contrast, there is often plenty of baseline data available of patients with similar characteristics and background information, e.g., from patients that fall outside the study time window. In this article, we investigate whether we can benefit from the inclusion of such unlabeled data instances to predict accurate survival times. In other words, we introduce a third level of supervision in the context of survival analysis, apart from fully observed and censored instances, we also include unlabeled instances. We propose three approaches to deal with this novel setting and provide an empirical comparison over fifteen real-life clinical and gene expression survival datasets. Our results demonstrate that all approaches are able to increase the predictive performance over independent test data. We also show that integrating the partial supervision provided by censored data in a semi-supervised wrapper approach generally provides the best results, often achieving high improvements, compared to not using unlabeled data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据