4.6 Article

Evaluation of the dependence of radiomic features on the machine learning model

期刊

INSIGHTS INTO IMAGING
卷 13, 期 1, 页码 -

出版社

SPRINGER
DOI: 10.1186/s13244-022-01170-2

关键词

Radiomics; Feature relevance; Biomarkers; Feature selection; Machine learning

资金

  1. Projekt DEAL

向作者/读者索取更多资源

In radiomic studies, the features selected by different models can vary greatly, even among statistically similar models. It is misleading to consider features selected by a single model, making it difficult to determine if these features are candidate biomarkers.
Background In radiomic studies, several models are often trained with different combinations of feature selection methods and classifiers. The features of the best model are usually considered relevant to the problem, and they represent potential biomarkers. Features selected from statistically similarly performing models are generally not studied. To understand the degree to which the selected features of these statistically similar models differ, 14 publicly available datasets, 8 feature selection methods, and 8 classifiers were used in this retrospective study. For each combination of feature selection and classifier, a model was trained, and its performance was measured with AUC-ROC. The best-performing model was compared to other models using a DeLong test. Models that were statistically similar were compared in terms of their selected features. Results Approximately 57% of all models analyzed were statistically similar to the best-performing model. Feature selection methods were, in general, relatively unstable (0.58; range 0.35-0.84). The features selected by different models varied largely (0.19; range 0.02-0.42), although the selected features themselves were highly correlated (0.71; range 0.4-0.92). Conclusions Feature relevance in radiomics strongly depends on the model used, and statistically similar models will generally identify different features as relevant. Considering features selected by a single model is misleading, and it is often not possible to directly determine whether such features are candidate biomarkers.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据