4.7 Article

Ensemble feature selection for stable biomarker identification and cancer classification from microarray expression data

期刊

COMPUTERS IN BIOLOGY AND MEDICINE
卷 142, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.compbiomed.2021.105208

关键词

Feature selection; Gene expression profiles; Stability; Ensemble learning

资金

  1. Guangdong Basic and Applied Basic Research Foundation [2020A1515011499]
  2. Natural Science Foundation of China [62176082]

向作者/读者索取更多资源

In this study, an ensemble feature selection framework is proposed to improve the discrimination and stability of features. By using sampling and aggregation strategies, accurate feature selection is achieved in small sample and high dimensionality scenarios, leading to improved diagnostic accuracy and understanding of disease mechanisms.
Microarray technology facilitates the simultaneous measurement of expression of tens of thousands of genes and enables us to study cancers and tumors at the molecular level. Because microarray data are typically characterized by small sample size and high dimensionality, accurate and stable feature selection is thus of fundamental importance to the diagnostic accuracy and deep understanding of disease mechanism. Hence, we in this study present an ensemble feature selection framework to improve the discrimination and stability of finally selected features. Specifically, we utilize sampling techniques to obtain multiple sampled datasets, from each of which we use a base feature selector to select a subset of features. Afterwards, we develop two aggregation strategies to combine multiple feature subsets into one set. Finally, comparative experiments are conducted on four publicly available microarray datasets covering both binary and multi-class cases in terms of classification accuracy and three stability metrics. Results show that the proposed method obtains better stability scores and achieves comparable to and even better classification performance than its competitors.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据