4.6 Article

Identifying Lung Cancer Cell Markers with Machine Learning Methods and Single-Cell RNA-Seq Data

期刊

LIFE-BASEL
卷 11, 期 9, 页码 -

出版社

MDPI
DOI: 10.3390/life11090940

关键词

lung cancer; random forest; decision tree; feature selection; cell biomarker; quantitative rules

资金

  1. Strategic Priority Research Program of Chinese Academy of Sciences [XDB38050200]
  2. National Key R&D Program of China [2018YFC0910403, 2017YFC1201200]
  3. Shanghai Municipal Science and Technology Major Project [2017SHZDZX01]
  4. National Natural Science Foundation of China [31701151]
  5. Shanghai Sailing Program [16YF1413800]
  6. Youth Innovation Promotion Association of Chinese Academy of Sciences (CAS) [2016245]
  7. Fund of the Key Laboratory of Tissue Microenvironment and Tumor of Chinese Academy of Sciences [202002]

向作者/读者索取更多资源

The study proposed a computational method to distinguish cell subtypes from different pathological regions of non-small cell lung cancer based on transcriptomic profiles. The random forest classifier and decision tree achieved high Matthew's correlation coefficients in this task.
Non-small cell lung cancer is a major lethal subtype of epithelial lung cancer, with high morbidity and mortality. The single-cell sequencing technique plays a key role in exploring the pathogenesis of non-small cell lung cancer. We proposed a computational method for distinguishing cell subtypes from the different pathological regions of non-small cell lung cancer on the basis of transcriptomic profiles, including a group of qualitative classification criteria (biomarkers) and various rules. The random forest classifier reached a Matthew's correlation coefficient (MCC) of 0.922 by using 720 features, and the decision tree reached an MCC of 0.786 by using 1880 features. The obtained biomarkers and rules were analyzed in the end of this study.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据