4.2 Article

Identifying COVID-19-Specific Transcriptomic Biomarkers with Machine Learning Methods

期刊

BIOMED RESEARCH INTERNATIONAL
卷 2021, 期 -, 页码 -

出版社

HINDAWI LTD
DOI: 10.1155/2021/9939134

关键词

-

资金

  1. Strategic Priority Research Program of Chinese Academy of Sciences [XDB38050200]
  2. National Key R&D Program of China [2017YFC1201200, 2018YFC0910403]
  3. Shanghai Municipal Science and Technology Major Project [2017SHZDZX01]
  4. National Natural Science Foundation of China [31701151]
  5. Shanghai Sailing Program [16YF1413800]
  6. Youth Innovation Promotion Association of Chinese Academy of Sciences (CAS) [2016245]
  7. Fund of the Key Laboratory of Tissue Microenvironment and Tumor of Chinese Academy of Sciences [202002]

向作者/读者索取更多资源

COVID-19, caused by SARS-CoV-2, poses challenges in diagnosis due to presymptomatic and asymptomatic patients spreading the virus. New biomarkers are needed for screening and diagnosis, and machine learning models have been applied to identify host biomarkers associated with COVID-19 infection, improving the efficacy and accuracy of diagnosis.
COVID-19, a severe respiratory disease caused by a new type of coronavirus SARS-CoV-2, has been spreading all over the world. Patients infected with SARS-CoV-2 may have no pathogenic symptoms, i.e., presymptomatic patients and asymptomatic patients. Both patients could further spread the virus to other susceptible people, thereby making the control of COVID-19 difficult. The two major challenges for COVID-19 diagnosis at present are as follows: (1) patients could share similar symptoms with other respiratory infections, and (2) patients may not have any symptoms but could still spread the virus. Therefore, new biomarkers at different omics levels are required for the large-scale screening and diagnosis of COVID-19. Although some initial analyses could identify a group of candidate gene biomarkers for COVID-19, the previous work still could not identify biomarkers capable for clinical use in COVID-19, which requires disease-specific diagnosis compared with other multiple infectious diseases. As an extension of the previous study, optimized machine learning models were applied in the present study to identify some specific qualitative host biomarkers associated with COVID-19 infection on the basis of a publicly released transcriptomic dataset, which included healthy controls and patients with bacterial infection, influenza, COVID-19, and other kinds of coronavirus. This dataset was first analysed by Boruta, Max-Relevance and Min-Redundancy feature selection methods one by one, resulting in a feature list. This list was fed into the incremental feature selection method, incorporating one of the classification algorithms to extract essential biomarkers and build efficient classifiers and classification rules. The capacity of these findings to distinguish COVID-19 with other similar respiratory infectious diseases at the transcriptomic level was also validated, which may improve the efficacy and accuracy of COVID-19 diagnosis.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据