4.7 Article

Enrichment analysis on regulatory subspaces: A novel direction for the superior description of cellular responses to SARS-CoV-2

Journal

COMPUTERS IN BIOLOGY AND MEDICINE
Volume 146, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.compbiomed.2022.105443

Keywords

SARS-CoV-2; Biclustering; Discriminative regulatory patterns; Transcriptomics; Machine learning; Computational biology; COVID-19

Funding

  1. Fundacao para a Ciencia e Tecnologia (FCT) under IPOscore [DSAIPA/DS/0042/2018]
  2. LAETA [UIDB/50022/2020]
  3. ILU [DSAIPA/DS/0111/2018]
  4. LAQV [UIDB/50006/2020, UIDP/50006/2020]
  5. INESC-ID plurianual [UIDB/50021/2020]
  6. FCT distinction [CEECIND/01399/2017]
  7. Fundação para a Ciência e a Tecnologia [DSAIPA/DS/0042/2018] Funding Source: FCT

Ask authors/readers for more resources

This study compares the role of different machine learning methods in studying the regulatory processes of cells affected by the SARS-CoV-2 virus. The results show that pattern-based biclustering algorithms have better performance in functional enrichment analysis and can aid in knowledge extraction. Furthermore, the comparative analysis of the results identifies potential pathophysiological characteristics of COVID-19 and compares them with other relevant studies.
Statement: Enrichment analysis of cell transcriptional responses to SARS-CoV-2 infection from biclustering solutions yields broader coverage and superior enrichment of GO terms and KEGG pathways against alternative state-of-the-art machine learning solutions, thus aiding knowledge extraction.Motivation and methods: The comprehensive understanding of the impacts of SARS-CoV-2 virus on infected cells is still incomplete. This work aims at comparing the role of state-of-the-art machine learning approaches in the study of cell regulatory processes affected and induced by the SARS-CoV-2 virus using transcriptomic data from both infectable cell lines available in public databases and in vivo samples. In particular, we assess the relevance of clustering, biclustering and predictive modeling methods for functional enrichment. Statistical principles to handle scarcity of observations, high data dimensionality, and complex gene interactions are further discussed. In particular, and without loos of generalization ability, the proposed methods are applied to study the differential regulatory response of lung cell lines to SARS-CoV-2 (alpha-variant) against RSV, IAV (H1N1), and HPIV3 viruses.Results: Gathered results show that, although clustering and predictive algorithms aid classic stances to functional enrichment analysis, more recent pattern-based biclustering algorithms significantly improve the number and quality of enriched GO terms and KEGG pathways with controlled false positive risks. Additionally, a comparative analysis of these results is performed to identify potential pathophysiological characteristics of COVID-19. These are further compared to those identified by other authors for the same virus as well as related ones such as SARS-CoV-1. The findings are particularly relevant given the lack of other works utilizing more complex machine learning algorithms within this context.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available