4.6 Review

Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections

期刊

FRONTIERS IN GENETICS
卷 12, 期 -, 页码 -

出版社

FRONTIERS MEDIA SA
DOI: 10.3389/fgene.2021.667936

关键词

temporal proteomics; spatial proteomics; host-pathogen interactions; clustering; principal component analysis; self-organizing maps; data imputation; normalization

资金

  1. CIHR Postdoctoral Fellowship
  2. Natural Sciences and Engineering Research Council [DG-20234]

向作者/读者索取更多资源

Microbial pathogens have evolved diverse strategies to manipulate host systems, causing diseases through changes in the host-pathogen proteome. Mass spectrometry-based proteomics approaches are utilized to study disease progression but present challenges in data analysis. The study reviews steps in temporal and spatial analysis, offering best practices for data preprocessing, statistical analysis, and biological information extraction. Guidance is provided for novices and established users, with future directions and data analysis codes available for testing.
Microbial pathogens have evolved numerous mechanisms to hijack host's systems, thus causing disease. This is mediated by alterations in the combined host-pathogen proteome in time and space. Mass spectrometry-based proteomics approaches have been developed and tailored to map disease progression. The result is complex multidimensional data that pose numerous analytic challenges for downstream interpretation. However, a systematic review of approaches for the downstream analysis of such data has been lacking in the field. In this review, we detail the steps of a typical temporal and spatial analysis, including data pre-processing steps (i.e., quality control, data normalization, the imputation of missing values, and dimensionality reduction), different statistical and machine learning approaches, validation, interpretation, and the extraction of biological information from mass spectrometry data. We also discuss current best practices for these steps based on a collection of independent studies to guide users in selecting the most suitable strategies for their dataset and analysis objectives. Moreover, we also compiled the list of commonly used R software packages for each step of the analysis. These could be easily integrated into one's analysis pipeline. Furthermore, we guide readers through various analysis steps by applying these workflows to mock and host-pathogen interaction data from public datasets. The workflows presented in this review will serve as an introduction for data analysis novices, while also helping established users update their data analysis pipelines. We conclude the review by discussing future directions and developments in temporal and spatial proteomics and data analysis approaches. Data analysis codes, prepared for this review are available from https://github.com/BabuLab-UofR/TempSpac, where guidelines and sample datasets are also offered for testing purposes.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据