Article

A large-scale phylogeny-guided analysis of pseudogenes in Pseudomonas aeruginosa bacterium

MICROBIOLOGY SPECTRUM (2023)

Journal

MICROBIOLOGY SPECTRUM

Volume 11, Issue 5

Publisher

AMER SOC MICROBIOLOGY

DOI: 10.1128/spectrum.01704-23

Keywords

pseudogenes; phylogenetics; bacteria; Pseudomonas aeruginosa; comparative genomics

Category

Microbiology

Smart summary

This study analyzed genomic data from Pseudomonas aeruginosa strains and found correlations between the number of pseudogenes and other genomic features. It identified clusters of orthologous genes and pseudogenes and examined their phylogenetic relationships. The findings can inform future improvements to pseudogene annotation pipelines.

Abstract

Pseudogenes, once considered junk DNA based on the incorrect assumption that the absence of full coding potential means a complete lack of functionality, have recently become a subject of significant interest in the scientific community. Concurrently, it is widely assumed that bacterial genomes are compact and have a high density of coding genes with little room for non-coding genes, including pseudogenes. A key aspect of genome annotation is the correct identification of genes and the distinction between coding genes and pseudogenes, as it directly impacts functional and comparative genomics studies. In this study, we analyzed the genomic data of 4,699 strains of the bacterium Pseudomonas aeruginosa (P. aeruginosa), which exhibit high variability in the number of annotated pseudogenes. In particular, we looked for correlations between the number of pseudogenes and other genomic and meta-features of the strains. We identified clusters of orthologous genes and pseudogenes and compared cluster size distributions and length homogeneity within clusters. We then mapped and examined orthology relationships between genes and pseudogenes. Additionally, we generated a phylogenetic tree of the strains and found that phylogenetically related strains are more homogeneous in the number of pseudogenes and share a significant number of pseudogenes. Finally, we examined clusters of orthologous genes and pseudogenes in detail and quantified their phylogenetic neighborhood, classifying pseudogenes as evolutionarily preserved pseudogenes, mis-annotated pseudogenes, or pseudogenes formed by failed horizontal transfer events. This in-depth study provides important insights that can be incorporated into pseudogene annotation pipelines in the future.

IMPORTANCE Accurate annotation of genes and pseudogenes is vital for comparative genomics analysis. Recent studies have shown that bacterial pseudogenes have an important role in regulatory processes and can provide insight into the evolutionary history of homologous genes or the genome as a whole. Due to pseudogenes' nature as non-functional genes, there is no commonly accepted definition of a pseudogene, which poses difficulties in verifying the annotation through experimental methods and resolving discrepancies among different annotation techniques. Our study presents an in-depth analysis of annotated genes and pseudogenes and offers insights that can be incorporated into improved pseudogene annotation pipelines in the future.
