Article

proGenomes3: approaching one million accurately and consistently annotated high-quality prokaryotic genomes

Related references

Note: Only some of the references are listed.
Article Biochemical Research Methods

metaSNV v2: detection of SNVs and subspecies in prokaryotic metagenomes

Thea Van Rossum et al.

Summary: This study introduces a method to identify and profile subspecies in metagenomes based on single nucleotide variant (SNV) patterns within species, extending existing SNV-calling software. These new features support microbiome analyses to link SNV profiles with host phenotype or environment and niche-specificity. Subspecies identification was demonstrated in marine and fecal metagenomes, with findings supporting a common subspecies population structure in the human gut microbiome and illustrating some limits in subspecies calling.

BIOINFORMATICS (2022)

Article Multidisciplinary Sciences

Towards the biogeography of prokaryotic genes

Luis Pedro Coelho et al.

Summary: The majority of microbial genes are specific to a single habitat; the small fraction found in multiple habitats is enriched in antibiotic-resistance genes and markers for mobile genetic elements. A small fraction of protein families contain the majority of genes, with most genetic variability observed within the families being neutral or nearly neutral.

NATURE (2022)

Article Biochemistry & Molecular Biology

Database resources of the national center for biotechnology information

Eric W. Sayers et al.

Summary: The National Center for Biotechnology Information (NCBI) produces a variety of online information resources for biology, including databases for nucleic acid sequences and life science journal citations. It provides search and retrieval operations for most of these data from 35 distinct databases, with E-utilities serving as the programming interface. Several resources received significant updates in the past year.

NUCLEIC ACIDS RESEARCH (2022)

Article Biochemistry & Molecular Biology

BacDive in 2022: the knowledge base for standardized bacterial and archaeal data

Lorenz Christian Reimer et al.

Summary: BacDive, the bacterial metadatabase, has become a leading database for standardized prokaryotic data on strain level, offering a wealth of information for bacterial and archaeal strains. The database has seen a 30% increase in data over the past three years, with new features such as a query builder tool and an interactive dashboard for statistical overview. Improved genomic sequence data and integration with other databases further enhance the usability and accessibility of the data.

NUCLEIC ACIDS RESEARCH (2022)

Article Biochemistry & Molecular Biology

GTDB: an ongoing census of bacterial and archaeal diversity through a phylogenetically consistent, rank normalized and complete genome-based taxonomy

Donovan H. Parks et al.

Summary: The Genome Taxonomy Database (GTDB) provides a phylogenetically consistent taxonomy for prokaryotic genomes sourced from the NCBI database. It includes a large number of bacterial and archaeal genomes, highlights the importance of metagenome-assembled genomes, and discusses improvements to the GTDB website and the procedure for updating species clusters.

NUCLEIC ACIDS RESEARCH (2022)

Article Biochemistry & Molecular Biology

Landscape of mobile genetic elements and their antibiotic resistance cargo in prokaryotic genomes

Supriya Khedkar et al.

Summary: Prokaryotic Mobile Genetic Elements (MGEs) play important roles in evolution and the spread of antibiotic resistance. However, current understanding of their global dispersal is limited. In this study, a computational framework was developed to capture different MGE types and their cargos, allowing for a better understanding of MGE dispersal. The findings were integrated into a resource, providing a valuable tool for future research on the mobile part of genomes and its horizontal dispersal.

NUCLEIC ACIDS RESEARCH (2022)

Article Multidisciplinary Sciences

Biosynthetic potential of the global ocean microbiome

Lucas Paoli et al.

Summary: Natural microbial communities are diverse and offer great potential for the discovery of enzymes and biochemical compounds. However, studying this diversity and assigning the synthesis of compounds to their hosts is challenging. In this study, we integrated microbial genomes from various sources and discovered thousands of new biosynthetic gene clusters, including in previously unsuspected phylogenetic groups. We identified a lineage rich in biosynthetic gene clusters and characterized the structures and enzymology of bioactive compounds. This research demonstrates the value of microbiomics-driven strategies in exploring previously undescribed enzymes and natural products.

NATURE (2022)

Article Biology

Ancestral reconstruction of duplicated signaling proteins reveals the evolution of signaling specificity

Isabel Nocedal et al.

Summary: Gene duplication is important for generating new signaling pathways during evolution. In this study, the researchers used ancestral sequence reconstruction to resurrect a bacterial two-component signaling system that duplicated in alpha-proteobacteria. They determined the interaction specificities of the signaling proteins before and after the duplication event and identified key mutations responsible for establishing specificity in the two systems. These findings suggest that protein-protein interactions with latent potential may be easily duplicated and diverged.

ELIFE (2022)

Article Biochemistry & Molecular Biology

Drivers and determinants of strain dynamics following fecal microbiota transplantation

Thomas S. B. Schmidt et al.

Summary: An analysis of fecal microbiota transplantation (FMT) found that recipient factors and donor-recipient complementarity were the main determinants of strain population dynamics. Applying an ecology-based framework can help develop more effective microbiome therapies and enhance donor microbiota colonization or displacement of recipient microbes in clinical practice.

NATURE MEDICINE (2022)

Article Biotechnology & Applied Microbiology

inStrain profiles population microdiversity from metagenomic data and sensitively detects shared microbial strains

Matthew R. Olm et al.

Summary: The program inStrain is used to study genetic diversity in microbial populations, particularly in fecal metagenomes of newborn premature infants. Results show that siblings share more microbial strains than unrelated infants do, and that infants born via cesarean section harbor bacteria with higher nucleotide diversity than vaginally delivered infants, potentially due to hospital acquisition. The tool can be applied to analyze microdiversity and compare strains in any metagenomic dataset.

NATURE BIOTECHNOLOGY (2021)

Article Biochemistry & Molecular Biology

eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale

Carlos P. Cantalapiedra et al.

Summary: The article introduces a major upgrade of the eggNOG-mapper tool, optimized for functional annotation of vast genomic and metagenomic datasets, including database updates, efficiency enhancements, and new features such as de novo gene prediction and fast protein domain discovery.

MOLECULAR BIOLOGY AND EVOLUTION (2021)

Article Biotechnology & Applied Microbiology

GUNC: detection of chimerism and contamination in prokaryotic genomes

Askarbek Orakov et al.

Summary: GUNC is a tool that accurately detects and quantifies genome chimerism based on lineage homogeneity of individual contigs, providing a fast and robust way to improve prokaryotic genome quality. It targets previously underdetected types of contamination and can substantially enhance genome quality.

GENOME BIOLOGY (2021)

Article Biochemistry & Molecular Biology

Pfam: The protein families database in 2021

Jaina Mistry et al.

Summary: The Pfam database has recently added a large number of protein families and domains, made revisions for COVID-19 research, and introduced Pfam-B as a supplement. These updates and improvements can help researchers classify protein sequences more effectively and conduct related studies.

NUCLEIC ACIDS RESEARCH (2021)

Article Biochemistry & Molecular Biology

Genomes OnLine Database (GOLD) v.8: overview and updates

Supratim Mukherjee et al.

Summary: The Genomes OnLine Database (GOLD) is a manually curated collection of genome projects and their metadata, with over 1.17 million entries. Users can browse, search, and input project details in GOLD, ensuring accurate metadata documentation for analysis. The database also imports projects from public repositories to maintain a reference dataset for the scientific community.

NUCLEIC ACIDS RESEARCH (2021)

Article Biochemistry & Molecular Biology

IMG/M v.5.0: an integrated data management and comparative analysis system for microbial genomes and microbiomes

I-Min A. Chen et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Multidisciplinary Sciences

Microbial abundance, activity and population genomic profiling with mOTUs2

Alessio Milanese et al.

NATURE COMMUNICATIONS (2019)

Article Biochemistry & Molecular Biology

VFDB 2019: a comparative pathogenomic platform with an interactive web interface

Bo Liu et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Biochemistry & Molecular Biology

dbCAN2: a meta server for automated carbohydrate-active enzyme annotation

Han Zhang et al.

NUCLEIC ACIDS RESEARCH (2018)

Article Biotechnology & Applied Microbiology

A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life

Donovan H. Parks et al.

NATURE BIOTECHNOLOGY (2018)

Article Biochemistry & Molecular Biology

proGenomes: a resource for consistent functional and taxonomic annotations of prokaryotic genomes

Daniel R. Mende et al.

NUCLEIC ACIDS RESEARCH (2017)

Article Multidisciplinary Sciences

A communal catalogue reveals Earth's multiscale microbial diversity

Luke R. Thompson et al.

NATURE (2017)

Article Mathematical & Computational Biology

Duplicates, redundancies and inconsistencies in the primary nucleotide databases: a descriptive study

Qingyu Chen et al.

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION (2017)

Article Biochemical Research Methods

MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis

João F. Matias Rodrigues et al.

BIOINFORMATICS (2017)

Article Biochemistry & Molecular Biology

Ensembl Genomes 2016: more genomes, more complexity

Paul Julian Kersey et al.

NUCLEIC ACIDS RESEARCH (2016)

Article Biochemistry & Molecular Biology

ETE 3: Reconstruction, Analysis, and Visualization of Phylogenomic Data

Jaime Huerta-Cepas et al.

MOLECULAR BIOLOGY AND EVOLUTION (2016)

Article Mathematical & Computational Biology

The environment ontology in 2016: bridging domains with increased scope, semantic density, and interoperation

Pier Luigi Buttigieg et al.

JOURNAL OF BIOMEDICAL SEMANTICS (2016)

Article Multidisciplinary Sciences

VSEARCH: a versatile open source tool for metagenomics

Torbjørn Rognes et al.

PEERJ (2016)

Article Microbiology

A new view of the tree of life

Laura A. Hug et al.

NATURE MICROBIOLOGY (2016)

Article Multidisciplinary Sciences

FAMSA: Fast and accurate multiple sequence alignment of huge protein families

Sebastian Deorowicz et al.

SCIENTIFIC REPORTS (2016)

Article Biotechnology & Applied Microbiology

Mash: fast genome and metagenome distance estimation using MinHash

Brian D. Ondov et al.

GENOME BIOLOGY (2016)

Article Biochemistry & Molecular Biology

CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes

Donovan H. Parks et al.

GENOME RESEARCH (2015)

Article Biochemistry & Molecular Biology

Update on RefSeq microbial genomes resources

Tatiana Tatusova et al.

NUCLEIC ACIDS RESEARCH (2015)

Article Biochemistry & Molecular Biology

PATRIC, the bacterial bioinformatics database and analysis resource

Alice R. Wattam et al.

NUCLEIC ACIDS RESEARCH (2014)

Article Biochemical Research Methods

Accurate and universal delineation of prokaryotic species

Daniel R. Mende et al.

NATURE METHODS (2013)

Article Biochemistry & Molecular Biology

ICEberg: a web-based resource for integrative and conjugative elements found in Bacteria

Dexi Bi et al.

NUCLEIC ACIDS RESEARCH (2012)

Article Biochemistry & Molecular Biology

ACLAME: A CLAssification of Mobile genetic Elements, update 2010

Raphael Leplae et al.

NUCLEIC ACIDS RESEARCH (2010)

Article Multidisciplinary Sciences

FastTree 2 - Approximately Maximum-Likelihood Trees for Large Alignments

Morgan N. Price et al.

PLOS ONE (2010)

Review Microbiology

Microbiology in the post-genomic era

Duccio Medini et al.

NATURE REVIEWS MICROBIOLOGY (2008)

Article Multidisciplinary Sciences

Genome-wide experimental determination of barriers to horizontal gene transfer

Rotem Sorek et al.

SCIENCE (2007)

Review Biology

Advanced sequencing technologies and their wider impact in microbiology

Neil Hall

JOURNAL OF EXPERIMENTAL BIOLOGY (2007)

Article Multidisciplinary Sciences

Toward automatic reconstruction of a highly resolved tree of life

FD Ciccarelli et al.

SCIENCE (2006)

Article Biochemistry & Molecular Biology

ISfinder: the reference centre for bacterial insertion sequences

P. Siguier et al.

NUCLEIC ACIDS RESEARCH (2006)