4.5 Article

Bakta: rapid and standardized annotation of bacterial genomes via alignment-free sequence identification

Related references

Note: Only a selection of the references is listed; download the original article for the complete reference list.
Article Microbiology

Automated Prediction and Annotation of Small Open Reading Frames in Microbial Genomes

Matthew G. Durrant et al.

Summary: Research reveals that small open reading frames (smORFs) and their encoded microproteins play central roles in microbes, with a vast unexplored space of smORFs within human-associated microbes. The introduction of SmORFinder combines profile hidden Markov models and deep learning models to predict small protein families enriched for Ribo-seq translation signals. Deep learning models are shown to identify Shine-Dalgarno sequences, deprioritize the wobble position in each codon, and group codon synonyms.

CELL HOST & MICROBE (2021)

Article Biochemistry & Molecular Biology

RefSeq: expanding the Prokaryotic Genome Annotation Pipeline reach with protein family model curation

Wenjun Li et al.

Summary: The RefSeq project at NCBI contains a vast number of bacterial and archaeal genomes and proteins, with a focus on reducing spurious annotation through the use of expanded protein family models. The Protein Family Models Entrez database provides users with access to the PFMs, supporting multi-genome analyses and connections to the literature. The reference and representative genome set of prokaryotic genomes within RefSeq is regularly recalculated and available for download and BLAST searches.

NUCLEIC ACIDS RESEARCH (2021)

Article Biochemistry & Molecular Biology

The Gene Ontology resource: enriching a GOld mine

Seth Carbon et al.

Summary: The Gene Ontology Consortium has made advancements in the last two years, such as improving the GO-CAM annotation framework, increasing the number of annotations and annotated gene products, and reviewing older annotations for consistency.

NUCLEIC ACIDS RESEARCH (2021)

Article Biochemistry & Molecular Biology

UniProt: the universal protein knowledgebase in 2021

Alex Bateman et al.

Summary: The UniProt Knowledgebase aims to provide users with a comprehensive, high-quality set of protein sequences annotated with functional information. Updates over the past two years have increased the number of sequences to approximately 190 million, with new methods to assess proteome completeness and quality. UniProtKB has responded to the COVID-19 pandemic by expertly curating relevant entries and making them rapidly available through a dedicated portal.

NUCLEIC ACIDS RESEARCH (2021)

Article Biochemistry & Molecular Biology

Rfam 14: expanded coverage of metagenomic, viral and microRNA families

Ioanna Kalvari et al.

Summary: Rfam is a database of 3444 RNA families, each represented by a multiple sequence alignment of known RNA sequences and a covariance model for searching additional members. Recent developments focused on improving data quality and coverage, adding new families such as microRNAs and viral and bacterial RNAs through expert collaborations. The database grew significantly, with 759 new families added in Rfam 14, along with new features such as the Rfam Cloud family curation system.

NUCLEIC ACIDS RESEARCH (2021)

Review Biochemical Research Methods

Genome annotation of disease-causing microorganisms

Yibo Dong et al.

Summary: Humans have coexisted with pathogenic microorganisms, and annotating the genomes of these microorganisms has become a challenging task. This paper summarizes the methods and tools for genome annotation of pathogenic microorganisms, conducts real-world comparisons, and discusses current challenges and issues.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Microbiology

An integrated gene catalog and over 10,000 metagenome-assembled genomes from the gastrointestinal microbiome of ruminants

Fei Xie et al.

Summary: This study utilized shotgun metagenomics to profile the microbiota of 370 samples representing 10 GIT regions of seven ruminant species, reconstructing a GIT microbial reference catalog with over 154 million nonredundant genes and identifying 8745 uncultured candidate species. The newly characterized genomes substantially expanded the genomic landscape of ruminant microbiota and provided insights into methane production and feed efficiency in ruminants.

MICROBIOME (2021)

Article Genetics & Heredity

Post-weaning shifts in microbiome composition and metabolism revealed by over 25000 pig gut metagenome-assembled genomes

Daniela Gaio et al.

Summary: By analyzing time-series samples of the pig gut microbiome, it was found that a highly structured developmental program exists in piglet gut microbial communities following weaning, which is robust to interventions. Specific taxonomic 'signatures' and the carbohydrate repertoire of organisms resident in the porcine gut were identified, providing insights for the design of probiotics and prebiotic interventions to modify the piglet gut microbiome.

MICROBIAL GENOMICS (2021)

Article Biochemistry & Molecular Biology

COG database update: focus on microbial diversity, model organisms, and widespread pathogens

Michael Y. Galperin et al.

Summary: The COG database, created in 1997 and previously updated in 2014, includes extensive information on bacterial and archaeal genomes. The current version introduces new features and outlines plans for future expansion and refinement of annotations.

NUCLEIC ACIDS RESEARCH (2021)

Article Biotechnology & Applied Microbiology

A genomic catalog of Earth's microbiomes

Stephen Nayfach et al.

Summary: Reconstructing bacterial and archaeal genomes from shotgun metagenomes has led to the creation of a comprehensive catalog representing a significant expansion of the known phylogenetic diversity of bacteria and archaea. This resource is available for streamlined comparative analyses, interactive exploration, metabolic modeling, and bulk download, demonstrating the utility of genome-centric approaches for understanding genomic properties of uncultivated microorganisms.

NATURE BIOTECHNOLOGY (2021)

Article Genetics & Heredity

An assessment of genome annotation coverage across the bacterial tree of life

Briallen Lobb et al.

MICROBIAL GENOMICS (2020)

Review Biochemistry & Molecular Biology

Accurate and complete genomes from metagenomes

Lin-Xing Chen et al.

GENOME RESEARCH (2020)

Article Biochemical Research Methods

TORMES: an automated pipeline for whole bacterial genome analysis

Narciso M. Quijada et al.

BIOINFORMATICS (2019)

Article Biochemistry & Molecular Biology

VFDB 2019: a comparative pathogenomic platform with an interactive web interface

Bo Liu et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Biochemistry & Molecular Biology

The European Nucleotide Archive in 2018

Peter W. Harrison et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Biochemistry & Molecular Biology

The Pfam protein families database in 2019

Sara El-Gebali et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Biochemistry & Molecular Biology

RefSeq: an update on prokaryotic genome annotation and curation

Daniel H. Haft et al.

NUCLEIC ACIDS RESEARCH (2018)

Article Biochemical Research Methods

fastp: an ultra-fast all-in-one FASTQ preprocessor

Shifu Chen et al.

BIOINFORMATICS (2018)

Article Biochemical Research Methods

DFAST: a flexible prokaryotic genome annotation pipeline for faster genome publication

Yasuhiro Tanizawa et al.

BIOINFORMATICS (2018)

Article Biochemical Research Methods

Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads

Ryan R. Wick et al.

PLOS COMPUTATIONAL BIOLOGY (2017)

Article Multidisciplinary Sciences

Comment: The FAIR Guiding Principles for scientific data management and stewardship

Mark D. Wilkinson et al.

SCIENTIFIC DATA (2016)

Article Biochemistry & Molecular Biology

CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes

Donovan H. Parks et al.

GENOME RESEARCH (2015)

Article Biochemical Research Methods

Fast and sensitive protein alignment using DIAMOND

Benjamin Buchfink et al.

NATURE METHODS (2015)

Review Biochemistry & Molecular Biology

Small Proteins Can No Longer Be Ignored

Gisela Storz et al.

ANNUAL REVIEW OF BIOCHEMISTRY, VOL 83 (2014)

Article Biochemical Research Methods

Prokka: rapid prokaryotic genome annotation

Torsten Seemann

BIOINFORMATICS (2014)

Review Microbiology

Phenol-soluble modulins - critical determinants of staphylococcal virulence

Gordon Y. C. Cheung et al.

FEMS MICROBIOLOGY REVIEWS (2014)

Article Biochemical Research Methods

Infernal 1.1: 100-fold faster RNA homology searches

Eric P. Nawrocki et al.

BIOINFORMATICS (2013)

Article Biotechnology & Applied Microbiology

Improving prokaryotic transposable elements identification using a combination of de novo and profile HMM methods

Choumouss Kamoun et al.

BMC GENOMICS (2013)

Article Biochemistry & Molecular Biology

Small proteins link coat and cortex assembly during sporulation in Bacillus subtilis

Sarah E. Ebmeier et al.

MOLECULAR MICROBIOLOGY (2012)

Article Biochemistry & Molecular Biology

ExPASy: SIB bioinformatics resource portal

Panu Artimo et al.

NUCLEIC ACIDS RESEARCH (2012)

Article Mathematical & Computational Biology

AntiFam: a tool to help identify spurious ORFs in protein annotation

Ruth Y. Eberhardt et al.

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION (2012)

Article Biochemical Research Methods

Accelerated Profile HMM Searches

Sean R. Eddy

PLOS COMPUTATIONAL BIOLOGY (2011)

Article Biochemical Research Methods

Prodigal: prokaryotic gene recognition and translation initiation site identification

Doug Hyatt et al.

BMC BIOINFORMATICS (2010)

Article Biochemical Research Methods

Biopython: freely available Python tools for computational molecular biology and bioinformatics

Peter J. A. Cock et al.

BIOINFORMATICS (2009)

Article Biochemical Research Methods

BLAST+: architecture and applications

Christiam Camacho et al.

BMC BIOINFORMATICS (2009)

Article Biotechnology & Applied Microbiology

The RAST server: Rapid annotations using subsystems technology

Ramy K. Aziz et al.

BMC GENOMICS (2008)

Article Biochemical Research Methods

PILER-CR: Fast and accurate identification of CRISPR repeats

Robert C. Edgar

BMC BIOINFORMATICS (2007)

Article Biochemistry & Molecular Biology

BASys: a web server for automated bacterial genome annotation

GH Van Domselaar et al.

NUCLEIC ACIDS RESEARCH (2005)

Article Biochemistry & Molecular Biology

ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences

D Laslett et al.

NUCLEIC ACIDS RESEARCH (2004)

Article Biochemistry & Molecular Biology

GenDB: an open source genome annotation system for prokaryote genomes

F Meyer et al.

NUCLEIC ACIDS RESEARCH (2003)

Article Cell Biology

Identification of novel small RNAs using comparative genomics and microarrays

KM Wassarman et al.

GENES & DEVELOPMENT (2001)