4.8 Article

Ensembl 2022

Related references

Note: Only part of the references are listed.
Article Biochemistry & Molecular Biology

GENCODE 2021

Adam Frankish et al.

Summary: The GENCODE project annotates human and mouse genes and transcripts with high accuracy, using experimental data and bioinformatic tools, and continues to improve annotation infrastructure and tools for the human and mouse genomes. This includes manual annotation for the mouse reference genome, targeted improvements for SARS-CoV-2 related genes, collaborative projects for reference annotation databases, and the first GENCODE supervised automated annotation of lncRNAs.

NUCLEIC ACIDS RESEARCH (2021)

Article Genetics & Heredity

The Dfam community resource of transposable element families, sequence models, and genome annotations

Jessica Storer et al.

Summary: Dfam is an open access database that has evolved from a proof-of-principle collection of transposable element families in model organisms to a community resource for a broad range of species. The latest release includes 266,740 new de novo generated transposable element families from 336 species contributed by the EBI, demonstrating the utility of Dfam's new features and the long term challenges ahead for improving de novo generated transposable element datasets.

MOBILE DNA (2021)

Article Biochemical Research Methods

Effective gene expression prediction from sequence by integrating long-range interactions

Ziga Avsec et al.

Summary: Enformer leverages a new deep learning architecture to improve gene expression prediction accuracy based on DNA sequences, integrating information from long-range interactions in the genome and accurately predicting the impact of genetic variants on gene expression. Additionally, Enformer has learned to predict enhancer-promoter interactions directly from DNA sequences.

NATURE METHODS (2021)

Article Multidisciplinary Sciences

Towards complete and error-free genome assemblies of all vertebrate species

Arang Rhie et al.

Summary: The Vertebrate Genome Project and the international Genome 10K consortium have collaborated to generate high-quality genome assemblies for 16 species representing six major vertebrate lineages, leading to new biological discoveries. Long-read sequencing technologies are essential for maximizing genome quality, and addressing complex repeats and haplotype heterozygosity are crucial for reducing assembly errors and improving completeness of reference genomes. The lessons learned from this project have paved the way for the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all known vertebrate species.

NATURE (2021)

Article Biochemical Research Methods

Sensitive protein alignments at tree-of-life scale using DIAMOND

Benjamin Buchfink et al.

Summary: We are at the beginning of a genomic revolution where all known species are planned to be sequenced. The improved version of DIAMOND allows for quick tree-of-life scale protein alignments.

NATURE METHODS (2021)

Article Genetics & Heredity

A compendium of uniformly processed human gene expression and splicing quantitative trait loci

Nurlan Kerimov et al.

Summary: The study presents a resource of quality-controlled gene expression and splicing QTLs from 21 studies, demonstrating highly reproducible eQTL effect sizes for matching cell types and tissues. It also identified a greater diversity of cell-type-specific QTLs, some of which manifested as new disease co-localizations. The summary statistics provided are freely available for systematic interpretation of human GWAS associations across various cell types and tissues.

NATURE GENETICS (2021)

Article Biochemical Research Methods

ReFeaFi: Genome-wide prediction of regulatory elements driving transcription initiation

Ramzan Umarov et al.

Summary: Accurate identification of regulatory elements like promoters and enhancers is crucial for understanding gene expression patterns. While many attempts have been made to develop computational methods, reliable tools for analyzing long genomic sequences are still lacking. To address this issue, the authors propose a dynamic negative set updating scheme and use a two-model approach, achieving good performance at the genome level.

PLOS COMPUTATIONAL BIOLOGY (2021)

Article Biochemistry & Molecular Biology

Ensembl 2021

Kevin L. Howe et al.

Summary: The Ensembl project provides genome annotation and data dissemination services for vertebrate species, including detailed annotation of gene structures, regulatory elements, and variants, as well as inferring the evolutionary history of genes and genomes. They offer integrated genomic data through various means such as genome browsers, search interfaces, specialist tools, and download files. Recent developments include the Ensembl Rapid Release and the SARS-CoV-2 genome browser, aiding in the international scientific response to the COVID-19 pandemic.

NUCLEIC ACIDS RESEARCH (2021)

Article Biochemical Research Methods

SPDI: data model for variants and applications at NCBI

J. Bradley Holmes et al.

BIOINFORMATICS (2020)

Article Biochemistry & Molecular Biology

PDBe-KB: a community-driven resource for structural and functional annotations

Mihaly Varadi et al.

NUCLEIC ACIDS RESEARCH (2020)

Article Multidisciplinary Sciences

RepeatModeler2 for automated genomic discovery of transposable element families

Jullien M. Flynn et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2020)

Article Multidisciplinary Sciences

Progressive Cactus is a multiple-genome aligner for the thousand-genome era

Joel Armstrong et al.

NATURE (2020)

Article Biotechnology & Applied Microbiology

Transcriptome assembly from long-read RNA-seq alignments with StringTie2

Sam Kovaka et al.

GENOME BIOLOGY (2019)

Article Genetics & Heredity

Predicting the clinical impact of human mutation with deep neural networks

Laksshman Sundaram et al.

NATURE GENETICS (2018)

Article Multidisciplinary Sciences

Earth BioGenome Project: Sequencing life for the future of life

Harris A. Lewin et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2018)

Article Genetics & Heredity

ClinPred: Prediction Tool to Identify Disease-Relevant Nonsynonymous Single-Nucleotide Variants

Najmeh Alirezaie et al.

AMERICAN JOURNAL OF HUMAN GENETICS (2018)

Article Biotechnology & Applied Microbiology

Accurate assembly of transcripts through phase-preserving graph decomposition

Mingfu Shao et al.

NATURE BIOTECHNOLOGY (2017)

Article Biochemistry & Molecular Biology

The International Human Epigenome Consortium: A Blueprint for Scientific Collaboration and Discovery

Hendrik G. Stunnenberg et al.

Article Genetics & Heredity

HGVS Recommendations for the Description of Sequence Variants: 2016 Update

Johan T. den Dunnen et al.

HUMAN MUTATION (2016)

Article Biotechnology & Applied Microbiology

The Ensembl Variant Effect Predictor

William McLaren et al.

GENOME BIOLOGY (2016)

Article Biochemical Research Methods

Red: an intelligent, rapid, accurate tool for detecting repeats de-novo on the genomic scale

Hani Z. Girgis

BMC BIOINFORMATICS (2015)

Article Biochemical Research Methods

The Ensembl REST API: Ensembl Data for Any Language

Andrew Yates et al.

BIOINFORMATICS (2015)

Article Biochemistry & Molecular Biology

RefSeq: an update on mammalian reference sequences

Kim D. Pruitt et al.

NUCLEIC ACIDS RESEARCH (2014)

Article Biochemistry & Molecular Biology

ClinVar: public archive of relationships among sequence variation and human phenotype

Melissa J. Landrum et al.

NUCLEIC ACIDS RESEARCH (2014)

Article Biochemical Research Methods

STAR: ultrafast universal RNA-seq aligner

Alexander Dobin et al.

BIOINFORMATICS (2013)

Letter Biotechnology & Applied Microbiology

BLUEPRINT to decode the epigenetic signature written in blood

David Adams et al.

NATURE BIOTECHNOLOGY (2012)

News Item Multidisciplinary Sciences

GENOMICS ENCODE Project Writes Eulogy For Junk DNA

Elizabeth Pennisi

SCIENCE (2012)

Article Biochemistry & Molecular Biology

EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates

Albert J. Vilella et al.

GENOME RESEARCH (2009)

Article Biochemistry & Molecular Biology

dbSNP: the NCBI database of genetic variation

ST Sherry et al.

NUCLEIC ACIDS RESEARCH (2001)