4.8 Article

A pangenome graph reference of 30 chicken genomes allows genotyping of large and complex structural variants

Related references

Note: Only part of the references are listed.
Article Cell Biology

A chromosome-level reference genome and pangenome for barn swallow population genomics

Simona Secomandi et al.

Summary: This study provides a chromosome-level reference genome and pangenome for the barn swallow, allowing for the identification of potentially conserved and accelerated genes and the inference of core and accessory genes. These resources will facilitate population genomics studies, aid in detecting candidate genes in comparative genomics studies, and reduce bias towards a single reference genome.

CELL REPORTS (2023)

Article Multidisciplinary Sciences

Pangenome obtained by long-read sequencing of 11 genomes reveal hidden functional structural variants in pigs

Yi-Fan Jiang et al.

Summary: This study utilizes long-read sequencing to construct a pig pangenome, revealing novel sequences and structural variants. Population-stratified structural variants are identified through analysis of additional short-read sequencing samples, and candidate genes potentially associated with high-altitude hypoxia adaptation are found within these structural variants.

ISCIENCE (2023)

Article Multidisciplinary Sciences

A draft human pangenome reference

Wen-Wei Liao et al.

NATURE (2023)

Article Biotechnology & Applied Microbiology

Pangenome graph construction from genome alignments with Minigraph-Cactus

Glenn Hickey et al.

Summary: Genome assemblies are used to directly construct genome graphs, which can represent various forms of genetic variation and improve analysis accuracy by overcoming single-reference bias.

NATURE BIOTECHNOLOGY (2023)

Article Biotechnology & Applied Microbiology

Telomere-to-telomere assembly of diploid chromosomes with Verkko

Mikko Rautiainen et al.

Summary: The Telomere-to-Telomere consortium has achieved the first complete sequence of a human genome. They used a combination of long Nanopore sequencing reads and a high-resolution assembly graph to resolve repeat sequences, and automated the process in their Verkko pipeline. The result is a phased, diploid assembly with many chromosomes assembled from end to end. This advance is crucial for constructing comprehensive pangenome databases and chromosome-scale comparative genomics.

NATURE BIOTECHNOLOGY (2023)

Article Multidisciplinary Sciences

Evolutionary analysis of a complete chicken genome

Zhen Huang et al.

Summary: Microchromosomes are prevalent in nonmammalian vertebrates, but some are missing in bird genome assemblies. This study presents a new chicken reference genome, including all autosomes and sex chromosomes, and provides detailed characterization of small microchromosomes (dot chromosomes) with unique sequence and epigenetic features, offering insights into the structure and evolution of chromosomes.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2023)

Article Multidisciplinary Sciences

The immune cell landscape and response of Marek's disease resistant and susceptible chickens infected with Marek's disease virus

Wesley C. Warren et al.

Summary: Researchers used single-cell RNA sequencing to analyze splenic cells from chickens infected with Marek's disease virus (MDV). They identified various immune cell types, with T cell subtypes being the most abundant and granulocytes showing the largest number of differentially expressed genes. This study provides valuable insights into the immune response to viral infection and facilitates further research on host immunity.

SCIENTIFIC REPORTS (2023)

Article Cell Biology

Fourth Report on Chicken Genes and Chromosomes 2022

Jacqueline Smith et al.

CYTOGENETIC AND GENOME RESEARCH (2023)

Article Biotechnology & Applied Microbiology

Graph construction method impacts variation representation and analyses in a bovine super-pangenome

Alexander S. Leonard et al.

Summary: Three methods, pggb, cactus, and minigraph, were used to construct multi-species super-pangenomes. These methods show good consensus in representing structural variations but differ in their analysis of private variations and variable number tandem repeats (VNTRs).

GENOME BIOLOGY (2023)

Article Biochemical Research Methods

Unbiased pangenome graphs

Erik Garrison et al.

Summary: This study presents the seqwish algorithm, which builds a pangenome variation graph from a set of sequences and their alignments. It transforms the alignment set into a tree-based representation and queries that representation to construct the graph. The method is scalable and has been successfully applied to build pangenome graphs for multiple species.

BIOINFORMATICS (2023)

Article Biotechnology & Applied Microbiology

Systematic benchmark of state-of-the-art variant calling pipelines identifies major factors affecting accuracy of coding sequence variant discovery

Yury A. Barbitoff et al.

Summary: This study systematically evaluated the performance of various read aligners and variant calling software, finding that DeepVariant performed the best in terms of accuracy and robustness. The study highlights the importance of regular benchmarking and the need for a more diverse set of gold standard genomes.

BMC GENOMICS (2022)

Article Biochemical Research Methods

ODGI: understanding pangenome graphs

Andrea Guarracino et al.

Summary: Pangenome graphs provide a complete representation of genomic diversity, but analyzing large-scale genome data using existing tools is challenging. Optimized Dynamic Genome/Graph Implementation (ODGI) is a new tool suite with efficient in-memory representation and support for various operations and visualization. Its parallel execution helps answer complex biological questions quickly.

BIOINFORMATICS (2022)

Article Biochemistry & Molecular Biology

De Novo Assembly of 20 Chicken Genomes Reveals the Undetectable Phenomenon for Thousands of Core Genes on Microchromosomes and Subtelomeric Regions

Ming Li et al.

Summary: The gene numbers and evolutionary rates of birds are lower than those of mammals, but birds have a large species number and morphological diversity. To understand avian evolution, it is necessary to study the complete avian genome. A chicken pan-genome was constructed from 20 de novo assembled genomes, revealing novel protein-coding genes and long noncoding RNAs not found in previous databases. These hidden genes were found to be shared by all chicken genomes, including many housekeeping genes, and were enriched in immune pathways. Comparative genomics showed that these novel genes have higher substitution rates than known genes, updating our knowledge about evolutionary rates in birds. This study provides a framework for constructing a better chicken genome, contributing to the understanding of avian evolution and improvement of poultry breeding.

MOLECULAR BIOLOGY AND EVOLUTION (2022)

Article Biotechnology & Applied Microbiology

Haplotype-resolved assembly of diploid genomes without parental data

Haoyu Cheng et al.

Summary: This paper presents an algorithm that combines PacBio HiFi reads and Hi-C chromatin interaction data to achieve haplotype-resolved genome assembly from single samples without the need for parent sequencing. The algorithm outperforms existing single-sample assembly pipelines and produces assemblies of similar quality to the best pedigree-based assemblies when applied to human and other vertebrate samples.

NATURE BIOTECHNOLOGY (2022)

Article Multidisciplinary Sciences

The complete sequence of a human genome

Sergey Nurk et al.

Summary: The Telomere-to-Telomere (T2T) Consortium has presented a complete sequence of a human genome, T2T-CHM13, which covers the whole genome except for the Y chromosome. This new sequence includes gapless assemblies, error corrections in previous references, and nearly 200 million base pairs of additional sequence with gene predictions, including protein coding genes. The completion of important regions allows for further studies on genetic variations and functions.

SCIENCE (2022)

Article Biochemical Research Methods

A spectrum of free software tools for processing the VCF variant call format: vcflib, bio-vcf, cyvcf2, hts-nim and slivar

Erik Garrison et al.

Summary: The authors present a range of free and open source software tools used in biomedical sequencing workflows for DNA/RNA variations. The importance of the variant call format (VCF) is highlighted, and the article discusses how to handle more complex variations.

PLOS COMPUTATIONAL BIOLOGY (2022)

Article Biochemical Research Methods

GBZ file format for pangenome graphs

Jouni Siren et al.

Summary: In this paper, a GBZ file format based on the data structures used in the Giraffe short-read aligner is proposed for storing pangenome graphs. The format achieves good compression and efficient loading into in-memory data structures. Tools and libraries for compression and decompression of GBZ graphs are provided and shown to be efficient on various systems.

BIOINFORMATICS (2022)

Article Biochemistry & Molecular Biology

Assembly of a pangenome for global cattle reveals missing sequences and novel structural variations, providing new insights into their diversity and evolutionary history

Yang Zhou et al.

Summary: A cattle pangenome representation was created based on the genome sequences of 898 cattle representing 57 breeds. The pangenome identified novel sequence and structural variants, providing valuable insights into the diversity and evolutionary history of cattle.

GENOME RESEARCH (2022)

Article Multidisciplinary Sciences

Deadly bird flu establishes a foothold in North America

Erik Stokstad

SCIENCE (2022)

Article Genetics & Heredity

Comprehensive analysis of structural variants in chickens using PacBio sequencing

Jinxin Zhang et al.

Summary: This study explored structural variants (SVs) in chickens using PacBio technology and detected a high number of SVs compared to Illumina short-read technology. It was found that during chicken domestication, beneficial SVs were retained while deleterious SVs were eliminated. This study contributes to our understanding of genetic characteristics and genomic diversity in chickens.

FRONTIERS IN GENETICS (2022)

Article Biotechnology & Applied Microbiology

Widespread false gene gains caused by duplication errors in genome assemblies

Byung June Ko et al.

Summary: This study quantifies false duplications in previous genome assemblies for platypus, zebra finch, and Anna's Hummingbird, and highlights the need for more advanced assembly methods and cautious gene gain analysis. The main source of false duplications is heterotype duplications, while a minor source is sequencing errors. The study emphasizes the importance of accurately separating haplotypes and sequence errors in genome assemblies.

GENOME BIOLOGY (2022)

Article Biochemical Research Methods

Fast gap-affine pairwise alignment using the wavefront algorithm

Santiago Marco-Sola et al.

Summary: In this article, a wavefront alignment algorithm (WFA) is proposed as an efficient method for accelerating sequence alignment, significantly outperforming traditional algorithms in terms of speed and memory usage. Experimental results demonstrate that WFA is 20-300x faster than other methods, capable of aligning different types of sequences.

BIOINFORMATICS (2021)

Article Biology

Telomere-to-telomere gapless chromosomes of banana using nanopore sequencing

Caroline Belser et al.

Summary: A chromosome-scale assembly of a banana genome (Musa acuminata) was reported using Oxford Nanopore long-reads, with five out of the eleven chromosomes entirely reconstructed in a single contig from telomere to telomere, revealing the content of complex regions like centromeres or clusters of paralogous genes for the first time.

COMMUNICATIONS BIOLOGY (2021)

Article Agriculture, Dairy & Animal Science

Identification of new genes and quantitative trait locis associated with growth curve parameters in F2 chicken population using genome-wide association study

R. Seifi Moroudi et al.

Summary: This study identified a set of biomarkers associated with growth curve parameters in crossbred chickens through GWAS, revealing important genes related to chicken growth and meat quality, as well as other genes associated with body weight, average daily gain and growth QTL. These findings shed light on the genetic mechanism of growth factors in broiler chickens and provide insights for developing management practices and accelerating genetic progress in breeding programs.

ANIMAL GENETICS (2021)

Article Agriculture, Dairy & Animal Science

Assessing the effects of rare alleles and linkage disequilibrium on estimates of genetic diversity in the chicken populations

N. Dementieva et al.

Summary: Phenotypic diversity in poultry is mainly influenced by artificial selection and genetic drift. This study assessed genetic diversity within and between Russian chicken breeds and populations, revealing genetic differences among populations. LD pruning is crucial for studying genetic diversity, with the Russian White RG population showing a significant impact due to its large sample size.

ANIMAL (2021)

Article Biochemical Research Methods

Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm

Haoyu Cheng et al.

Summary: hifiasm is a novel assembler that utilizes long high-fidelity sequence reads to accurately represent haplotype information, outperforming existing tools in haplotype-resolved assembly on various datasets, including a hexaploid genome dataset.

NATURE METHODS (2021)

Article Multidisciplinary Sciences

Multithreaded variant calling in elPrep 5

Charlotte Herzeel et al.

Summary: elPrep 5 updates the elPrep framework for processing sequencing alignment/map files with variant calling, significantly reducing runtime by parallelizing and merging the execution of the pipeline steps. It can be a suitable replacement for GATK4 when faster execution times are needed.

PLOS ONE (2021)

Article Multidisciplinary Sciences

Pangenomics enables genotyping of known structural variants in 5202 diverse genomes

Jouni Siren et al.

Summary: Giraffe is a pangenome short-read mapper that efficiently maps to a collection of haplotypes threaded through a sequence graph. It speeds up mapping to thousands of human genomes and enables improved accuracy in genome-wide genotyping, ultimately enhancing genomic analyses. This tool facilitates a more comprehensive characterization of variation and has the potential to benefit various genomic studies.

SCIENCE (2021)

Article Agriculture, Dairy & Animal Science

The impact of endogenous Avian Leukosis Viruses (ALVE) on production traits in elite layer lines

Janet E. Fulton et al.

Summary: Recent studies have shown that the presence of Avian Leukosis Virus subgroup E (ALVE) in the chicken genome has negative impacts on production traits, such as egg production and egg quality. Specific ALVE inserts were found to be associated with commercially relevant performance traits in elite commercial egg production lines, suggesting a potential link between ALVE presence and chicken production traits. This association may be due to the effect of the virus, direct gene alterations caused by insertional mutagenesis, or the integration site's proximity to quantitative trait regions impacting performance traits.

POULTRY SCIENCE (2021)

Article Biology

Twelve years of SAMtools and BCFtools

Petr Danecek et al.

Summary: SAMtools and BCFtools are widely used tools for processing high-throughput sequencing data, with a history of 12 years of continuous development and improvement. These packages have been utilized in various software projects and genomic pipelines and are freely available on GitHub.

GIGASCIENCE (2021)

Article Genetics & Heredity

Expectations and blind spots for structural variation detection from long-read assemblies and short-read genome sequencing technologies

Xuefang Zhao et al.

Summary: Short-read whole-genome sequencing (srWGS) is widely used in large-scale genomics initiatives but faces challenges in detecting structural variants (SVs), which emerging long-read WGS (lrWGS) technologies can help overcome. The detection power and precision for SV discovery vary significantly by genomic context and variant class.

AMERICAN JOURNAL OF HUMAN GENETICS (2021)

Article Biotechnology & Applied Microbiology

Telomere-to-telomere assembly of the genome of an individual Oikopleura dioica from Okinawa using Nanopore-based sequencing

Aleksandra Bliznina et al.

Summary: This study presents a chromosome-scale genome assembly of the larvacean Oikopleura dioica using a hybrid approach of multiple sequencing technologies and chromosome conformation information. The assembly revealed complete autosomes as well as proposed sex chromosomes, allowing for cross-genome comparisons and studies of chromosomal evolution in this lineage.

BMC GENOMICS (2021)

Article Immunology

Dual Host and Pathogen RNA-Seq Analysis Unravels Chicken Genes Potentially Involved in Resistance to Highly Pathogenic Avian Influenza Virus Infection

Albert Perlas et al.

Summary: The study aimed to evaluate the mechanisms of disease resistance to HPAIV in chickens of different breeds. RNA-Seq results showed minor transcriptomic changes in resistant chickens and significant alterations in susceptible chickens, with some genes related to NF-κB and mitogen-activated protein kinase signaling pathways. The early inactivation of important host genes could prevent an exaggerated immune response and/or viral replication, conferring resistance to HPAIV in chickens.

FRONTIERS IN IMMUNOLOGY (2021)

Article Multidisciplinary Sciences

Towards complete and error-free genome assemblies of all vertebrate species

Arang Rhie et al.

Summary: The Vertebrate Genomes Project (VGP) and the international Genome 10K consortium have collaborated to generate high-quality genome assemblies for 16 species representing six major vertebrate lineages, leading to new biological discoveries. Long-read sequencing technologies are essential for maximizing genome quality, and addressing complex repeats and haplotype heterozygosity is crucial for reducing assembly errors and improving completeness of reference genomes. The lessons learned from this work have been incorporated into the VGP, an international effort to generate high-quality, complete reference genomes for all known vertebrate species.

NATURE (2021)

Article Biochemistry & Molecular Biology

The Chicken Pan-Genome Reveals Gene Content Variation and a Promoter Region Deletion in IGF2BP1 Affecting Body Size

Kejun Wang et al.

Summary: The study of chicken pan-genome has revealed the evolutionary processes of genome construction, showing the factors influencing gene expression levels and gene presence/absence variations. Through PAV-based genome-wide association studies, multiple candidate mutations related to growth, meat quality, etc. have been identified.

MOLECULAR BIOLOGY AND EVOLUTION (2021)

Review Genetics & Heredity

Towards population-scale long-read sequencing

Wouter De Coster et al.

Summary: Long-read sequencing technologies have advanced to the point where they can be applied to variant detection at a population scale. New computational tools have led to the emergence of population-scale studies in the past two years, with many more expected in the future. The review covers recent developments, challenges, experimental design guidance, as well as strategies for variant validation and genotyping.

NATURE REVIEWS GENETICS (2021)

Article Biotechnology & Applied Microbiology

Telomere-to-telomere assembly of a fish Y chromosome reveals the origin of a young sex chromosome pair

Lingzhan Xue et al.

Summary: This study of the haplotype-resolved genome assembly of the zig-zag eel sheds light on the evolution of sex chromosomes and recombination suppression mechanisms, revealing a similar sex-linked region (SLR) on the X and Y chromosomes and identifying a potential sex-determining gene within it.

GENOME BIOLOGY (2021)

Article Biochemical Research Methods

Identifying and removing haplotypic duplication in primary genome assemblies

Dengfeng Guan et al.

BIOINFORMATICS (2020)

Review Genetics & Heredity

Pangenome Graphs

Jordan M. Eizenga et al.

ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS (2020)

Article Agriculture, Dairy & Animal Science

Whole-genome resequencing of Dulong Chicken reveal signatures of selection

Q. Wang et al.

BRITISH POULTRY SCIENCE (2020)

Article Biotechnology & Applied Microbiology

The design and construction of reference pangenome graphs with minigraph

Heng Li et al.

GENOME BIOLOGY (2020)

Article Agriculture, Dairy & Animal Science

Diversity of endogenous avian leukosis virus subgroup E (ALVE) insertions in indigenous chickens

Andrew S. Mason et al.

GENETICS SELECTION EVOLUTION (2020)

Article Biotechnology & Applied Microbiology

Genotyping structural variants in pangenome graphs using the vg toolkit

Glenn Hickey et al.

GENOME BIOLOGY (2020)

Article Biotechnology & Applied Microbiology

Population genomics identifies patterns of genetic diversity and selection in chicken

Diyan Li et al.

BMC GENOMICS (2019)

Article Biotechnology & Applied Microbiology

The SYNBREED chicken diversity panel: a global resource to assess chicken diversity at high genomic resolution

Dorcus Kholofelo Malomane et al.

BMC GENOMICS (2019)

Article Biotechnology & Applied Microbiology

A new chicken 55K SNP genotyping array

Ranran Liu et al.

BMC GENOMICS (2019)

Article Biochemistry & Molecular Biology

Characterizing the Major Structural Variant Alleles of the Human Genome

Peter A. Audano et al.

Review Biotechnology & Applied Microbiology

Structural variant calling: the long and the short of it

Medhat Mahmoud et al.

GENOME BIOLOGY (2019)

Article Biotechnology & Applied Microbiology

Paragraph: a graph-based structural variant genotyper for short-read sequence data

Sai Chen et al.

GENOME BIOLOGY (2019)

Article Agriculture, Dairy & Animal Science

Endogenous viral gene ev21 is not responsible for the expression of late feathering in chickens

A. Takenouchi et al.

POULTRY SCIENCE (2018)

Article Biochemical Research Methods

Minimap2: pairwise alignment for nucleotide sequences

Heng Li

BIOINFORMATICS (2018)

Article Agriculture, Dairy & Animal Science

Analysis of a genetic factors contributing to feathering phenotype in chickens

Xiuling Zhang et al.

POULTRY SCIENCE (2018)

Article Biochemical Research Methods

A fast adaptive algorithm for computing whole-genome homology maps

Chirag Jain et al.

BIOINFORMATICS (2018)

Article Biotechnology & Applied Microbiology

Variation graph toolkit improves read mapping by representing genetic variation in the reference

Erik Garrison et al.

NATURE BIOTECHNOLOGY (2018)

Article Biotechnology & Applied Microbiology

A universal SNP and small-indel variant caller using deep neural networks

Ryan Poplin et al.

NATURE BIOTECHNOLOGY (2018)

Article Multidisciplinary Sciences

Systematic evaluation of error rates and causes in short samples in next-generation sequencing

Franziska Pfeiffer et al.

SCIENTIFIC REPORTS (2018)

Article Agriculture, Dairy & Animal Science

Genetic assessment of inbred chicken lines indicates genomic signatures of resistance to Marek's disease

Lingyang Xu et al.

JOURNAL OF ANIMAL SCIENCE AND BIOTECHNOLOGY (2018)

Article Agriculture, Dairy & Animal Science

Identifying the genetic basis for resistance to avian influenza in commercial egg layer chickens

W. Drobik-Czwarno et al.

ANIMAL (2018)

Article Biotechnology & Applied Microbiology

Scaffolding of long read assemblies using long range contact information

Jay Ghurye et al.

BMC GENOMICS (2017)

Article Biochemical Research Methods

gEVAL-a web-based browser for evaluating genome assemblies

William Chow et al.

BIOINFORMATICS (2016)

Article Agriculture, Dairy & Animal Science

Copy number variation identification and analysis of the chicken genome using a 60K SNP BeadChip

Y. S. Rao et al.

POULTRY SCIENCE (2016)

Article Biochemical Research Methods

Bandage: interactive visualization of de novo genome assemblies

Ryan R. Wick et al.

BIOINFORMATICS (2015)

Article Genetics & Heredity

Mapping Bias Overestimates Reference Allele Frequencies at the HLA Genes in the 1000 Genomes Project Phase I Data

Debora Y. C. Brandt et al.

G3-GENES GENOMES GENETICS (2015)

Article Biochemical Research Methods

DELLY: structural variant discovery by integrated paired-end and split-read analysis

Tobias Rausch et al.

BIOINFORMATICS (2012)

Review Genetics & Heredity

Genotype and SNP calling from next-generation sequencing data

Rasmus Nielsen et al.

NATURE REVIEWS GENETICS (2011)

Article Biotechnology & Applied Microbiology

Partial duplication of the PRLR and SPEF2 genes at the late feathering locus in chicken

Martin G. Elferink et al.

BMC GENOMICS (2008)

Review Virology

The discovery of endogenous retroviruses

Robin A. Weiss

RETROVIROLOGY (2006)