4.7 Review

Graph-based pan-genomes: increased opportunities in plant genomics

Related references

Note: Only part of the references are listed.
Article Multidisciplinary Sciences

AnchorWave: Sensitive alignment of genomes with high sequence diversity, extensive structural and polymorphism, and whole-genome duplication

Baoxing Song et al.

Summary: This study introduces a genome alignment method called AnchorWave, which shows significant improvement when applied to species with complex genomes. It can accurately identify multikilobase indels and improve the recall rate of transcription factor-binding sites.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2022)

Review Plant Sciences

Twenty years of plant genome sequencing: achievements and challenges

Yanqing Sun et al.

Summary: The publication of plant genome sequences has advanced research in plant biology and provided crucial information for crop breeding.

TRENDS IN PLANT SCIENCE (2022)

Article Biochemical Research Methods

ODGI: understanding pangenome graphs

Andrea Guarracino et al.

Summary: Pangenome graphs provide a complete representation of genomic diversity, but analyzing large-scale genome data using existing tools is challenging. Optimized Dynamic Genome/Graph Implementation (ODGI) is a new tool suite with efficient in-memory representation and support for various operations and visualization. Its parallel execution helps answer complex biological questions quickly.

BIOINFORMATICS (2022)

Article Plant Sciences

Wheat Panache: A pangenome graph database representing presence-absence variation across sixteen bread wheat genomes

Philipp E. Bayer et al.

Summary: Bread wheat is a vital crop with a complex genome, making genomic approaches for improvement challenging. To address this, a wheat pangenome graph database has been developed for visualization and comparison of wheat genomes.

PLANT GENOME (2022)

Article Multidisciplinary Sciences

Graph pangenome captures missing heritability and empowers tomato breeding

Yao Zhou et al.

Summary: Constructing a graph pangenome of tomato can improve the estimation of heritability for complex traits, identify more causal structural variants, and facilitate the identification of genetic factors underlying agronomically important traits. This study advances our understanding of the heritability of complex traits and demonstrates the power of the graph pangenome in crop breeding.

NATURE (2022)

Article Biochemistry & Molecular Biology

Exploring transposable element-based markers to identify allelic variations underlying agronomic traits in rice

Haidong Yan et al.

Summary: Transposable elements (TEs) are a major force in the production of new alleles during domestication, but their complexity has limited their use in association studies. A TE genotyping pipeline (TEmarker) was developed and applied to a whole-genome GWAS study on Oryza sativa subsp. japonica to identify genetic elements associated with agronomic traits. TE markers were able to recover a large proportion of SNP-based GWAS peaks and reduce false positives associated with SNP marker linkage disequilibrium. The study also revealed positive selection on TEs associated with agronomic traits and identified insertion and deletion variants potentially resulting from the direct action of TEs.

PLANT COMMUNICATIONS (2022)

Article Biochemical Research Methods

New strategies to improve minimap2 alignment accuracy

Heng Li

Summary: The latest version of minimap2 v2.22 has made improvements in accurately mapping long reads to highly repetitive regions and in allowing alignments through insertions or deletions up to 100 kb by default, addressing major weaknesses in earlier versions.

BIOINFORMATICS (2021)

Article Biochemical Research Methods

Panache: a web browser-based viewer for linearized pangenomes

Eloi Durant et al.

Summary: Pangenomics has evolved to study all sequences available, with linear representations being a suitable option for user-friendliness. Panache is a tool for visualizing gene-based and sequence-based pangenomes, using a linear layout similar to genome browsers.

BIOINFORMATICS (2021)

Review Genetics & Heredity

Building pan-genome infrastructures for crop plants and their use in association genetics

Murukarthick Jayakodi et al.

Summary: Pan-genomic studies aim to represent the entire sequence diversity within a species for evolutionary studies, functional genomics, and breeding of cultivated plants. Advances in sequencing technology and assembly algorithms have made it possible to create multiple reference genomes and catalogue genetic variations in plant species with large and complex genomes.

DNA RESEARCH (2021)

Article Multidisciplinary Sciences

Evolutionary history and pan-genome dynamics of strawberry (Fragaria spp.)

Qin Qiao et al.

Summary: Strawberry has become a model system for various research, with genomes of five different species sequenced in recent years. A study reported chromosome-scale reference genomes for five strawberry species, along with genome resequencing data for 128 additional accessions, providing valuable resources for future studies. The research also identified a new diploid species and explored the evolutionary dynamics of gene families, pointing out mutations associated with white pigmented fruit in different strawberry species.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2021)

Article Genetics & Heredity

A quantitative genomics map of rice provides genetic insights and guides breeding

Xin Wei et al.

Summary: The study emphasizes the importance of genetic diversity in rice breeding by providing a comprehensive map of rice quantitative trait nucleotides (QTNs) and their effects. Population genetic analyses reveal associations between QTN allele frequency changes and domestication, local adaptation, and heterosis. The development of a genome navigation system, RiceNavi, offers an efficient platform for rice breeding.

NATURE GENETICS (2021)

Article Multidisciplinary Sciences

The genetic and epigenetic landscape of the Arabidopsis centromeres

Matthew Naish et al.

Summary: The study used long-read sequencing to assemble the Arabidopsis genome and resolve all five centromeres. It found that centromeres consist of megabase-scale tandemly repeated satellite arrays and are invaded by retrotransposons. The centromeres are evolving through cycles of satellite homogenization and retrotransposon-driven diversification.

SCIENCE (2021)

Article Multidisciplinary Sciences

Pangenomics enables genotyping of known structural variants in 5202 diverse genomes

Jouni Siren et al.

Summary: Giraffe is a pangenome short-read mapper that efficiently maps to a collection of haplotypes threaded through a sequence graph. It speeds up mapping to thousands of human genomes and enables improved accuracy in genome-wide genotyping, ultimately enhancing genomic analyses. This tool facilitates a more comprehensive characterization of variation and has the potential to benefit various genomic studies.

SCIENCE (2021)

Article Plant Sciences

K-mer counting and curated libraries drive efficient annotation of repeats in plant genomes

Bruno Contreras-Moreira et al.

Summary: Annotation of repetitive sequences within plant genomes is crucial for interpretation of observed phenotypes, and k-mer strategies for mask-based annotation are significantly faster than homology-based methods. Testing a two-step approach for repeat annotation showed promising results, with shorter repeated elements compared to direct sequence homology.

PLANT GENOME (2021)

Review Plant Sciences

Plant Pan-Genomics Comes of Age

Li Lei et al.

Summary: A plant pan-genome is typically larger than the genome of any individual plant, with many genes present in only some genomes. Constructing and interpreting plant pan-genomes is challenging due to the large size and repetitive content of plant genomes. Focus is often on nontransposable element protein coding genes, but noncoding and repetitive DNA also play important roles in genome evolution and phenotype determination.

ANNUAL REVIEW OF PLANT BIOLOGY, VOL 72, 2021 (2021)

Article Genetics & Heredity

A high-quality genome assembly highlights rye genomic characteristics and agronomically important genes

Guangwei Li et al.

Summary: The study sequenced the genome of an elite Chinese rye variety, Weining rye, and found that the assembled contigs accounted for 98.47% of the estimated genome size. Further analysis revealed insights into gene duplications and their effects within rye.

NATURE GENETICS (2021)

Article Biochemistry & Molecular Biology

Pan-genome analysis of 33 genetically diverse rice accessions reveals hidden genomic variations

Peng Qin et al.

Summary: The study assembled high-quality genomes of genetically diverse rice accessions, discovering a large number of structural variations and gene copy number variations. By using a graph-based genome, the research explored the impact of these variations on rice domestication and environmental adaptation.
Article Biochemistry & Molecular Biology

Two gap-free reference genomes and a global view of the centromere architecture in rice

Jia-Ming Song et al.

Summary: This study reported the assembly and analysis of gap-free reference genome sequences for two elite Oryza sativa rice varieties, providing a global view of the structure and function of rice centromeres. The research found that all rice centromeric regions share conserved centromere-specific satellite motifs, with a higher similarity of CentO repeats within the same chromosome.

MOLECULAR PLANT (2021)

Article Biochemistry & Molecular Biology

Pan-genome of Raphanus highlights genetic variation and introgression among domesticated, wild, and weedy radishes

Xiaohui Zhang et al.

Summary: The study presents de novo sequence assemblies for various sub-species and varieties of radishes from different regions, revealing genetic variations, gene flow, and genomic diversity. The findings shed light on the genomic evolution and provide insights for genetic improvement of radish crops.

MOLECULAR PLANT (2021)

Review Biotechnology & Applied Microbiology

Current status of structural variation studies in plants

Yuxuan Yuan et al.

Summary: Structural variations (SVs) are common features in plant genomes and play important roles in the phenotypic diversity of plants. With the advancement of genomic technologies, we are now able to better understand and assess SVs with higher resolution. Plant breeders can leverage this research to improve varieties.

PLANT BIOTECHNOLOGY JOURNAL (2021)

Article Genetics & Heredity

PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations

Gaoyang Li et al.

Summary: The utilization of pan-genome provides a promising framework to address the limitations of short read alignments and improve the accuracy and sensitivity of SV calling. PanSVR, as a pan-genome-based SV calling approach, effectively identifies SVs especially in repeat-rich regions and novel insertions, surpassing existing tools in sensitivity.

FRONTIERS IN GENETICS (2021)

Article Plant Sciences

The Taxus genome provides insights into paclitaxel biosynthesis

Xingyao Xiong et al.

Summary: This study completed the chromosome-level genome sequencing of Taxus, revealing its genome evolution and the biosynthesis pathways of paclitaxel and taxadiene. A unique physical and functional grouping of CYP725As for paclitaxel biosynthesis was discovered in the Taxus genome, along with a gene cluster mainly formed by gene duplications for taxadiene biosynthesis.

NATURE PLANTS (2021)

Review Agronomy

Oryza pan-genomics: A new foundation for future rice research and improvement

Chao Huang et al.

Summary: This review summarizes the progress in research on genetic and genomic resources in the genus Oryza, as well as sequencing and computational technologies used for rice genome and pan-genome construction. Future work will focus on the methods and challenges in constructing Oryza pan-genomes, providing a basis for exploring genetic diversity.

CROP JOURNAL (2021)

Article Plant Sciences

Extensive variation within the pan-genome of cultivated and wild sorghum

Yongfu Tao et al.

Summary: A study on the sorghum pan-genome sheds light on the genetic diversity and gene content variation within the sorghum primary gene pool. The research suggests that dispensable genes play a crucial role in sorghum adaptation, and reveals that genetic variation is influenced by recombination rate and transposable element content.

NATURE PLANTS (2021)

Article Biotechnology & Applied Microbiology

Impacts of allopolyploidization and structural variation on intraspecific diversification in Brassica rapa

Xu Cai et al.

Summary: Despite the prevalence and recurrence of polyploidization in flowering plants, its impacts on crop intraspecific genome diversification remain largely unknown. This study investigates the consequences of whole-genome triplication in Brassica rapa and uncovers the potential contributions of allopolyploidization on intraspecific diversification. Additionally, the role of structural variations in favorable trait domestication is explored.

GENOME BIOLOGY (2021)

Review Biotechnology & Applied Microbiology

How the pan-genome is changing crop genomics and improvement

Rafael Della Coletta et al.

Summary: Crop genomics has advanced significantly in recent years thanks to improvements in sequencing technology, assembly methods, and computational resources. The study of structural variation within species and the characterization of pan-genomes have revealed extensive genome content variation among individuals within a species, which is revolutionizing crop genomics and improvement.

GENOME BIOLOGY (2021)

Article Genetics & Heredity

Assessing the regulatory potential of transposable elements using chromatin accessibility profiles of maize transposons

Jaclyn M. Noshay et al.

Summary: Transposable elements (TEs) can create regulatory variation in maize genomes by disrupting existing DNA regulatory elements and creating novel ones, potentially leading to changes in gene expression. The presence of TEs interspersed with genes in maize genomes offers opportunities for significant allelic variation. Some TEs carrying accessible chromatin regions (ACRs) have been found to be associated with higher expression of nearby genes, suggesting a role in providing novel regulatory elements. These findings highlight the potential for a subset of TEs to rewire transcriptional responses in eukaryotic genomes.

GENETICS (2021)

Review Genetics & Heredity

Pan-genomics in the human genome era

Rachel M. Sherman et al.

NATURE REVIEWS GENETICS (2020)

Review Genetics & Heredity

Pangenomics Comes of Age: From Bacteria to Plant and Animal Applications

Agnieszka A. Golicz et al.

TRENDS IN GENETICS (2020)

Review Genetics & Heredity

Pangenome Graphs

Jordan M. Eizenga et al.

ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, VOL 21, 2020 (2020)

Article Multidisciplinary Sciences

A platinum standard pan-genome resource that represents the population structure of Asian rice

Yong Zhou et al.

SCIENTIFIC DATA (2020)

Article Biochemistry & Molecular Biology

Pan-Genome of Wild and Cultivated Soybeans

Yucheng Liu et al.

Review Plant Sciences

Pan-genome: A promising resource for noncoding RNA discovery in plants

Muhammad Tahir ul Qamar et al.

PLANT GENOME (2020)

Article Biochemical Research Methods

SVIM-asm: structural variant detection from haploid and diploid genome assemblies

David Heller et al.

BIOINFORMATICS (2020)

Article Multidisciplinary Sciences

The barley pan-genome reveals the hidden legacy of mutation breeding

Murukarthick Jayakodi et al.

NATURE (2020)

Article Multidisciplinary Sciences

Progressive Cactus is a multiple-genome aligner for the thousand-genome era

Joel Armstrong et al.

NATURE (2020)

Article Biotechnology & Applied Microbiology

The design and construction of reference pangenome graphs with minigraph

Heng Li et al.

GENOME BIOLOGY (2020)

Article Biotechnology & Applied Microbiology

GraphAligner: rapid and versatile sequence-to-graph alignment

Mikko Rautiainen et al.

GENOME BIOLOGY (2020)

Article Biotechnology & Applied Microbiology

Long-read-based human genomic structural variation detection with cuteSV

Tao Jiang et al.

GENOME BIOLOGY (2020)

Review Plant Sciences

Plant pan-genomes are the new reference

Philipp E. Bayer et al.

NATURE PLANTS (2020)

Article Biotechnology & Applied Microbiology

Genotyping structural variants in pangenome graphs using the vg toolkit

Glenn Hickey et al.

GENOME BIOLOGY (2020)

Article Biochemical Research Methods

Sequence tube maps: making graph genomes intuitive to commuters

Wolfgang Beyer et al.

BIOINFORMATICS (2019)

Editorial Material Biotechnology & Applied Microbiology

Goodbye reference, hello genome graphs

Adam Ameur

NATURE BIOTECHNOLOGY (2019)

Article Biotechnology & Applied Microbiology

Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype

Daehwan Kim et al.

NATURE BIOTECHNOLOGY (2019)

Article Multidisciplinary Sciences

Structural variants exhibit widespread allelic heterogeneity and shape variation in complex traits

Mahul Chakraborty et al.

NATURE COMMUNICATIONS (2019)

Article Biochemical Research Methods

GfaViz: flexible and interactive visualization of GFA sequence graphs

Giorgio Gonnella et al.

BIOINFORMATICS (2019)

Review Biochemistry & Molecular Biology

Exploring and Exploiting Pan-genomics for Crop Improvement

Yongfu Tao et al.

MOLECULAR PLANT (2019)

Article Genetics & Heredity

Fast and accurate genomic analyses using genome graphs

Goran Rakocevic et al.

NATURE GENETICS (2019)

Review Agronomy

Prospects of pan-genomics in barley

Cecile Monat et al.

THEORETICAL AND APPLIED GENETICS (2019)

Article Biotechnology & Applied Microbiology

SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies

Manish Goel et al.

GENOME BIOLOGY (2019)

Article Biotechnology & Applied Microbiology

Paragraph: a graph-based structural variant genotyper for short-read sequence data

Sai Chen et al.

GENOME BIOLOGY (2019)

Review Genetics & Heredity

Genome Sequencing and Assembly by Long Reads in Plants

Changsheng Li et al.

GENES (2018)

Article Biochemical Research Methods

Minimap2: pairwise alignment for nucleotide sequences

Heng Li

BIOINFORMATICS (2018)

Review Genetics & Heredity

Contribution of transposable elements in the plant's genome

Mahbod Sahebi et al.

Article Biochemical Research Methods

Accurate detection of complex structural variations using single-molecule sequencing

Fritz J. Sedlazeck et al.

NATURE METHODS (2018)

Article Multidisciplinary Sciences

High contiguity Arabidopsis thaliana genome assembly with a single nanopore flow cell

Todd P. Michael et al.

NATURE COMMUNICATIONS (2018)

Article Biochemical Research Methods

MUMmer4: A fast and versatile genome alignment system

Guillaume Marcais et al.

PLOS COMPUTATIONAL BIOLOGY (2018)

Article Multidisciplinary Sciences

Genome sequence of the progenitor of wheat A subgenome Triticum urartu

Hong-Qing Ling et al.

NATURE (2018)

Article Biotechnology & Applied Microbiology

Variation graph toolkit improves read mapping by representing genetic variation in the reference

Erik Garrison et al.

NATURE BIOTECHNOLOGY (2018)

Article Plant Sciences

Bias in resistance gene prediction due to repeat masking

Philipp E. Bayer et al.

NATURE PLANTS (2018)

Article Genetics & Heredity

The impact of structural variation on human gene expression

Colby Chiang et al.

NATURE GENETICS (2017)

Article Biochemistry & Molecular Biology

Engineering Quantitative Trait Variation for Crop Improvement by Genome Editing

Daniel Rodriguez-Leal et al.

Article Biochemical Research Methods

Coordinates and intervals in graph-based reference genomes

Knut D. Rand et al.

BMC BIOINFORMATICS (2017)

Article Biochemical Research Methods

Graphical pan-genome analysis with compressed suffix trees and the Burrows-Wheeler transform

Uwe Baier et al.

BIOINFORMATICS (2016)

Article Biochemical Research Methods

PanTools: representation, storage and exploration of pan-genomic data

Siavash Sheikhizadeh et al.

BIOINFORMATICS (2016)

Article Biochemical Research Methods

Assemblytics: a web analytics tool for the detection of variants from an assembly

Maria Nattestad et al.

BIOINFORMATICS (2016)

Article Biochemical Research Methods

Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences

Heng Li

BIOINFORMATICS (2016)

Article Multidisciplinary Sciences

The pangenome of an agronomically important crop plant Brassica oleracea

Agnieszka A. Golicz et al.

NATURE COMMUNICATIONS (2016)

Review Genetics & Heredity

Oxford Nanopore MinION Sequencing and Genome Assembly

Hengyun Lu et al.

GENOMICS PROTEOMICS & BIOINFORMATICS (2016)

Article Biochemical Research Methods

Bandage: interactive visualization ofde novogenome assemblies: Fig. 1.

Ryan R. Wick et al.

BIOINFORMATICS (2015)

Article Genetics & Heredity

Transposable Elements Contribute to Activation of Maize Genes in Response to Abiotic Stress

Irina Makarevitch et al.

PLOS GENETICS (2015)

Article Biochemical Research Methods

SplitMEM: a graphical algorithm for pan-genome analysis with suffix skips

Shoshana Marcus et al.

BIOINFORMATICS (2014)

Review Plant Sciences

Structural variation and genome complexity: is dispensable really dispensable?

Fabio Marroni et al.

CURRENT OPINION IN PLANT BIOLOGY (2014)

Article Biochemical Research Methods

The variant call format and VCFtools

Petr Danecek et al.

BIOINFORMATICS (2011)

Article Biochemistry & Molecular Biology

A window into third-generation sequencing

Eric E. Schadt et al.

HUMAN MOLECULAR GENETICS (2010)

Article Multidisciplinary Sciences

The Sorghum bicolor genome and the diversification of grasses

Andrew H. Paterson et al.

NATURE (2009)

Article Multidisciplinary Sciences

Real-Time DNA Sequencing from Single Polymerase Molecules

John Eid et al.

SCIENCE (2009)

Article Multidisciplinary Sciences

The B73 Maize Genome: Complexity, Diversity, and Dynamics

Patrick S. Schnable et al.

SCIENCE (2009)

Review Plant Sciences

Transposable elements and the plant pan-genomes

Michele Morgante et al.

CURRENT OPINION IN PLANT BIOLOGY (2007)

Article Multidisciplinary Sciences

Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae:: Implications for the microbial pan-genome

H Tettelin et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2005)