4.7 Article

Modeling the feasibility of whole genome shotgun sequencing using a pairwise end strategy

Journal

GENOMICS
Volume 68, Issue 3, Pages 237-246

Publisher

ACADEMIC PRESS INC
DOI: 10.1006/geno.2000.6303

Keywords

-

Funding

  1. NIGMS NIH HHS [GM57070] Funding Source: Medline

Ask authors/readers for more resources

In pairwise end sequencing, sequences are determined from both ends of random subclones derived from a DNA target. Sufficiently similar overlapping end sequences are identified and grouped into contigs. When a clone's paired end sequences fall in different contigs, the contigs are connected together to form scaffolds. Increasingly, the goals of pairwise strategies are large and highly repetitive genomic targets. Here, we consider large-scale pairwise strategies that employ mixtures of subclone sizes. We explore the properties of scaffold formation within a hybrid theory/simulation mathematical model of a genomic target that contains many repeat families. Using this model, we evaluate problems that may arise, such as falsely linked end sequences (due either to random matches or to homologous repeats) and scaffolds that terminate without extending the full length of the target. We illustrate our model with an exploration of a strategy for sequencing the human genome. Our results show that, for a strategy that generates 10-fold sequence coverage derived from the ends of clones ranging in length from 2 to 150 kb, using an appropriate rule for detecting over-laps, we expect few false links while obtaining a single scaffold extending the length of each chromosome. (C) 2000 Academic Press.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available