4.4 Article

CloudMap: A Cloud-Based Pipeline for Analysis of Mutant Genome Sequences

Journal

GENETICS
Volume 192, Issue 4, Pages 1249-+

Publisher

GENETICS SOCIETY AMERICA
DOI: 10.1534/genetics.112.144204

Keywords

-

Funding

  1. National Institutes of Health (NIH) [R01NS039996-05, R01NS050266-03]
  2. Howard Hughes Medical Institute
  3. F31 pre-doctoral fellowship [1F31NS074841-01]
  4. US National Institutes of Health [HG005133, HG004909, HG006620]
  5. US National Science Foundation [DBI 0850103]
  6. Huck Institutes for the Life Sciences at Penn State
  7. Institute for Cyberscience at Penn State
  8. Pennsylvania Department of Health

Ask authors/readers for more resources

Whole genome sequencing (WGS) allows researchers to pinpoint genetic differences between individuals and significantly shortcuts the costly and time-consuming part of forward genetic analysis in model organism systems. Currently, the most effort-intensive part of WGS is the bioinformatic analysis of the relatively short reads generated by second generation sequencing platforms. We describe here a novel, easily accessible and cloud-based pipeline, called CloudMap, which greatly simplifies the analysis of mutant genome sequences. Available on the Galaxy web platform, CloudMap requires no software installation when run on the cloud, but it can also be run locally or via Amazon's Elastic Compute Cloud (EC2) service. CloudMap uses a series of predefined workflows to pinpoint sequence variations in animal genomes, such as those of premutagenized and mutagenized Caenorhabditis elegans strains. In combination with a variant-based mapping procedure, CloudMap allows users to sharply define genetic map intervals graphically and to retrieve very short lists of candidate variants with a few simple clicks. Automated workflows and extensive video user guides are available to detail the individual analysis steps performed (http://usegalaxy.org/cloudmap). We demonstrate the utility of CloudMap for WGS analysis of C. elegans and Arabidopsis genomes and describe how other organisms (e. g., Zebrafish and Drosophila) can easily be accommodated by this software platform. To accommodate rapid analysis of many mutants from large-scale genetic screens, CloudMap contains an in silico complementation testing tool that allows users to rapidly identify instances where multiple alleles of the same gene are present in the mutant collection. Lastly, we describe the application of a novel mapping/WGS method (Variant Discovery Mapping) that does not rely on a defined polymorphic mapping strain, and we integrate the application of this method into CloudMap. CloudMap tools and documentation are continually updated at http://usegalaxy.org/cloudmap.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available