Journal
BIOINFORMATICS
Volume 26, Issue 12, Pages 1488-1492Publisher
OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btq167
Keywords
-
Categories
Funding
- National Institute of Allergy and Infectious Diseases [NIH-N01-AI-30071]
- National Institutes of Health [NIH-N01-AI-30071]
Ask authors/readers for more resources
Motivation: The growth of sequence data has been accompanied by an increasing need to analyze data on distributed computer clusters. The use of these systems for routine analysis requires scalable and robust software for data management of large datasets. Software is also needed to simplify data management and make large-scale bioinformatics analysis accessible and reproducible to a wide class of target users. Results: We have developed a workflow management system named Ergatis that enables users to build, execute and monitor pipelines for computational analysis of genomics data. Ergatis contains preconfigured components and template pipelines for a number of common bioinformatics tasks such as prokaryotic genome annotation and genome comparisons. Outputs from many of these components can be loaded into a Chado relational database. Ergatis was designed to be accessible to a broad class of users and provides a user friendly, web-based interface. Ergatis supports high-throughput batch processing on distributed compute clusters and has been used for data management in a number of genome annotation and comparative genomics projects.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available