4.8 Article

Reproducible, scalable, and shareable analysis pipelines with bioinformatics workflow managers

Related references

Note: Only some of the references are listed; download the original article for the full reference information.
Review Biochemical Research Methods

Current RNA-seq methodology reporting limits reproducibility

Joel Simoneau et al.

Summary: Current standard practice in RNA-seq studies often omits necessary methodological information, which can undermine reproducibility. This work emphasizes the importance of standardized, explicit reporting of methodological information in RNA-seq experiments.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Computer Science, Software Engineering

Toward a domain-specific language for scientific workflow-based applications on multicloud system

Gennaro Cordasco et al.

Summary: The article introduces a domain-specific language, Fly, which aims to provide a powerful, effective, and pricing-efficient tool for developing scalable workflow-based scientific applications by adopting a multicloud strategy and utilizing different FaaS cloud providers as computational backends.

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE (2021)

Article Biochemical Research Methods

PoSeiDon: a Nextflow pipeline for the detection of evolutionary recombination events and positive selection

Martin Hoelzer et al.

Summary: PoSeiDon is an easy-to-use pipeline that helps researchers find recombination events and sites under positive selection in protein-coding sequences. The tool builds an alignment, estimates a best-fitting substitution model, performs recombination analysis, and detects positively selected sites according to different models, with results summarized in a user-friendly HTML page.

BIOINFORMATICS (2021)

Review Biology

Streamlining data-intensive biology with workflow systems

Taylor Reiter et al.

Summary: With the increasing scale of biological data generation, the bottleneck of research has shifted from data generation to analysis. Data-centric workflow systems are reshaping the landscape of biological data analysis, empowering researchers to conduct reproducible analyses at scale, but knowledge of these techniques is still lacking.

GIGASCIENCE (2021)

Editorial Material Biochemistry & Molecular Biology

Reproducibility in systems biology modelling

Krishna Tiwari et al.

Summary: Reproducibility of scientific results is crucial to the credibility of science, and its absence in many fields is a major concern. The article evaluates the reproducibility of mathematical models and suggests a scorecard for improving reproducibility in this field.

MOLECULAR SYSTEMS BIOLOGY (2021)

Article Biochemical Research Methods

Using prototyping to choose a bioinformatics workflow management system

Michael Jackson et al.

Summary: Data analysis involves multiple steps, and workflow management systems can help scientists process data more efficiently while improving reproducibility and portability. The authors describe selecting a suitable workflow management system for their project through prototyping, and present prototyping as a cost-effective way to make that decision.

PLOS COMPUTATIONAL BIOLOGY (2021)

Article Computer Science, Software Engineering

Why reinventing the wheels? An empirical study on library reuse and re-implementation

Bowen Xu et al.

EMPIRICAL SOFTWARE ENGINEERING (2020)

Article Biotechnology & Applied Microbiology

Butler enables rapid cloud-based analysis of thousands of human genomes

Sergei Yakneen et al.

NATURE BIOTECHNOLOGY (2020)

Letter Biotechnology & Applied Microbiology

The nf-core framework for community-curated bioinformatics pipelines

Philip A. Ewels et al.

NATURE BIOTECHNOLOGY (2020)

Editorial Material Biochemical Research Methods

Bench pressing with genomics benchmarkers

Vivien Marx

NATURE METHODS (2020)

Review Biochemical Research Methods

Scalable Data Analysis in Proteomics and Metabolomics Using BioContainers and Workflows Engines

Yasset Perez-Riverol et al.

PROTEOMICS (2020)

Article Biochemical Research Methods

Tximeta: Reference sequence checksums for provenance identification in RNA-seq

Michael I. Love et al.

PLOS COMPUTATIONAL BIOLOGY (2020)

Article Biochemical Research Methods

Better together: Elements of successful scientific software development in a distributed collaborative community

Julia Koehler Leman et al.

PLOS COMPUTATIONAL BIOLOGY (2020)

Article Biochemical Research Methods

Seven quick tips for analysis scripts in neuroimaging

Marijn van Vliet

PLOS COMPUTATIONAL BIOLOGY (2020)

Article Multidisciplinary Sciences

Variability in the analysis of a single neuroimaging dataset by many teams

Rotem Botvinik-Nezer et al.

NATURE (2020)

Article Biochemistry & Molecular Biology

The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2020 update

Vahid Jalili et al.

NUCLEIC ACIDS RESEARCH (2020)

Article Biochemical Research Methods

ATLAS: a Snakemake workflow for assembly, annotation, and genomic binning of metagenome sequence data

Silas Kieser et al.

BMC BIOINFORMATICS (2020)

Article Computer Science, Information Systems

Scalability and cost-effectiveness analysis of whole genome-wide association studies on Google Cloud Platform and Amazon Web Services

Ines Krissaane et al.

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION (2020)

Article Biochemistry & Molecular Biology

Ten recommendations for supporting open pathogen genomic analysis in public health

Allison Black et al.

NATURE MEDICINE (2020)

Article Computer Science, Information Systems

FAIR Computational Workflows

Carole Goble et al.

DATA INTELLIGENCE (2020)

Article Genetics & Heredity

Factorial study of the RNA-seq computational workflow identifies biases as technical gene signatures

Joel Simoneau et al.

NAR GENOMICS AND BIOINFORMATICS (2020)

Article Medical Laboratory Technology

Practical estimation of cloud storage costs for clinical genomic data

Niklas Krumm et al.

PRACTICAL LABORATORY MEDICINE (2020)

Article Computer Science, Software Engineering

An empirical comparison of dependency network evolution in seven software packaging ecosystems

Alexandre Decan et al.

EMPIRICAL SOFTWARE ENGINEERING (2019)

Article Biochemical Research Methods

Container-based bioinformatics with Pachyderm

Jon Ander Novella et al.

BIOINFORMATICS (2019)

Article Biochemical Research Methods

snakePipes: facilitating flexible, scalable and integrative epigenomic analysis

Vivek Bhardwaj et al.

BIOINFORMATICS (2019)

Editorial Material Cell Biology

In silico analysis of RNA-seq requires a more complete description of methodology

Joel Simoneau et al.

NATURE REVIEWS MOLECULAR CELL BIOLOGY (2019)

Editorial Material Multidisciplinary Sciences

Tips for open-source software support

Anna Nowogrodzki

NATURE (2019)

Editorial Material Multidisciplinary Sciences

That's the way we flow

Jeffrey M. Perkel

NATURE (2019)

Review Biotechnology & Applied Microbiology

Essential guidelines for computational method benchmarking

Lukas M. Weber et al.

GENOME BIOLOGY (2019)

Article Biotechnology & Applied Microbiology

Challenges in funding and developing genomic software: roots and remedies

Adam Siepel

GENOME BIOLOGY (2019)

Article Computer Science, Theory & Methods

Programming models and systems for Big Data analysis

Loris Belcastro et al.

INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS (2019)

Article Biochemical Research Methods

VIPER: Visualization Pipeline for RNA-seq, a Snakemake workflow for efficient and complete RNA-seq analysis

MacIntosh Cornwell et al.

BMC BIOINFORMATICS (2018)

Letter Biochemistry & Molecular Biology

LncPipe: A Nextflow-based pipeline for identification and analysis of long non-coding RNAs from RNA-Seq data

Qi Zhao et al.

JOURNAL OF GENETICS AND GENOMICS (2018)

Letter Biochemical Research Methods

Bioconda: sustainable and comprehensive software distribution for the life sciences

Bjoern Gruening et al.

NATURE METHODS (2018)

Article Genetics & Heredity

Cloud computing for genomic data analysis and collaboration

Ben Langmead et al.

NATURE REVIEWS GENETICS (2018)

Article Biochemistry & Molecular Biology

The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update

Enis Afgan et al.

NUCLEIC ACIDS RESEARCH (2018)

Article Multidisciplinary Sciences

An empirical analysis of journal policy effectiveness for computational reproducibility

Victoria Stodden et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2018)

Editorial Material Biochemistry & Molecular Biology

Practical Computational Reproducibility in the Life Sciences

Bjoern Gruening et al.

CELL SYSTEMS (2018)

Article Biochemistry & Molecular Biology

Community-Driven Data Analysis Training for Biology

Berenice Batut et al.

CELL SYSTEMS (2018)

Article Biochemical Research Methods

Top considerations for creating bioinformatics software documentation

Mehran Karimzadeh et al.

BRIEFINGS IN BIOINFORMATICS (2018)

Article Biotechnology & Applied Microbiology

GitHub Statistics as a Measure of the Impact of Open-Source Bioinformatics Software

Mikhail G. Dozmorov

FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY (2018)

Article Computer Science, Theory & Methods

Raw data queries during data-intensive parallel workflow execution

Vitor Silva et al.

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE (2017)

Article Biotechnology & Applied Microbiology

KNIME for reproducible cross-domain analysis of life science data

Alexander Fillbrunn et al.

JOURNAL OF BIOTECHNOLOGY (2017)

Editorial Material Multidisciplinary Sciences

Software simplified

Andrew Silver

NATURE (2017)

Article Biotechnology & Applied Microbiology

Reproducibility of computational workflows is automated using continuous analysis

Brett K. Beaulieu-Jones et al.

NATURE BIOTECHNOLOGY (2017)

Letter Biotechnology & Applied Microbiology

Nextflow enables reproducible computational workflows

Paolo Di Tommaso et al.

NATURE BIOTECHNOLOGY (2017)

Letter Biotechnology & Applied Microbiology

Toil enables reproducible, open source, big biomedical data analyses

John Vivian et al.

NATURE BIOTECHNOLOGY (2017)

Editorial Material Biochemical Research Methods

Ten Simple Rules for Developing Usable Software in Computational Biology

Markus List et al.

PLOS COMPUTATIONAL BIOLOGY (2017)

Article Biochemistry & Molecular Biology

Scalability and Validation of Big Data Bioinformatics Software

Andrian Yang et al.

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2017)

Article Multidisciplinary Sciences

Singularity: Scientific containers for mobility of compute

Gregory M. Kurtzer et al.

PLOS ONE (2017)

Article Biochemical Research Methods

Investigating reproducibility and tracking provenance - A genomic workflow case study

Sehrish Kanwal et al.

BMC BIOINFORMATICS (2017)

Article Biochemical Research Methods

BioContainers: an open-source and community-driven framework for software standardization

Felipe da Veiga Leprevost et al.

BIOINFORMATICS (2017)

Proceedings Paper Computer Science, Theory & Methods

Community curation in open dataset repositories: insights from Zenodo

Miguel-Angel Sicilia et al.

13TH INTERNATIONAL CONFERENCE ON CURRENT RESEARCH INFORMATION SYSTEMS, CRIS2016, COMMUNICATING AND MEASURING RESEARCH RESPONSIBLY: PROFILING, METRICS, IMPACT, INTEROPERABILITY (2017)

Review Genetics & Heredity

Coming of age: ten years of next-generation sequencing technologies

Sara Goodwin et al.

NATURE REVIEWS GENETICS (2016)

Article Biochemistry & Molecular Biology

deepTools2: a next generation web server for deep-sequencing data analysis

Fidel Ramirez et al.

NUCLEIC ACIDS RESEARCH (2016)

Article Biochemistry & Molecular Biology

Big Data: Astronomical or Genomical?

Zachary D. Stephens et al.

PLOS BIOLOGY (2015)

Article Computer Science, Information Systems

A modular package manager architecture

Pietro Abate et al.

INFORMATION AND SOFTWARE TECHNOLOGY (2013)

Article Biochemical Research Methods

Bpipe: a tool for running and managing bioinformatics pipelines

Simon P. Sadedin et al.

BIOINFORMATICS (2012)

Article Biochemical Research Methods

Snakemake-a scalable bioinformatics workflow engine

Johannes Koester et al.

BIOINFORMATICS (2012)

Article Biochemistry & Molecular Biology

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

Aaron McKenna et al.

GENOME RESEARCH (2010)