Proceedings Paper

Reproducible Cross-border High Performance Computing for Scientific Portals

Publisher

IEEE
DOI: 10.1109/eScience55777.2022.00087

Keywords

Reproducibility; Cross-border computing; Workflows; Scientific portals; PlutoF; Galaxy; HPC; Containers

Funding

  1. European Union [857652]

Abstract

Reproducing eScience results requires addressing several challenges: automating scientific workflows, specifying software versions unambiguously, providing easy access to input data, and enabling cross-border access to High-Performance Computing (HPC) clusters to achieve bit-to-bit reproducibility. This paper presents a solution for accessing remote HPC and cloud compute and data resources from scientific portals across borders. It relies on rigorous container-based packaging of software versions and on setup automation, which enhances reproducibility.
To reproduce eScience results, several challenges need to be solved. Scientific workflows need to be automated, the software versions involved need to be specified unambiguously, and input data needs to be easily accessible. High-Performance Computing (HPC) clusters are often involved, and to achieve bit-to-bit reproducibility it might even be necessary to execute the code on a particular cluster to avoid differences caused by different HPC platforms. Unless this is the scientist's local cluster, it must be accessed across (administrative) borders. Ideally, all of this should be user-friendly so that even inexperienced users can (re-)produce results. While some easy-to-use web-based scientific portals already support access to HPC resources, this support is typically limited to local computing and data resources. Using two community-specific portals from the fields of biodiversity and climate research as examples, we present a solution for accessing remote HPC (and cloud) compute and data resources from scientific portals across borders. The solution involves rigorous container-based packaging of the software version and setup automation, thus enhancing reproducibility.
