4.5 Article

A uniform approach for programming distributed heterogeneous computing systems

Journal

JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING
Volume 74, Issue 12, Pages 3228-3239

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jpdc.2014.08.002

Keywords

OpenCL; MPI; Distributed computing; Heterogeneous computing; Programming model; Runtime system

Funding

  1. FWF Austrian Science Fund [TRP 220-N23]
  2. FWF Doctoral School CIM Computational Interdisciplinary Modelling [W01227]

Ask authors/readers for more resources

Large-scale compute clusters of heterogeneous nodes equipped with multi-core CPUs and GPUs are getting increasingly popular in the scientific community. However, such systems require a combination of different programming paradigms making application development very challenging. In this article we introduce libWater, a library-based extension of the OpenCL programming model that simplifies the development of heterogeneous distributed applications. libWater consists of a simple interface, which is a transparent abstraction of the underlying distributed architecture, offering advanced features such as inter-context and inter-node device synchronization. It provides a runtime system which tracks dependency information enforced by event synchronization to dynamically build a DAG of commands, on which we automatically apply two optimizations: collective communication pattern detection and device-host-device copy removal. We assess libWater's performance in three compute clusters available from the Vienna Scientific Cluster, the Barcelona Supercomputing Center and the University of Innsbruck, demonstrating improved performance and scaling with different test applications and configurations. (C) 2014 The Authors. Published by Elsevier Inc.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available