☆ 4.7 Article

Implementing molecular dynamics on hybrid high performance computers - short range forces

COMPUTER PHYSICS COMMUNICATIONS (2011)

期刊

COMPUTER PHYSICS COMMUNICATIONS

卷 182, 期 4, 页码 898-911

出版社

ELSEVIER

DOI: 10.1016/j.cpc.2010.12.021

关键词

Molecular dynamics; GPU; Hybrid parallel computing

类别

Computer Science, Interdisciplinary Applications Physics, Mathematical

资金

Office of Advanced Scientific Computing Research, Office of Science, U.S. Department of Energy [DE-AC05-00OR22725]
Office of Science of the U.S. Department of Energy [DE-AC05-000R22725]
U.S. Department of Energy [DE-AC04-94AL85000]
CSRF program at Sandia National Laboratories

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

The use of accelerators such as graphics processing units (GPUs) has become popular in scientific computing applications due to their low cost, impressive floating-point capabilities, high memory bandwidth, and low electrical power requirements. Hybrid high-performance computers, machines with more than one type of floating-point processor, are now becoming more prevalent due to these advantages. In this work, we discuss several important issues in porting a large molecular dynamics code for use on parallel hybrid machines - (1) choosing a hybrid parallel decomposition that works on central processing units (CPUs) with distributed memory and accelerator cores with shared memory, (2) minimizing the amount of code that must be ported for efficient acceleration, (3) utilizing the available processing power from both multi-core CPUs and accelerators, and (4) choosing a programming model for acceleration. We present our solution to each of these issues for short-range force calculation in the molecular dynamics package LAMMPS, however, the methods can be applied in many molecular dynamics codes. Specifically, we describe algorithms for efficient short range force calculation on hybrid high-performance machines. We describe an approach for dynamic load balancing of work between CPU and accelerator cores. We describe the Geryon library that allows a single code to compile with both CUDA and OpenCL for use on a variety of accelerators. Finally, we present results on a parallel test cluster containing 32 Fermi GPUs and 180 CPU cores. (C) 2010 Elsevier B.V. All rights reserved.

Implementing molecular dynamics on hybrid high performance computers - short range forces

期刊

COMPUTER PHYSICS COMMUNICATIONS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Implementing molecular dynamics on hybrid high performance computers - short range forces

期刊

COMPUTER PHYSICS COMMUNICATIONS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文