4.7 Article

PaScaL_TDMA: A library of parallel and scalable solvers for massive tridiagonal systems

Journal

COMPUTER PHYSICS COMMUNICATIONS
Volume 260, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.cpc.2020.107722

Keywords

Parallel Tridiagonal matrix algorithm; Modified Thomas algorithm; Tridiagonal systems; High-performance computing

Funding

  1. KISTI, South Korea [KSC-2018-CHA-0007, K-19-SG-27-01R-1]
  2. Agency for Defense Development of Korea [UD170053GD]
  3. National Research Foundation of Korea (NRF) - Korean government (Ministry of Science and ICT) [NRF-2017R1E1A1A0-3070161, NRF-20151009350]

Ask authors/readers for more resources

This study aims to solve many tridiagonal systems in multi-dimensional partial differential equations efficiently and with reduced communication overhead. A modified Thomas algorithm and a newly designed communication scheme were used for this purpose. Benchmark tests showed significant reduction in communication time compared to global all-to-all communication methods, especially for larger problem sizes and more cores. The computational procedures were implemented in an open-source library called PaScaL_TDMA, and demonstrated good scalability on a cluster system.
The aim of this study is to devise an efficient and scalable computational procedure to solve the many tridiagonal systems in multi-dimensional partial differential equations. The modified Thomas algorithm and a newly designed communication scheme were used to reduce the communication overhead encountered while solving the many tridiagonal systems. Benchmark test results reveal an advantage of the proposed procedures compared to global all-to-all communication methods - a significantly reduced communication time that becomes more prominent for larger problem sizes and greater number of cores. The proposed computational procedures are fully implemented in an open-source library called Parallel and Scalable Library for TDMA (PaScaL_TDMA). Considering a three-dimensional heat conduction problem as a practical example, we obtain good strong and weak scalability results up to 262,144 computing cores on the KISTI Nurion cluster system, which, to the best of our knowledge, is the largest parallel simulation for solving tridiagonal systems. The potential of this library for largescale substantive problems in physics is also demonstrated through direct numerical simulations of the Rayleigh-Benard convection problem, which yielded excellent scalability and accurate results. Program summary Program Title: PaScaL_TDMA CPC Library link to program files: http://dx.doi.org/10.17632/49z6fh94z3.1 Developer's repository link: https://github.com/MPMC-Lab Licensing provisions: MIT Programming language: Fortran90 Nature of problem: Tridiagonal systems for solving multi-dimensional partial differential equations. Solution method: The divide-and-conquer method is used to solve partitioned tridiagonal systems of equations in the distributed memory system. The partitioned tridiagonal systems of equations are transformed into modified sub-matrices using the modified Thomas algorithm [1]. Reduced tridiagonal systems are constructed by collecting the first and last rows of the modified submatrices from each computing core. The newly designed communication scheme based on MPI_Alltoallw accelerates to collect rows and construct the reduced tridiagonal systems. The solutions of the reduced tridiagonal systems are obtained via the sequential Thomas algorithm. Thereafter, the remaining unknowns in the modified sub-matrices are solved using solutions of the reduced tridiagonal systems. (C) 2020 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available