Journal
COMPUTERS & ELECTRICAL ENGINEERING
Volume 62, Issue -, Pages 1-16
Publisher
PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.compeleceng.2017.07.014
Keywords
Tridiagonal matrix solvers; Heterogeneous systems; Central processing unit; Graphics processing unit; Extended algorithms
Funding
- National Key Research and Development Program of China [2016YFB1000400]
Large-scale tridiagonal matrix solvers for heterogeneous systems currently cannot balance computational efficiency and numerical stability when solving non-diagonally dominant matrices. A tridiagonal solver that combines a central processing unit with a graphics processing unit is proposed, using SPIKE2 as the solver framework, a simplified SPIKE algorithm as the central processing unit component, and a diagonal pivoting algorithm as the graphics processing unit component. Solver performance is further improved by using a data-layout-transformation mechanism to obtain contiguous addresses, by using constant memory to store data that does not change during the calculation and thereby reduce memory communication, and by employing a kernel-fusion mechanism to reduce the power consumption of the graphics processing unit. For diagonally dominant matrices, extended Thomas algorithms and cyclic reduction are proposed as replacements for the graphics processing unit component. Experimental results show that the proposed tridiagonal matrix solver achieves both numerical stability and computational efficiency, and reduces total power consumption while improving memory efficiency. (C) 2017 Elsevier Ltd. All rights reserved.
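To make the role of diagonal dominance concrete, here is a minimal sketch of the classical (textbook) Thomas algorithm in Python. It is not the extended variant proposed in the paper, whose details the abstract does not give; the function name `thomas_solve` and its argument layout are assumptions chosen for illustration. Because the sketch never pivots, each division relies on the running denominator staying away from zero, which diagonal dominance guarantees and which a general tridiagonal matrix does not.

```python
def thomas_solve(lower, diag, upper, rhs):
    """Solve a tridiagonal system with the classical Thomas algorithm.

    Hypothetical layout (an assumption for this sketch):
      lower[i] is the sub-diagonal entry of row i (lower[0] is unused),
      diag[i]  is the main-diagonal entry of row i,
      upper[i] is the super-diagonal entry of row i (upper[-1] is unused).
    No pivoting is performed, so the result is only reliable when the
    matrix is, for example, diagonally dominant.
    """
    n = len(diag)
    c_prime = [0.0] * n  # modified super-diagonal
    d_prime = [0.0] * n  # modified right-hand side

    # Forward elimination: remove the sub-diagonal row by row.
    c_prime[0] = upper[0] / diag[0]
    d_prime[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i] * c_prime[i - 1]
        c_prime[i] = upper[i] / denom if i < n - 1 else 0.0
        d_prime[i] = (rhs[i] - lower[i] * d_prime[i - 1]) / denom

    # Back substitution: recover the unknowns from the last row upward.
    x = [0.0] * n
    x[-1] = d_prime[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d_prime[i] - c_prime[i] * x[i + 1]
    return x


if __name__ == "__main__":
    # A small diagonally dominant system (4 on the diagonal, 1 off it).
    solution = thomas_solve(
        [0.0, 1.0, 1.0, 1.0],
        [4.0, 4.0, 4.0, 4.0],
        [1.0, 1.0, 1.0, 0.0],
        [6.0, 12.0, 18.0, 19.0],
    )
    print(solution)
```

The forward sweep is inherently sequential, which is one reason a solver of this kind would partition the matrix (as SPIKE-style frameworks do) or switch to cyclic reduction on the graphics processing unit rather than run a single Thomas sweep over the whole system.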