☆ 4.5 Article

Optimizing Sparse Matrix-Matrix Multiplication for the GPU

ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE (2015)

期刊

ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE

卷 41, 期 4, 页码 -

出版社

ASSOC COMPUTING MACHINERY

DOI: 10.1145/2699470

关键词

Algorithms; Performance; Parallel; sparse; GPU; matrix-matrix

类别

Computer Science, Software Engineering Mathematics, Applied

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Sparse matrix-matrix multiplication (SpGEMM) is a key operation in numerous areas from information to the physical sciences. Implementing SpGEMM efficiently on throughput-oriented processors, such as the graphics processing unit (GPU), requires the programmer to expose substantial fine-grained parallelism while conserving the limited off-chip memory bandwidth. Balancing these concerns, we decompose the SpGEMM operation into three highly parallel phases: expansion, sorting, and contraction, and introduce a set of complementary bandwidth-saving performance optimizations. Our implementation is fully general and our optimization strategy adaptively processes the SpGEMM workload row-wise to substantially improve performance by decreasing the work complexity and utilizing the memory hierarchy more effectively.

Optimizing Sparse Matrix-Matrix Multiplication for the GPU

期刊

ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Optimizing Sparse Matrix-Matrix Multiplication for the GPU

期刊

ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文