期刊
PARALLEL COMPUTING: TECHNOLOGY TRENDS
卷 36, 期 -, 页码 219-228出版社
IOS PRESS
DOI: 10.3233/APC200044
关键词
Intel Xeon Phi; MCDRAM; Sparse Matrix-VectorMultiplication; Maximum Likelihood Expectation-Maximization; Positron Emission Tomography
类别
资金
- German Federal Ministry for Education and Research [01\H16010D]
- Leibniz Supercomputer Centre [pr63qi]
Memory bandwidth plays an essential role in high performance computing. Its impact on system performance is evident when running applications with a low arithmetic intensity. Therefore, high bandwidth memory is on the agenda of many vendors. However, depending on the memory architecture, other optimizations are required to exploit the performance gain from high bandwidth memory technology. In this paper, we present our optimizations for the Maximum Likelihood Expectation-Maximization (MLEM) algorithm, a method for positron emission tomography (PET) image reconstruction, with a sparse matrix-vector (SpMV) kernel. The results show significant improvement in performance when executing the code on an Intel Xeon Phi processor with MCDRAM when compared to multi-channel DRAM. We further identify that the latency of the MCDRAM becomes a new limiting factor, requiring further optimization. Ultimately, after implementing cache-blocking optimization, we achieved a total memory bandwidth of up to 180 GB/s for the SpMV operation.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据