Journal
PARALLEL COMPUTING: TECHNOLOGY TRENDS
Volume 36, Issue -, Pages 219-228Publisher
IOS PRESS
DOI: 10.3233/APC200044
Keywords
Intel Xeon Phi; MCDRAM; Sparse Matrix-VectorMultiplication; Maximum Likelihood Expectation-Maximization; Positron Emission Tomography
Categories
Funding
- German Federal Ministry for Education and Research [01\H16010D]
- Leibniz Supercomputer Centre [pr63qi]
Ask authors/readers for more resources
Memory bandwidth plays an essential role in high performance computing. Its impact on system performance is evident when running applications with a low arithmetic intensity. Therefore, high bandwidth memory is on the agenda of many vendors. However, depending on the memory architecture, other optimizations are required to exploit the performance gain from high bandwidth memory technology. In this paper, we present our optimizations for the Maximum Likelihood Expectation-Maximization (MLEM) algorithm, a method for positron emission tomography (PET) image reconstruction, with a sparse matrix-vector (SpMV) kernel. The results show significant improvement in performance when executing the code on an Intel Xeon Phi processor with MCDRAM when compared to multi-channel DRAM. We further identify that the latency of the MCDRAM becomes a new limiting factor, requiring further optimization. Ultimately, after implementing cache-blocking optimization, we achieved a total memory bandwidth of up to 180 GB/s for the SpMV operation.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available