☆ 4.3 Article

Software Prefetching for Indirect Memory Accesses: A Microarchitectural Perspective

ACM TRANSACTIONS ON COMPUTER SYSTEMS (2019)

Journal

ACM TRANSACTIONS ON COMPUTER SYSTEMS

Volume 36, Issue 3, Pages -

Publisher

ASSOC COMPUTING MACHINERY

DOI: 10.1145/3319393

Keywords

Software Prefetching; Compiler Analysis; Microarchitecture

Funding

Engineering and Physical Sciences Research Council (EPSRC) [EP/K026399/1, EP/M506485/1]
ARM Ltd.
EPSRC [EP/K026399/1] Funding Source: UKRI

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Many modern data processing and HPC workloads are heavily memory-latency bound. A tempting proposition to solve this is software prefetching, where special non-blocking loads are used to bring data into the cache hierarchy just before being required. However, these are difficult to insert to effectively improve performance, and techniques fir automatic insertion are currently limited. This article develops a novel compiler pass to automatically generate software prefetches for indirect memory accesses, a special class of irregular memory accesses often seen in high-performance workloads. We evaluate this across a wide set of systems, all of which gain benefit from the technique. We then evaluate the extent to which good prefetch instructions are architecture dependent and the class of programs that are particularly amenable. Across a set of memory-bound benchmarks, our automated pass achieves average speedups of 1.3x for an Intel Haswell processor, 1.1x for both an ARM Cortex-A57 and Qualcomm Kryo, 1.2x for a Cortex-72 and an Intel Kaby Lake, and 1.35x for an Intel Xeon Phi Knight's Landing, each of which is an out-of-order core, and performance improvements of 2.1x and 2.7x for the in-order ARM Cortex -A53 and first generation Intel Xeon Phi.

Software Prefetching for Indirect Memory Accesses: A Microarchitectural Perspective

Journal

ACM TRANSACTIONS ON COMPUTER SYSTEMS

Publisher

ASSOC COMPUTING MACHINERY

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Software Prefetching for Indirect Memory Accesses: A Microarchitectural Perspective

Journal

ACM TRANSACTIONS ON COMPUTER SYSTEMS

Publisher

ASSOC COMPUTING MACHINERY

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper