☆ 4.6 Article

An Intelligent Framework for Oversubscription Management in CPU-GPU Unified Memory

JOURNAL OF GRID COMPUTING (2023)

期刊

JOURNAL OF GRID COMPUTING

卷 21, 期 1, 页码 -

出版社

SPRINGER

DOI: 10.1007/s10723-023-09646-1

关键词

Discrete CPU-GPU system; Unified virtual memory; Oversubscription; Deep learning

类别

Computer Science, Information Systems Computer Science, Theory & Methods

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper proposes a novel framework for UVM oversubscription management in discrete CPU-GPU systems, which significantly outperforms the state-of-the-art methods on memory-intensive benchmarks. It reduces page thrashing and achieves improved IPC under different levels of memory oversubscription.

Unified virtual memory (UVM) improves GPU programmability by enabling on-demand data movement between CPU memory and GPU memory. However, due to the limited capacity of GPU device memory, oversubscription overhead becomes a major performance bottleneck for data-intensive workloads running on GPUs with UVM. This paper proposes a novel framework for UVM oversubscription management in discrete CPU-GPU systems. It consists of an access pattern classifier followed by a pattern-specific transformer-based model using a novel loss function aiming to reduce page thrashing. A policy engine is designed to leverage the model's result to perform accurate page prefetching and eviction. Our evaluation shows that our proposed framework significantly outperforms the state-of-the-art (SOTA) methods on a set of 11 memory-intensive benchmarks, reducing the number of pages thrashed by 64.4% under 125% memory oversubscription compared to the baseline, while the SOTA method reduces the number of pages thrashed by 17.3%. Compared to the SOTA method, our solution achieves average IPC improvement of 1.52X and 3.66X under 125% and 150% memory oversubscription.

An Intelligent Framework for Oversubscription Management in CPU-GPU Unified Memory

期刊

JOURNAL OF GRID COMPUTING

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

An Intelligent Framework for Oversubscription Management in CPU-GPU Unified Memory

期刊

JOURNAL OF GRID COMPUTING

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文