☆ 4.3 Article

Efficient Data Mapping and Buffering Techniques for Multilevel Cell Phase-Change Memories

ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION (2014)

期刊

ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION

卷 11, 期 4, 页码 -

出版社

ASSOC COMPUTING MACHINERY

DOI: 10.1145/2669365

关键词

Algorithms; Performance; Multilevel cell; phase-change memory; main memory; performance; energy; data mapping; data buffering

类别

Computer Science, Hardware & Architecture Computer Science, Theory & Methods

资金

NSF [0953246, 1147397, 1212962, 1320531]
Intel Science and Technology Center for Cloud Computing
Semiconductor Research Corporation
Intel Memory Hierarchy Program
Division of Computing and Communication Foundations
Direct For Computer & Info Scie & Enginr [0953246, 1147397] Funding Source: National Science Foundation

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

New phase-change memory (PCM) devices have low-access latencies (like DRAM) and high capacities (i.e., low cost per bit, like Flash). In addition to being able to scale to smaller cell sizes than DRAM, a PCM cell can also store multiple bits per cell (referred to as multilevel cell, or MLC), enabling even greater capacity per bit. However, reading and writing the different bits of data from and to an MLC PCM cell requires different amounts of time: one bit is read or written first, followed by another. Due to this asymmetric access process, the bits in an MLC PCM cell have different access latency and energy depending on which bit in the cell is being read or written. We leverage this observation to design a new way to store and buffer data in MLC PCM devices. While traditional devices couple the bits in each cell next to one another in the address space, our key idea is to logically decouple the bits in each cell into two separate regions depending on their read/write characteristics: fast-read/slow-write bits and slow-read/fast-write bits. We propose a low-overhead hardware/software technique to predict and map data that would benefit from being in each region at runtime. In addition, we show how MLC bit decoupling provides more flexibility in the way data is buffered in the device, enabling more efficient use of existing device buffer space. Our evaluations for a multicore system show that MLC bit decoupling improves system performance by 19.2%, memory energy efficiency by 14.4%, and thread fairness by 19.3% over a state-of-the-art MLC PCM system that couples the bits in its cells. We show that our results are consistent across a variety of workloads and system configurations.

Efficient Data Mapping and Buffering Techniques for Multilevel Cell Phase-Change Memories

期刊

ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Efficient Data Mapping and Buffering Techniques for Multilevel Cell Phase-Change Memories

期刊

ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文