
Performance-Energy Trade-off in Modern CMPs

ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION (2021)

Journal

ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION

Volume 18, Issue 1, Pages -

Publisher

ASSOC COMPUTING MACHINERY

DOI: 10.1145/3427092

Keywords

Resource contention; performance-energy trade-off; machine learning


Automated Summary

Modern processors support Dynamic Voltage and Frequency Scaling (DVFS) for fine-grained control of performance and energy consumption. The work aims to minimize power consumption while keeping performance degradation within predefined limits.

Abstract

Chip multiprocessors (CMPs) are ubiquitous in computing systems ranging from high-end servers to mobile devices. In these systems, energy consumption is a critical design constraint, as it constitutes the most significant operating cost for computing clouds. Similarly, longer battery life continues to be an essential user concern in mobile devices. To optimize power consumption, modern processors are designed with Dynamic Voltage and Frequency Scaling (DVFS) support at the individual core as well as the uncore level. This allows fine-grained control of performance and energy. For an n-core processor with m core and uncore frequency choices, the total DVFS configuration space is now $m^{n+1}$ (with the uncore accounting for the +1). In addition, in CMPs the performance-energy trade-off due to core/uncore frequency scaling for a single application cannot be determined independently, as cores share critical resources such as the last-level cache (LLC) and the memory. Thus, unlike in a uniprocessor environment, the energy consumption of an application running on a CMP depends not only on its own characteristics but also on those of its co-runners (applications running on other cores). The key objective of our work is to select suitable core and uncore frequencies that minimize power consumption while keeping application performance degradation within certain predefined limits (which can be termed QoS requirements). The key contribution of our work is a learning-based model that captures the interference due to shared cache, bus bandwidth, and memory bandwidth between applications running on multiple cores and predicts near-optimal frequencies for the core and uncore.
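To make the size of the configuration space and the selection objective concrete, the following is a minimal sketch, not the authors' method. It counts the configurations for n cores and m frequency choices, then exhaustively searches the (core, uncore) frequency pairs of a single application for the lowest predicted power that still meets a slowdown limit. The names `dvfs_space_size`, `pick_frequencies`, and `toy_predict` are hypothetical, and `toy_predict` is an invented analytic stand-in for a predictor; unlike the learning-based model described in the abstract, it ignores interference from co-runners.

```python
from itertools import product
from typing import Callable, Optional, Sequence, Tuple


def dvfs_space_size(n_cores: int, m_freqs: int) -> int:
    """Configurations when each of n cores and the uncore has m frequency choices."""
    return m_freqs ** (n_cores + 1)


def pick_frequencies(
    core_freqs: Sequence[float],
    uncore_freqs: Sequence[float],
    predict: Callable[[float, float], Tuple[float, float]],
    max_slowdown: float,
) -> Optional[Tuple[float, float]]:
    """Return the (core, uncore) pair with the lowest predicted power among
    pairs whose predicted slowdown stays within max_slowdown, or None."""
    best = None  # (power, core_freq, uncore_freq)
    for fc, fu in product(core_freqs, uncore_freqs):
        power, slowdown = predict(fc, fu)
        if slowdown > max_slowdown:
            continue  # violates the QoS limit
        if best is None or power < best[0]:
            best = (power, fc, fu)
    return None if best is None else (best[1], best[2])


def toy_predict(fc: float, fu: float) -> Tuple[float, float]:
    """Hypothetical model (assumption): power grows cubically with frequency,
    slowdown is relative to running at the maximum frequencies (3.0, 2.4)."""
    power = fc ** 3 + 0.5 * fu ** 3
    slowdown = 0.7 * (3.0 / fc) + 0.3 * (2.4 / fu) - 1.0
    return power, slowdown


if __name__ == "__main__":
    print(dvfs_space_size(4, 10))
    choice = pick_frequencies([1.2, 1.8, 2.4, 3.0], [1.2, 1.8, 2.4],
                              toy_predict, max_slowdown=0.25)
    print(choice)
```

The exhaustive loop is workable here only because it covers one application's frequency pairs. Over the full $m^{n+1}$ space it grows exponentially in the number of cores, which is one reason a predictive model is attractive.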
