4.7 Article

Omega-Test: A Predictive Early-Z Culling to Improve the Graphics Pipeline Energy-Efficiency

期刊

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TVCG.2021.3087863

关键词

Rendering (computer graphics); Pipelines; Graphics processing units; Graphics; Computer architecture; Geometry; Games; Graphics processors; mobile processors; portable devices; hardware architecture; processor architecture; energy-aware systems; low-power design; hidden line; surface removal; visibility determination

资金

  1. CoCoUnit ERC Advanced Grant through EU's Horizon 2020 Program [833057]
  2. Spanish State Research Agency [TIN2016-75344-R]
  3. ICREA Academia Program
  4. University of Murcia

向作者/读者索取更多资源

This article introduces a microarchitectural technique called Omega-Test to reduce overdraw in tile-based rendering. By leveraging frame-to-frame coherence, the proposed approach utilizes discarded tile information from current frames to predict the visibility of tiles in the next frame. Experimental evaluation shows average energy savings of 26.4% and average speedup of 16.3% for the evaluated benchmarks.
The most common task of GPUs is to render images in real time. When rendering a 3D scene, a key step is to determine which parts of every object are visible in the final image. There are different approaches to solve the visibility problem, the Z-Test being the most common. A main factor that significantly penalizes the energy efficiency of a GPU, especially in the mobile arena, is the so-called overdraw, which happens when a portion of an object is shaded and rendered but finally occluded by another object. This useless work results in a waste of energy; however, a conventional Z-Test only avoids a fraction of it. In this article we present a novel microarchitectural technique, the Omega-Test, to drastically reduce the overdraw on a Tile-Based Rendering (TBR) architecture. Graphics applications have a great degree of inter-frame coherence, which makes the output of a frame very similar to the previous one. The proposed approach leverages the frame-to-frame coherence by using the resulting information of the Z-Test for a tile (a buffer containing all the calculated pixel depths for a tile), which is discarded by nowadays GPUs, to predict the visibility of the same tile in the next frame. As a result, the Omega-Test early identifies occluded parts of the scene and avoids the rendering of non-visible surfaces eliminating costly computations and off-chip memory accesses. Our experimental evaluation shows average EDP savings in the overall GPU/Memory system of 26.4 percent and an average speedup of 16.3 percent for the evaluated benchmarks.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据