4.5 Article

HiMap: Fast and Scalable High-Quality Mapping on CGRA via Hierarchical Abstraction

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCAD.2021.3132551

关键词

Coarse-grained reconfigurable array (CGRA); systolic arrays

资金

  1. National Research Foundation, Singapore = [NRF-CRP23-2019-0003]
  2. Singapore Ministry of Education Academic Research Fund TI [251RES1905]

向作者/读者索取更多资源

The article introduces a fast and scalable CGRA mapping method called HiMap, which can generate close-to-optimal solutions, improve performance and energy efficiency, and has a short compilation time.
Coarse-grained reconfigurable array (CGRA) has emerged as a promising hardware accelerator due to the excellent balance between reconfigurability, performance, and energy efficiency. The performance of a CGRA strongly depends on the existence of a high-quality compiler to map the application kernels on the architecture. Unfortunately, the state-of-the-art compiler technology falls short in generating high-performance mapping within an acceptable compilation time, especially with increasing CGRA size. We propose HiMap-a fast and scalable CGRA mapping approach-that is also adept at producing closeto-optimal solutions for regular computational kernels prevalent in existing and emerging application domains. The key strategy behind HiMap's efficiency and scalability is to exploit the regularity in the computation by employing a virtual systolic array (VSA) as an intermediate abstraction layer in a hierarchical mapping. HiMap first maps the loop iterations of the kernel onto a VSA and then distills out the unique patterns in the mapping. These unique patterns are subsequently mapped onto subspaces of the physical CGRA. They are arranged together according to the systolic array mapping to create a complete mapping of the kernel. Experimental results confirm that HiMap can generate application mappings that hit the performance envelope of the CGRA. HiMap offers 17.3 x and 5 x improvement in performance and energy efficiency of the mappings compared to the state of the art. The compilation time of HiMap for near-optimal mappings is less than 15 min for 64 x 64 CGRA while existing approaches take days to generate inferior mappings.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据