Article

Fast filter bank convolution for three-dimensional wavelet transform by shared memory on mobile GPU computing

Journal

JOURNAL OF SUPERCOMPUTING
Volume 71, Issue 9, Pages 3440-3455

Publisher

SPRINGER
DOI: 10.1007/s11227-015-1443-7

Keywords

One-level DWT; Three-dimensional DWT; Mobile GPU computing; Video processing; Pixel parallelization; Shared memory; Bank conflict

Abstract

Mobile GPU applications are usually constrained by real-time requirements. However, the FLOPS of a mobile GPU are limited by the size and power supply of the SoC system. Like desktop GPUs, a mobile GPU has an on-chip memory hierarchy, and proper use of that hierarchy accelerates mobile GPU applications such as the Discrete Wavelet Transform (DWT) enough to satisfy the real-time requirement. In this paper, by taking advantage of GPU shared memory in the Tegra K1, a mobile GPU from Nvidia, we develop Bank Conflict Free Shared Memory Parallel DWT for mobile GPU applications. Computational results show that, with a display resolution of (EGA), Bank Conflict Free Shared Memory Parallel DWT is significantly faster than SoC CPU-based DWT. Computational results also show that, with display resolutions of (CGA), (VGA), (SVGA) and (XGA), Bank Conflict Free Shared Memory Parallel DWT can generally satisfy the real-time requirement.
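
The abstract does not say how the bank conflicts are avoided. As a rough illustration of what "bank conflict free" means, the sketch below models shared memory as 32 banks of 4-byte words (an assumption about the hardware configuration, not a detail taken from the paper) and counts how many threads of a 32-thread warp hit the same bank when reading one column of a tile. Padding each row by one word is a common general technique and is shown only as an example; it is not necessarily the paper's method. All function names are hypothetical.

```python
from collections import Counter

NUM_BANKS = 32  # assumption: 32 banks, each one 4-byte word wide


def bank_of(word_index, num_banks=NUM_BANKS):
    """Bank that holds a given 4-byte word under the simple modulo model."""
    return word_index % num_banks


def conflict_degree(word_indices, num_banks=NUM_BANKS):
    """Largest number of distinct words that map to a single bank.

    1 means conflict free; n means the access is serialized n ways.
    Threads reading the same word are not counted as a conflict.
    """
    per_bank = Counter(bank_of(w, num_banks) for w in set(word_indices))
    return max(per_bank.values())


def column_access(column, row_stride, threads=32):
    """Word indices touched when thread t reads tile[t][column]."""
    return [t * row_stride + column for t in range(threads)]


def row_access(row, row_stride, threads=32):
    """Word indices touched when thread t reads tile[row][t]."""
    return [row * row_stride + t for t in range(threads)]


if __name__ == "__main__":
    # 32x32 tile: every element of a column lands in the same bank.
    print("column, stride 32:", conflict_degree(column_access(0, 32)))
    # Same tile with one padding word per row: the column spreads over all banks.
    print("column, stride 33:", conflict_degree(column_access(0, 33)))
    # Row reads are conflict free with or without padding.
    print("row,    stride 33:", conflict_degree(row_access(3, 33)))
```

In a separable transform such as the DWT, a filter pass along one axis reads rows and a pass along another reads columns, which is why the column case matters in this simplified model.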
