期刊
JOURNAL OF SYSTEMS ARCHITECTURE
卷 113, 期 -, 页码 -出版社
ELSEVIER
DOI: 10.1016/j.sysarc.2020.101897
关键词
Digital signal processing; Heterogeneous computing platform; GPU; Parallel; HOSVD
资金
- CONACyT, Mexico
Efficient prototyping tools can reduce design and validation efforts for enterprises and research centers, leading to shorter time-to-market and increased competitiveness. By implementing high-performance software libraries in a modular way, it is possible to quickly compare prototype designs under different criteria and improve throughput and reusability.
Efficient prototyping is an invaluable resource for modern enterprises and research centers. An efficient prototyping tool exhibits high throughput while maintaining flexibility, and reduces design and validation efforts, resulting in low time-to-market and high competitiveness. This paper presents a modular implementation of high-performance software (SW) libraries running on a Heterogeneous Computing Platform (HCP) based on CPU+GPU. The proposed SW libraries enable a fast and easy comparison of a prototype under different implementation criteria and maintain a high throughput and reusability due to their modular definition. These features accelerate the prototyping task by removing the overhead of designing and validating ad-hoc implementations. The novelty and benefits of this proposal are presented by prototyping and analysis of the multilinear SVD or Higher-Order SVD (HOSVD), an important, widely-used, and computationally demanding tensor decomposition. The mean square error (MSE), processing time, and speedup of this case study show its high performance, while modularity maintains flexibility. The HOSVD prototype reaches a maximum speedup of 17x that of one of the most important implementations in the state of the art.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据