4.7 Article

Spartan: A Sparsity-Adaptive Framework to Accelerate Deep Neural Network Training on GPUs

期刊

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TPDS.2021.3067825

关键词

Training; Monitoring; Sparse matrices; Graphics processing units; Acceleration; Market research; Engines; DNN; sparsity; GPU

资金

  1. AMD

向作者/读者索取更多资源

The article presents Spartan, a lightweight hardware/software framework to accelerate DNN training on GPU by exploiting activation sparsity, reducing computations and improving efficiency. Spartan provides efficient tools such as sparsity monitor, sparse GEMM algorithm, and compaction engine, achieving significant reductions in sparsity profiling overhead and speeding up training on compute-intensive layers like convolutional layers.
Deep Neural Networks (DNNs) have emerged as an important class of machine learning algorithms, providing accurate solutions to a broad range of applications. Sparsity in activation maps in DNN training presents an opportunity to reduce computations. However, exploiting activation sparsity presents two major challenges: i) profiling activation sparsity during training comes with significant overhead due to computing the degree of sparsity and the data movement; ii) the dynamic nature of activation maps requires dynamic dense-to-sparse conversion during training, leading to significant overhead. In this article, we present Spartan, a lightweight hardware/software framework to accelerate DNN training on a GPU. Spartan provides a cost-effective and programmer-transparent microarchitectural solution to exploit activation sparsity detected during training. Spartan provides an efficient sparsity monitor, a tile-based sparse GEMM algorithm, and a novel compaction engine designed for GPU workloads. Spartan can reduce sparsity profiling overhead by 52.5x on average. For the most compute-intensive layers, i.e., convolutional layers, we can speedup AlexNet by 3.4x, VGGNet-16 by 2.14x, and ResNet-18 by 2.02x, when training on the ImageNet dataset.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据