3.8 Proceedings Paper

Towards an Efficient Accelerator for DNN-based Remote Sensing Image Segmentation on FPGAs

出版社

IEEE
DOI: 10.1109/FPL.2019.00037

关键词

-

资金

  1. United Kingdom EPSRC [EP/L016796/1, EP/N031768/1, EP/P010040/1, EP/L00058X/1]
  2. Intel
  3. Corerain
  4. Maxeler
  5. Xilinx
  6. SGIIT
  7. EPSRC [EP/S030069/1] Funding Source: UKRI

向作者/读者索取更多资源

Among popular techniques in remote sensing image (RSI) segmentation, Deep Neural Networks (DNNs) have gained increasing interest but often require high computation complexity, which largely limits their applicability in on-board space platforms. Therefore, various dedicated hardware designs on FPGAs have been developed to accelerate DNNs. However, it imposes difficulty on the design of efficient accelerators for DNN-based segmentation algorithms, since they need to perform both convolution and deconvolution which are two fundamentally different types of operations. This paper proposes a uniform architecture to efficiently implement both convolution and deconvolution in one vector multiplication module. This architecture is further optimized through exploiting different levels of parallelism and layer fusion to achieve low latency for RSI segmentation tasks. Moreover, an optimized DNN model is developed for real-time RSI segmentation, which shows preferable accuracy compared to other methods. The proposed hardware accelerator efficiently implements the DNN model on Intel's Arria 10 device, demonstrating 1578 GOPS of throughput and 17.4 ms of latency, i.e., 57 images per second.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据