☆ 3.8 Proceedings Paper

Fast Online Video Super-Resolution with Deformable Attention Pyramid

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) (2023)

期刊

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV)

卷 -, 期 -, 页码 1735-1744

出版社

IEEE COMPUTER SOC

DOI: 10.1109/WACV56688.2023.00178

关键词

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic Imaging Science & Photographic Technology

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

In this work, we propose a recurrent VSR architecture based on a deformable attention pyramid (DAP) to address the VSR problem with strict causal, real-time, and latency constraints. Our DAP aligns and integrates information from the recurrent state into the current frame prediction, overcoming the challenge of unavailable future frame information. By attending to a limited number of spatial locations dynamically predicted by the DAP, we reduce computational cost compared to traditional attention-based methods. Experimental results show the effectiveness of our approach, achieving a significant speed-up and surpassing state-of-the-art methods on benchmark tests.

Video super-resolution (VSR) has many applications that pose strict causal, real-time, and latency constraints, including video streaming and TV. We address the VSR problem under these settings, which poses additional important challenges since information from future frames is unavailable. Importantly, designing efficient, yet effective frame alignment and fusion modules remain central problems. In this work, we propose a recurrent VSR architecture based on a deformable attention pyramid (DAP). Our DAP aligns and integrates information from the recurrent state into the current frame prediction. To circumvent the computational cost of traditional attention-based methods, we only attend to a limited number of spatial locations, which are dynamically predicted by the DAP. Comprehensive experiments and analysis of the proposed key innovations show the effectiveness of our approach. We significantly reduce processing time and computational complexity in comparison to state-of-the-art methods, while maintaining a high performance. We surpass state-of-the-art method EDVR-M on two standard benchmarks with a speed-up of over 3x.

Fast Online Video Super-Resolution with Deformable Attention Pyramid

期刊

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV)

出版社

IEEE COMPUTER SOC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Fast Online Video Super-Resolution with Deformable Attention Pyramid

期刊

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV)

出版社

IEEE COMPUTER SOC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文