4.7 Article

Advancing Learned Video Compression With In-Loop Frame Prediction

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCSVT.2022.3222418

关键词

Video compression; Image coding; Streaming media; Ions; Standards; Interpolation; Prediction algorithms; Deep learning; video compression; in-loop prediction

向作者/读者索取更多资源

In recent years, there has been an increasing interest in end-to-end learned video compression. Previous works focused on compressing motion maps to exploit temporal redundancy. However, they did not fully utilize historical information in sequential reference frames. This paper proposes an Advanced Learned Video Compression (ALVC) approach with an in-loop frame prediction module, which effectively predicts the target frame from previously compressed frames. The experiments demonstrate the state-of-the-art performance of ALVC in learned video compression.
Recent years have witnessed an increasing interest in end-to-end learned video compression. Most previous works explore temporal redundancy by detecting and compressing a motion map to warp the reference frame towards the target frame. Yet, it failed to adequately take advantage of the historical priors in the sequential reference frames. In this paper, we propose an Advanced Learned Video Compression (ALVC) approach with the in-loop frame prediction module, which is able to effectively predict the target frame from the previously compressed frames, without consuming any bit-rate. The predicted frame can serve as a better reference than the previously compressed frame, and therefore it benefits the compression performance. The proposed in-loop prediction module is a part of the end-to-end video compression and is jointly optimized in the whole framework. We propose the recurrent and the bi-directional in-loop prediction modules for compressing P-frames and B-frames, respectively. The experiments show the state-of-the-art performance of our ALVC approach in learned video compression. We also outperform the default hierarchical B mode of x265 in terms of PSNR and beat the slowest mode of the SSIM-tuned x265 on MS-SSIM. The project page: https://github.com/RenYang-home/ALVC.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据