期刊
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY
卷 33, 期 5, 页码 2410-2423出版社
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCSVT.2022.3222418
关键词
Video compression; Image coding; Streaming media; Ions; Standards; Interpolation; Prediction algorithms; Deep learning; video compression; in-loop prediction
In recent years, there has been an increasing interest in end-to-end learned video compression. Previous works focused on compressing motion maps to exploit temporal redundancy. However, they did not fully utilize historical information in sequential reference frames. This paper proposes an Advanced Learned Video Compression (ALVC) approach with an in-loop frame prediction module, which effectively predicts the target frame from previously compressed frames. The experiments demonstrate the state-of-the-art performance of ALVC in learned video compression.
Recent years have witnessed an increasing interest in end-to-end learned video compression. Most previous works explore temporal redundancy by detecting and compressing a motion map to warp the reference frame towards the target frame. Yet, it failed to adequately take advantage of the historical priors in the sequential reference frames. In this paper, we propose an Advanced Learned Video Compression (ALVC) approach with the in-loop frame prediction module, which is able to effectively predict the target frame from the previously compressed frames, without consuming any bit-rate. The predicted frame can serve as a better reference than the previously compressed frame, and therefore it benefits the compression performance. The proposed in-loop prediction module is a part of the end-to-end video compression and is jointly optimized in the whole framework. We propose the recurrent and the bi-directional in-loop prediction modules for compressing P-frames and B-frames, respectively. The experiments show the state-of-the-art performance of our ALVC approach in learned video compression. We also outperform the default hierarchical B mode of x265 in terms of PSNR and beat the slowest mode of the SSIM-tuned x265 on MS-SSIM. The project page: https://github.com/RenYang-home/ALVC.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据