Article

Efficient Bitrate Ladder Construction for Content-Optimized Adaptive Video Streaming

Journal

IEEE OPEN JOURNAL OF SIGNAL PROCESSING
Volume 2, Pages 496-511

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/OJSP.2021.3086691

Keywords

Bit rate; Streaming media; Spatial resolution; Encoding; Training; Feature extraction; Testing; Bitrate ladder; adaptive video streaming; rate-quality curves; video compression; HEVC

Funding

  1. Netflix Video Coding Group

The research proposes a method that uses machine learning to predict a content-optimized bitrate ladder for on-demand video services, with the aim of reducing the number of encodes required. The results show a significant reduction in required encodes compared to exhaustive search and interpolation-based methods, at the cost of a small Bjontegaard Delta Rate difference.
One of the challenges faced by many video providers is the heterogeneity of network specifications, user requirements, and content compression performance. The universal solution of a fixed bitrate ladder is inadequate for ensuring a high quality of user experience without re-buffering or annoying compression artifacts. However, a content-tailored solution, based on extensive encoding across all resolutions and over a wide quality range, is highly expensive in terms of computational, financial, and energy costs. Motivated by this, we propose an approach that exploits machine learning to predict a content-optimized bitrate ladder for on-demand video services. The method extracts spatio-temporal features from the uncompressed content, trains machine-learning models to predict the Pareto front parameters and, based on those parameters, builds the ladder within a defined bitrate range. The method has the benefit of significantly reducing the number of encodes required per sequence. The presented results, based on 100 HEVC-encoded sequences, demonstrate a reduction in the number of encodes required when compared to an exhaustive search and an interpolation-based method, by 89.06% and 61.46%, respectively, at the cost of an average Bjontegaard Delta Rate difference of 1.78% compared to the exhaustive approach. Finally, a hybrid method is introduced that selects either the proposed or the interpolation-based method depending on the sequence features. This results in an overall 83.83% reduction of required encodings at the cost of an average Bjontegaard Delta Rate difference of 1.26%.
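
To make the terms in the abstract concrete, the following is a minimal Python sketch of the exhaustive reference construction that the proposed method aims to avoid: given rate-quality points from encodes at several resolutions, keep the Pareto-optimal points and pick one rung per target bitrate. It is not the paper's learned predictor. The names `Encode`, `pareto_front`, and `build_ladder`, the target bitrates, and all numeric values are hypothetical and chosen only for illustration; the quality score stands for any metric where higher is better.

```python
from typing import NamedTuple


class Encode(NamedTuple):
    resolution: str    # e.g. "1080p" (illustrative label)
    bitrate_kbps: int  # measured bitrate of the encode
    quality: float     # any score where higher is better


def pareto_front(encodes):
    """Keep encodes that no cheaper-or-equal encode matches or beats in quality."""
    # Sort by bitrate ascending; on ties, the higher-quality encode comes first.
    ordered = sorted(encodes, key=lambda e: (e.bitrate_kbps, -e.quality))
    front = []
    best_quality = float("-inf")
    for e in ordered:
        if e.quality > best_quality:
            front.append(e)
            best_quality = e.quality
    return front


def build_ladder(front, target_bitrates_kbps):
    """For each target bitrate, pick the best front point at or below it."""
    ladder = []
    for target in sorted(target_bitrates_kbps):
        candidates = [e for e in front if e.bitrate_kbps <= target]
        if candidates:
            # The front is sorted by bitrate with strictly increasing quality,
            # so the last candidate is the best one within the budget.
            rung = candidates[-1]
            if rung not in ladder:
                ladder.append(rung)
    return ladder


# Made-up rate-quality points for two resolutions (assumption, not paper data).
encodes = [
    Encode("540p", 500, 32.0),
    Encode("540p", 1000, 35.0),
    Encode("540p", 2000, 36.5),
    Encode("1080p", 1000, 33.0),
    Encode("1080p", 2000, 37.0),
    Encode("1080p", 4000, 40.0),
]

front = pareto_front(encodes)
ladder = build_ladder(front, [750, 1500, 3000, 6000])
for rung in ladder:
    print(rung.resolution, rung.bitrate_kbps, rung.quality)
```

In this toy data the front switches from 540p to 1080p between 1000 and 2000 kbps, which is the kind of resolution crossover a content-optimized ladder is meant to capture. The exhaustive version needs every encode up front; the approach described in the abstract instead predicts the front's parameters from content features so that far fewer encodes are required.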
