☆ 4.6 Article

Diffusion Probabilistic Modeling for Video Generation

ENTROPY (2023)

期刊

ENTROPY

卷 25, 期 10, 页码 -

出版社

MDPI

DOI: 10.3390/e25101469

关键词

diffusion models; deep generative models; video generation; autoregressive models

类别

Physics, Multidisciplinary

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Denoising diffusion probabilistic models are a promising new class of generative models that demonstrate high-quality image generation. This paper presents an autoregressive, end-to-end optimized video diffusion model that surpasses previous methods in perceptual and probabilistic forecasting metrics. Results show significant improvements in perceptual quality and probabilistic frame forecasting ability for various datasets.

Denoising diffusion probabilistic models are a promising new class of generative models that mark a milestone in high-quality image generation. This paper showcases their ability to sequentially generate video, surpassing prior methods in perceptual and probabilistic forecasting metrics. We propose an autoregressive, end-to-end optimized video diffusion model inspired by recent advances in neural video compression. The model successively generates future frames by correcting a deterministic next-frame prediction using a stochastic residual generated by an inverse diffusion process. We compare this approach against six baselines on four datasets involving natural and simulation-based videos. We find significant improvements in terms of perceptual quality and probabilistic frame forecasting ability for all datasets.

Diffusion Probabilistic Modeling for Video Generation

期刊

ENTROPY

出版社

MDPI

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Diffusion Probabilistic Modeling for Video Generation

期刊

ENTROPY

出版社

MDPI

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文