4.7 Article

Wavelet Transform-Assisted Adaptive Generative Modeling for Colorization

期刊

IEEE TRANSACTIONS ON MULTIMEDIA
卷 25, 期 -, 页码 4547-4562

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2022.3177933

关键词

Automatic colorization; wavelet transform; unsupervised learning; generative model; multi-scale

向作者/读者索取更多资源

This study proposes a novel scheme that utilizes a score-based generative model in the wavelet domain to improve the image colorization task. By leveraging the properties of wavelet transform, the model effectively learns richer priors and reduces the dimension of the data manifold. Additionally, by applying data-consistency and structure-consistency conditions, remarkable improvements are achieved in both generation and colorization quality.
Unsupervised deep learning has recently demonstrated the promise of producing high-quality samples. While it has tremendous potential to promote the image colorization task, the performance is limited owing to the high-dimension of data manifold and model capability. This study presents a novel scheme that exploits the score-based generative model in wavelet domain to address the issues. By taking advantage of the multi-scale and multi-channel representation via wavelet transform, the proposed model learns the richer priors from stacked coarse and detailed wavelet coefficient components jointly and effectively. This strategy also reduces the dimension of the original manifold and alleviates the curse of dimensionality, which is beneficial for estimation and sampling. Moreover, dual consistency terms in the wavelet domain, namely data-consistency and structure-consistency are devised to leverage colorization task better. Specifically, in the training phase, a set of multi-channel tensors consisting of wavelet coefficients is used as the input to train the network with denoising score matching. In the inference phase, samples are iteratively generated via annealed Langevin dynamics with data and structure consistencies. Experiments demonstrated remarkable improvements of the proposed method on both generation and colorization quality, particularly in colorization robustness and diversity.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据