4.7 Article

SDE-RAE:CLIP-based realistic image reconstruction and editing network using stochastic differential diffusion

期刊

IMAGE AND VISION COMPUTING
卷 139, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.imavis.2023.104836

关键词

Stochastic differential equations; GANs; Image reconstruction; Image editing; CLIP; Diffusion model

向作者/读者索取更多资源

Generative Adversarial Networks (GANs) have been widely used in image reconstruction and editing. However, they are difficult to train and converge. To overcome this challenge, we propose a realistic image reconstruction and editing method based on Stochastic Differential Equation (SDE-RAE), which achieves high-quality image reconstruction with simple loss functions and interferes with parameter optimization using semantic enhancement.
Generative Adversarial Networks (GANs) has long dominated the field of image reconstruction and editing. It is capable to train a generator in an adversarial way, which can fool the discriminator and enable the generated image to be of high quality. However, this approach is often difficult to train, and the final result is hard to converge. Each different style of image requires construction of different datasets and complex optimization functions, and the training process is uncertain. To solve this problem, we propose a realistic image reconstruction and editing method based on Stochastic Differential Equation (SDE-RAE), where the diffusion model converts Gaussian noise to real photos by iterative denoising. What we only need to do is to construct simple loss functions in the reconstruction process to achieve high-quality image reconstruction, and we propose a novel semantic enhancement CLIP (Contrastive Language-Image Pre-Training) to interfere with the SDE parameter optimization direction in the editing process. Simple text is needed to achieve unique image editing. Our method generates high-quality images that retain the texture and contour features of the original image. Specifically, we manipulate the initial image, perturb the image by adding random noise, and then iteratively denoise the image by reverse SDE, manipulating the image's RGB pixels to achieve image reconstruction and editing. Code and dataset https://github.com/haizhu12/SDE-RAE.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据