☆ 4.7 Article

Aggregated Contextual Transformations for High-Resolution Image Inpainting

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS (2023)

期刊

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS

卷 29, 期 7, 页码 3266-3280

出版社

IEEE COMPUTER SOC

DOI: 10.1109/TVCG.2022.3156949

关键词

Generators; Generative adversarial networks; Cognition; Training; Task analysis; Filling; Convolution; Image synthesis; image inpainting; object removal; generative adversarial networks (GAN)

类别

Computer Science, Software Engineering

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Image inpainting is a promising but challenging task. We propose an enhanced GAN-based model, AOT-GAN, for high-resolution image inpainting. The model enhances context reasoning by stacking multiple layers of AOT blocks and improves texture synthesis through training the discriminator with a tailored mask-prediction task. Extensive comparisons and user study show the superiority of our model.

Image inpainting that completes large free-form missing regions in images is a promising yet challenging task. State-of-the-art approaches have achieved significant progress by taking advantage of generative adversarial networks (GAN). However, these approaches can suffer from generating distorted structures and blurry textures in high-resolution images (e.g., $512\times 512$512x512). The challenges mainly drive from (1) image content reasoning from distant contexts, and (2) fine-grained texture synthesis for a large missing region. To overcome these two challenges, we propose an enhanced GAN-based model, named Aggregated COntextual-Transformation GAN (AOT-GAN), for high-resolution image inpainting. Specifically, to enhance context reasoning, we construct the generator of AOT-GAN by stacking multiple layers of a proposed AOT block. The AOT blocks aggregate contextual transformations from various receptive fields, allowing to capture both informative distant image contexts and rich patterns of interest for context reasoning. For improving texture synthesis, we enhance the discriminator of AOT-GAN by training it with a tailored mask-prediction task. Such a training objective forces the discriminator to distinguish the detailed appearances of real and synthesized patches, and in turn facilitates the generator to synthesize clear textures. Extensive comparisons on Places2, the most challenging benchmark with 1.8 million high-resolution images of 365 complex scenes, show that our model outperforms the state-of-the-art. A user study including more than 30 subjects further validates the superiority of AOT-GAN. We further evaluate the proposed AOT-GAN in practical applications, e.g., logo removal, face editing, and object removal. Results show that our model achieves promising completions in the real world. We release codes and models in https://github.com/researchmm/AOT-GAN-for-Inpainting.

Aggregated Contextual Transformations for High-Resolution Image Inpainting

期刊

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS

出版社

IEEE COMPUTER SOC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Aggregated Contextual Transformations for High-Resolution Image Inpainting

期刊

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS

出版社

IEEE COMPUTER SOC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文