☆ 4.4 Article

ARDA-UNIT recurrent dense self-attention block with adaptive feature fusion for unpaired (unsupervised) image-to-image translation

IET IMAGE PROCESSING (2023)

期刊

IET IMAGE PROCESSING

卷 -, 期 -, 页码 -

出版社

WILEY

DOI: 10.1049/ipr2.12894

关键词

computer vision; image processing

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic Imaging Science & Photographic Technology

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper introduces a generative adversarial network model called ARDA-UNIT, which aims to tackle the challenges of image-to-image translation by generating images close to the target domain while preserving important features of the source domain. The model enhances its generating capability and reduces training parameters by applying a recurrent dense self-attention module in the generator latent space. Experimental results demonstrate that the model achieves better qualities by reducing computational loads, transferring structures effectively, and improving evaluation criteria such as FID, KID, and IS.

One of the most challenging topics in artificial intelligence is image-to-image translation, the purpose of which is generating images close to those in the target domain while preserving the important features of the images in the source domain. In this direction, various types of generative adversarial networks have been developed. ARDA-UNIT, presented in this paper, seeks to meet the main challenges of these networks, that is, producing a high-quality image in a reasonable amount of time, and transferring content between two images with different structures. The proposed recurrent dense self-attention block, applied in ARDA-UNIT's generator latent space, simultaneously increases its generating capability and decreases the training parameters. ARDA-UNIT has a feature extraction module which feeds both the generator and the discriminator. This module uses a new adaptive feature fusion method which combines multi-scale features in such a way that the characteristics of each scale are preserved. The module also uses a pre-trained CNN that reduces the training parameters. Moreover, a feature similarity loss is introduced that guides the model to change the structure of the source domain in accordance with that in the target domain. Experiments performed on different datasets using FID, KID and IS evaluation criteria have shown that the model reduces computational loads, transfers structures well, and achieves better qualities.

ARDA-UNIT recurrent dense self-attention block with adaptive feature fusion for unpaired (unsupervised) image-to-image translation

期刊

IET IMAGE PROCESSING

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

ARDA-UNIT recurrent dense self-attention block with adaptive feature fusion for unpaired (unsupervised) image-to-image translation

期刊

IET IMAGE PROCESSING

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文