4.4 Article

ARDA-UNIT recurrent dense self-attention block with adaptive feature fusion for unpaired (unsupervised) image-to-image translation

期刊

IET IMAGE PROCESSING
卷 -, 期 -, 页码 -

出版社

WILEY
DOI: 10.1049/ipr2.12894

关键词

computer vision; image processing

向作者/读者索取更多资源

This paper introduces a generative adversarial network model called ARDA-UNIT, which aims to tackle the challenges of image-to-image translation by generating images close to the target domain while preserving important features of the source domain. The model enhances its generating capability and reduces training parameters by applying a recurrent dense self-attention module in the generator latent space. Experimental results demonstrate that the model achieves better qualities by reducing computational loads, transferring structures effectively, and improving evaluation criteria such as FID, KID, and IS.
One of the most challenging topics in artificial intelligence is image-to-image translation, the purpose of which is generating images close to those in the target domain while preserving the important features of the images in the source domain. In this direction, various types of generative adversarial networks have been developed. ARDA-UNIT, presented in this paper, seeks to meet the main challenges of these networks, that is, producing a high-quality image in a reasonable amount of time, and transferring content between two images with different structures. The proposed recurrent dense self-attention block, applied in ARDA-UNIT's generator latent space, simultaneously increases its generating capability and decreases the training parameters. ARDA-UNIT has a feature extraction module which feeds both the generator and the discriminator. This module uses a new adaptive feature fusion method which combines multi-scale features in such a way that the characteristics of each scale are preserved. The module also uses a pre-trained CNN that reduces the training parameters. Moreover, a feature similarity loss is introduced that guides the model to change the structure of the source domain in accordance with that in the target domain. Experiments performed on different datasets using FID, KID and IS evaluation criteria have shown that the model reduces computational loads, transfers structures well, and achieves better qualities.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据