4.6 Article

Application of Improved CycleGAN in Laser-Visible Face Image Translation

Journal

SENSORS
Volume 22, Issue 11, Pages -

Publisher

MDPI
DOI: 10.3390/s22114057

Keywords

CycleGAN; least squares method; identity loss; RRDB module

Funding

  1. Key Basic Research Projects of the Basic Strengthening Program [2020-JCJQ-ZD-071]

Ask authors/readers for more resources

CycleGAN is widely used in various image translations, such as thermal-infrared-visible-image translation, near-infrared-visible-image translation, and shortwave-infrared-visible-image translation. The paper constructs a laser-visible face mapping dataset and introduces improvements to the original CycleGAN model by replacing the loss function and adding an identity loss function to strengthen network constraints on the generator. Experimental results show improved SSIM values and reduced FID values compared to existing models. Additionally, a new RRDB module is introduced for better feature extraction and preservation in profile image translations, further enhancing the performance metrics.
CycleGAN is widely used in various image translations, such as thermal-infrared-visible-image translation, near-infrared-visible-image translation, and shortwave-infrared-visible-image translation. However, most image translations are used for infrared-to-visible translation, and the wide application of laser imaging has an increasingly strong demand for laser-visible image translation. In addition, the current image translation is mainly aimed at frontal face images, which cannot be effectively utilized to translate faces at a certain angle. In this paper, we construct a laser-visible face mapping dataset; in case of the gradient dispersion of the objective function of the original adversarial loss, the least squares loss function is used to replace the cross-entropy loss function and an identity loss function is added to strengthen the network constraints on the generator. The experimental results indicate that the SSIM value of the improved model increases by 1.25%, 8%, 0, 8%, the PSNR value is not much different, and the FID value decreases by 11.22, 12.85, 43.37 and 72.19, respectively, compared with the CycleGAN, Pix2pix, U-GAN-IT and StarGAN models. In the profile image translation, in view of the poor extraction effect of CycleGAN's original feature extraction module ResNet, the RRDB module is used to replace it based on the first improvement. The experimental results show that, compared with the CycleGAN, Pix2pix, U-GAN-IT, StarGAN and the first improved model, the SSIM value of the improved model increased by 3.75%, 10.67%, 2.47%, 10.67% and 2.47%, respectively; the PSNR value increased by 1.02, 2.74, 0.32, 0.66 and 0.02, respectively; the FID value reduced by 26.32, 27.95, 58.47, 87.29 and 15.1, respectively. Subjectively, the contour features and facial features were better conserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available