☆ 4.6 Article

Application of Improved CycleGAN in Laser-Visible Face Image Translation

SENSORS (2022)

Journal

SENSORS

Volume 22, Issue 11, Pages -

Publisher

MDPI

DOI: 10.3390/s22114057

Keywords

CycleGAN; least squares method; identity loss; RRDB module

Funding

Key Basic Research Projects of the Basic Strengthening Program [2020-JCJQ-ZD-071]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

CycleGAN is widely used in various image translations, such as thermal-infrared-visible-image translation, near-infrared-visible-image translation, and shortwave-infrared-visible-image translation. The paper constructs a laser-visible face mapping dataset and introduces improvements to the original CycleGAN model by replacing the loss function and adding an identity loss function to strengthen network constraints on the generator. Experimental results show improved SSIM values and reduced FID values compared to existing models. Additionally, a new RRDB module is introduced for better feature extraction and preservation in profile image translations, further enhancing the performance metrics.

CycleGAN is widely used in various image translations, such as thermal-infrared-visible-image translation, near-infrared-visible-image translation, and shortwave-infrared-visible-image translation. However, most image translations are used for infrared-to-visible translation, and the wide application of laser imaging has an increasingly strong demand for laser-visible image translation. In addition, the current image translation is mainly aimed at frontal face images, which cannot be effectively utilized to translate faces at a certain angle. In this paper, we construct a laser-visible face mapping dataset; in case of the gradient dispersion of the objective function of the original adversarial loss, the least squares loss function is used to replace the cross-entropy loss function and an identity loss function is added to strengthen the network constraints on the generator. The experimental results indicate that the SSIM value of the improved model increases by 1.25%, 8%, 0, 8%, the PSNR value is not much different, and the FID value decreases by 11.22, 12.85, 43.37 and 72.19, respectively, compared with the CycleGAN, Pix2pix, U-GAN-IT and StarGAN models. In the profile image translation, in view of the poor extraction effect of CycleGAN's original feature extraction module ResNet, the RRDB module is used to replace it based on the first improvement. The experimental results show that, compared with the CycleGAN, Pix2pix, U-GAN-IT, StarGAN and the first improved model, the SSIM value of the improved model increased by 3.75%, 10.67%, 2.47%, 10.67% and 2.47%, respectively; the PSNR value increased by 1.02, 2.74, 0.32, 0.66 and 0.02, respectively; the FID value reduced by 26.32, 27.95, 58.47, 87.29 and 15.1, respectively. Subjectively, the contour features and facial features were better conserved.

Application of Improved CycleGAN in Laser-Visible Face Image Translation

Journal

SENSORS

Publisher

MDPI

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Application of Improved CycleGAN in Laser-Visible Face Image Translation

Journal

SENSORS

Publisher

MDPI

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper