Article

Visible-Infrared Person Re-Identification via Homogeneous Augmented Tri-Modal Learning

Journal

IEEE Transactions on Information Forensics and Security

Publisher

IEEE (Institute of Electrical and Electronics Engineers Inc.)
DOI: 10.1109/TIFS.2020.3001665

Keywords

Gray-scale; Image color analysis; Cameras; Training; Task analysis; Face recognition; Surveillance; Person re-identification (Re-ID); multi-modality; ranking


The paper proposes a Homogeneous Augmented Tri-Modal (HAT) learning method for matching person images between the daytime visible and night-time infrared modalities; by generating a grayscale auxiliary modality that enforces structural relations across modalities, HAT significantly outperforms current state-of-the-art methods.
Matching person images between the daytime visible modality and the night-time infrared modality (VI-ReID) is a challenging cross-modality pedestrian retrieval problem. Existing methods usually learn multi-modality features from raw images, ignoring the image-level discrepancy. Some methods apply GAN techniques to generate cross-modality images, but this destroys local structure and introduces unavoidable noise. In this paper, we propose a Homogeneous Augmented Tri-Modal (HAT) learning method for VI-ReID, in which an auxiliary grayscale modality is generated from the homogeneous visible images without any additional training process. The grayscale modality preserves the structure information of the visible images while approximating the image style of the infrared modality. Training with the grayscale visible images forces the network to mine structural relations across multiple modalities, making it robust to color variations. Specifically, we address tri-modal feature learning from both a multi-modal classification and a multi-view retrieval perspective. For multi-modal classification, we learn a modality-shared identity classifier with a parameter-sharing network, trained with a homogeneous and heterogeneous identification loss. For multi-view retrieval, we develop a weighted tri-directional ranking loss to optimize the relative distances across multiple modalities. Incorporated with two invariant regularizers, HAT simultaneously minimizes multiple modality variations. In-depth analysis demonstrates that the homogeneous grayscale augmentation outperforms the current state of the art by a large margin.
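The auxiliary modality described above amounts to an ordinary RGB-to-grayscale conversion of each visible image, replicated to three channels so it passes through the same network as the visible and infrared inputs. A minimal sketch (the specific ITU-R BT.601 luma weights are an assumption for illustration; the abstract does not specify the conversion):

```python
import numpy as np

def grayscale_auxiliary(rgb):
    """Generate a grayscale auxiliary image from a visible (RGB) image.

    The single luminance channel is replicated to three channels so the
    auxiliary image has the same shape as the visible and infrared
    inputs. Uses BT.601 luma weights (an illustrative choice).
    """
    rgb = np.asarray(rgb, dtype=np.float32)                          # H x W x 3
    gray = rgb @ np.array([0.299, 0.587, 0.114], dtype=np.float32)   # H x W
    return np.repeat(gray[..., None], 3, axis=-1)                    # H x W x 3

# Example: a toy 2x2 "visible" image
visible = np.random.rand(2, 2, 3).astype(np.float32)
aux = grayscale_auxiliary(visible)
```

Because the conversion is deterministic and training-free, the grayscale images can be generated on the fly during data loading, in contrast to GAN-based cross-modality generation.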

Authors

