4.7 Article

RANet: A Reliability-Guided Aggregation Network for Hyperspectral and RGB Fusion Tracking

Journal

REMOTE SENSING
Volume 14, Issue 12, Pages -

Publisher

MDPI
DOI: 10.3390/rs14122765

Keywords

fusion tracking; hyperspectral image; transformer; deep learning

Funding

  1. National Natural Science Foundation of China [62071136, 62002083, 61971153, 61801142]
  2. Heilongjiang Postdoctoral Foundation [LBH-Q20085, LBH-Z20051]

Ask authors/readers for more resources

Object tracking using RGB images may fail when the object's color is similar to the background. Hyperspectral images provide more spectral information for RGB-based trackers, but there is currently no fusion tracking algorithm for hyperspectral and RGB images. The proposed reliability-guided aggregation network (RANet) combines hyperspectral and RGB information to improve tracking performance, with the RANet achieving the best performance accuracy among the tested trackers.
Object tracking based on RGB images may fail when the color of the tracked object is similar to that of the background. Hyperspectral images with rich spectral features can provide more information for RGB-based trackers. However, there is no fusion tracking algorithm based on hyperspectral and RGB images. In this paper, we propose a reliability-guided aggregation network (RANet) for hyperspectral and RGB tracking, which guides the combination of hyperspectral information and RGB information through modality reliability to improve tracking performance. Specifically, a dual branch based on the Transformer Tracking (TransT) structure is constructed to obtain the information of hyperspectral and RGB modalities. Then, a classification response aggregation module is designed to combine the different modality information by fusing the response predicted through the classification head. Finally, the reliability of different modalities is also considered in the aggregation module to guide the aggregation of the different modality information. Massive experimental results on the public dataset composed of hyperspectral and RGB image sequences show that the performance of the tracker based on our fusion method is better than that of the corresponding single-modality tracker, which fully proves the effectiveness of the fusion method. Among them, the RANet tracker based on the TransT tracker achieves the best performance accuracy of 0.709, indicating the effectiveness and superiority of the RANet tracker.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available