☆ 4.7 Article

Moving Towards Centers: Re-Ranking With Attention and Memory for Re-Identification

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

期刊

IEEE TRANSACTIONS ON MULTIMEDIA

卷 25, 期 -, 页码 3456-3468

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TMM.2022.3161189

关键词

Correlation; Probes; Feature extraction; Transformers; Image reconstruction; Benchmark testing; Aggregates; Re-Identification; Transformer; attention; re-ranking; contextual memory

类别

Computer Science, Information Systems Computer Science, Software Engineering Telecommunications

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper proposes a re-ranking network that utilizes contextual information to optimize person or vehicle re-identification (re-ID) ranking list, improving retrieval performance. The network predicts correlations between probe and top-ranked neighbor samples, and expands and enhances the feature embeddings of query and gallery images using a linear combination of their neighbors. The effectiveness of the proposed method is demonstrated through experiments on widely-used person and vehicle re-ID benchmarks.

utilizes contextual information to optimize the initial ranking list of person or vehicle re-identification (re-ID), which boosts the retrieval performance at post-processing steps. This paper proposes a re-ranking network to predict the correlations between the probe and top-ranked neighbor samples. Specifically, all the feature embeddings of query and gallery images are expanded and enhanced by a linear combination of their neighbors, with the correlation prediction serving as discriminative combination weights. The combination process is equivalent to moving independent embeddings toward the identity centers, improving cluster compactness. For correlation prediction, we first aggregate the contextual information for probe's k-nearest neighbors via the Transformer encoder. Then, we distill and refine the probe-related features into the Contextual Memory cell via attention mechanism. Like humans that retrieve images by not only considering probe images but also memorizing the retrieved ones, the Contextual Memory produces multi-view descriptions for each instance. Finally, the neighbors are reconstructed with features fetched from the Contextual Memory, and a binary classifier predicts their correlations with the probe. Experiments on six widely-used person and vehicle re-ID benchmarks demonstrate the effectiveness of the proposed method. Especially, our method surpasses the state-of-the-art re-ranking approaches on large-scale datasets by a significant margin, i.e., with an average 4.83% CMC@1 and 14.83% mAP improvements on VERI-Wild, MSMT17, and VehicleID datasets.

Moving Towards Centers: Re-Ranking With Attention and Memory for Re-Identification

期刊

IEEE TRANSACTIONS ON MULTIMEDIA

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Moving Towards Centers: Re-Ranking With Attention and Memory for Re-Identification

期刊

IEEE TRANSACTIONS ON MULTIMEDIA

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文