3.8 Proceedings Paper

Joint Noise-Tolerant Learning and Meta Camera Shift Adaptation for Unsupervised Person Re-Identification

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/CVPR46437.2021.00482

Keywords

-

Funding

  1. National Nature Science Foundation of China [61876159, 61806172, 61662024, 62076116, U1705286]
  2. China Postdoctoral Science Foundation [2019M652257]
  3. Fundamental Research Funds for the Central Universities (Xiamen University) [20720200030]
  4. EU H2020 AI4Media [951911]
  5. Italy-China collaboration project [TALENT:2018YFE0118400]

Ask authors/readers for more resources

This paper proposes a unified framework to solve the problem of unsupervised person re-identification by introducing a Dynamic and Symmetric Cross-Entropy loss and a camera-aware meta-learning algorithm. The proposed method effectively addresses noisy samples and camera shift issues, outperforming state-of-the-art methods in both fully unsupervised re-ID and unsupervised domain adaptive re-ID.
This paper considers the problem of unsupervised person re-identification (re-ID), which aims to learn discriminative models with unlabeled data. One popular method is to obtain pseudo-label by clustering and use them to optimize the model. Although this kind of approach has shown promising accuracy, it is hampered by 1) noisy labels produced by clustering and 2) feature variations caused by camera shift. The former will lead to incorrect optimization and thus hinders the model accuracy. The latter will result in assigning the intra-class samples of different cameras to different pseudo-label, making the model sensitive to camera variations. In this paper, we propose a unified framework to solve both problems. Concretely, we propose a Dynamic and Symmetric Cross-Entropy loss (DSCE) to deal with noisy samples and a camera-aware meta-learning algorithm (MetaCam) to adapt camera shift. DSCE can alleviate the negative effects of noisy samples and accommodate the change of clusters after each clustering step. MetaCam simulates cross-camera constraint by splitting the training data into meta-train and meta-test based on camera IDs. With the interacted gradient from meta-train and meta-test, the model is enforced to learn camera-invariant features. Extensive experiments on three re-ID benchmarks show the effectiveness and the complementary of the proposed DSCE and MetaCam. Our method outperforms the state-of-the-art methods on both fully unsupervised re-ID and unsupervised domain adaptive re-ID.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available