3.8 Proceedings Paper

Robust Mean Teacher for Continual and Gradual Test-Time Adaptation

Test-time adaptation (TTA) is a method proposed to deal with domain shifts during testing, and recently there emerged the area of continual and gradual TTA. In contrast to standard TTA which considers a single domain shift, continual TTA considers a sequence of shifts, and gradual TTA further exploits the property that some shifts evolve gradually over time. In this work, the authors propose that the symmetric cross-entropy is better suited as a consistency loss for mean teachers in TTA setting compared to the commonly used cross-entropy. The effectiveness of the proposed method robust mean teacher (RMT) is demonstrated on various benchmarks.
Since experiencing domain shifts during test-time is inevitable in practice, test-time adaption (TTA) continues to adapt the model after deployment. Recently, the area of continual and gradual test-time adaptation (TTA) emerged. In contrast to standard TTA, continual TTA considers not only a single domain shift, but a sequence of shifts. Gradual TTA further exploits the property that some shifts evolve gradually over time. Since in both settings long test sequences are present, error accumulation needs to be addressed for methods relying on self-training. In this work, we propose and show that in the setting of TTA, the symmetric cross-entropy is better suited as a consistency loss for mean teachers compared to the commonly used crossentropy. This is justified by our analysis with respect to the (symmetric) cross-entropy's gradient properties. To pull the test feature space closer to the source domain, where the pre-trained model is well posed, contrastive learning is leveraged. Since applications differ in their requirements, we address several settings, including having source data available and the more challenging source-free setting. We demonstrate the effectiveness of our proposed method robust mean teacher (RMT) on the continual and gradual corruption benchmarks CIFAR10C, CIFAR100C, and Imagenet-C. We further consider ImageNet-R and propose a new continual DomainNet-126 benchmark. State-of-the-art results are achieved on all benchmarks. (1)

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available