Article

Knowledge distillation methods for efficient unsupervised adaptation across multiple domains

Journal

IMAGE AND VISION COMPUTING
Volume 108

Publisher

ELSEVIER
DOI: 10.1016/j.imavis.2021.104096

Keywords

Deep learning; Convolutional NNs; Knowledge distillation; Unsupervised domain adaptation; CNN acceleration and compression

Funding

  1. Mathematics of Information Technology and Complex Systems (MITACS)
  2. Natural Sciences and Engineering Research Council of Canada (NSERC)

Abstract

Beyond the complexity of CNNs that require training on large annotated datasets, the domain shift between design and operational data has limited the adoption of CNNs in many real-world applications. For instance, in person re-identification, videos are captured over a distributed set of cameras with non-overlapping viewpoints. The shift between the source (e.g. lab setting) and target (e.g. cameras) domains may lead to a significant decline in recognition accuracy. Additionally, state-of-the-art CNNs may not be suitable for such real-time applications given their computational requirements. Although several techniques have recently been proposed to address domain shift problems through unsupervised domain adaptation (UDA), or to accelerate/compress CNNs through knowledge distillation (KD), we seek to simultaneously adapt and compress CNNs to generalize well across multiple target domains. In this paper, we propose a progressive KD approach for unsupervised single-target DA (STDA) and multi-target DA (MTDA) of CNNs. Our method for KD-STDA adapts a CNN to a single target domain by distilling from a larger teacher CNN, trained on both target and source domain data, in order to maintain its consistency with a common representation. This method is extended to address MTDA problems, where multiple teachers are used to distill knowledge from multiple target domains to a common student CNN. A different target domain is assigned to each teacher model for UDA, and the teachers alternately distill their knowledge to the student model to preserve the specificity of each target, instead of directly combining the knowledge from each teacher using fusion methods. Our proposed approach is compared against state-of-the-art methods for compression and STDA of CNNs on the Office31 and ImageClef-DA image classification datasets. It is also compared against state-of-the-art methods for MTDA on Digits, Office31, and OfficeHome. In both settings (KD-STDA and KD-MTDA), results indicate that our approach can achieve the highest level of accuracy across target domains, while requiring a comparable or lower CNN complexity. © 2021 Published by Elsevier B.V.
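
The alternating multi-teacher distillation described in the abstract could be sketched roughly as follows. This is a minimal illustration based only on the abstract, assuming PyTorch; the model, loader, and optimizer names are hypothetical placeholders, and the plain soft-target (Hinton-style) KD loss stands in for the authors' actual progressive distillation objective.

  import torch
  import torch.nn.functional as F

  def kd_loss(student_logits, teacher_logits, T=4.0):
      # Soft-target distillation loss: KL divergence between temperature-softened
      # teacher and student class distributions.
      p_teacher = F.softmax(teacher_logits / T, dim=1)
      log_p_student = F.log_softmax(student_logits / T, dim=1)
      return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * (T * T)

  def distill_epoch(student, teachers, target_loaders, optimizer, device="cpu"):
      # Each teacher is assumed to have already been adapted (via UDA) to one
      # target domain; the teachers alternately distill into the single student,
      # one target domain at a time, rather than fusing their outputs.
      student.train()
      for teacher, loader in zip(teachers, target_loaders):
          teacher.eval()
          for images in loader:  # unlabelled target-domain image batches
              images = images.to(device)
              with torch.no_grad():
                  teacher_logits = teacher(images)
              student_logits = student(images)
              loss = kd_loss(student_logits, teacher_logits)
              optimizer.zero_grad()  # optimizer holds only the student's parameters
              loss.backward()
              optimizer.step()

In practice the student would typically be a smaller backbone than the teachers, so the same loop performs both compression and multi-target adaptation.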
