☆ 4.7 Article

Consistent penalizing field loss for zero-shot image retrieval

EXPERT SYSTEMS WITH APPLICATIONS (2024)

期刊

EXPERT SYSTEMS WITH APPLICATIONS

卷 236, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.eswa.2023.121287

关键词

Image retrieval; Zero-shot; Deep metric learning; Computer vision; Deep learning

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic Operations Research & Management Science

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Zero-shot image retrieval is the task of retrieving images of unseen classes using a query image of the same class. Existing methods for zero-shot image retrieval focus on pushing the decision boundary between intra-class and inter-class similarities. However, using a universal threshold in the inference stage can compromise performance. To address this, we propose a novel Consistent Penalizing Field (CPF) Loss that creates consistent decision boundaries for all classes. Experimental results show that the proposed method outperforms state-of-the-art methods on various datasets.

Zero-shot image retrieval involves retrieving images of unseen classes using a query image of the same class. To determine whether a given image is of the same class as the query image, a universal threshold of similarity measures is needed, as class-specific thresholds are not feasible for unseen classes. However, existing methods for zero-shot image retrieval focus on pushing a margin between intra-class and inter-class similarities for each class during the training phase. This approach can result in varying decision boundaries between intraand inter-class similarities across classes, which could compromise performance when a universal threshold is used in the inference stage. Additionally, for classes with low intra-class variances or inter-class correlations, the pushing force of the margin-pushing approach might be too weak to learn high-quality embeddings. To address these issues, we propose a novel Consistent Penalizing Field (CPF) Loss for zero-shot image retrieval. The proposed method has a single consistent penalizing field for all classes, resulting in similar decision boundaries across classes. By penalizing samples outside the penalizing field, CPF Loss can better utilize the information of samples with highly unbalanced intra-class and inter-class correlations, and improve the discriminative power of DML learning for zero-shot image retrieval. Extensive experiments are conducted on the challenging Shopee Product Matching dataset and other established benchmarks, and the results demonstrate that the proposed method consistently outperforms the state-of-the-art methods. The code is available at https://github.com/cloudlc/CPF.

Consistent penalizing field loss for zero-shot image retrieval

期刊

EXPERT SYSTEMS WITH APPLICATIONS

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Consistent penalizing field loss for zero-shot image retrieval

期刊

EXPERT SYSTEMS WITH APPLICATIONS

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文