4.7 Article

Consistent penalizing field loss for zero-shot image retrieval

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 236, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2023.121287

关键词

Image retrieval; Zero-shot; Deep metric learning; Computer vision; Deep learning

向作者/读者索取更多资源

Zero-shot image retrieval is the task of retrieving images of unseen classes using a query image of the same class. Existing methods for zero-shot image retrieval focus on pushing the decision boundary between intra-class and inter-class similarities. However, using a universal threshold in the inference stage can compromise performance. To address this, we propose a novel Consistent Penalizing Field (CPF) Loss that creates consistent decision boundaries for all classes. Experimental results show that the proposed method outperforms state-of-the-art methods on various datasets.
Zero-shot image retrieval involves retrieving images of unseen classes using a query image of the same class. To determine whether a given image is of the same class as the query image, a universal threshold of similarity measures is needed, as class-specific thresholds are not feasible for unseen classes. However, existing methods for zero-shot image retrieval focus on pushing a margin between intra-class and inter-class similarities for each class during the training phase. This approach can result in varying decision boundaries between intraand inter-class similarities across classes, which could compromise performance when a universal threshold is used in the inference stage. Additionally, for classes with low intra-class variances or inter-class correlations, the pushing force of the margin-pushing approach might be too weak to learn high-quality embeddings. To address these issues, we propose a novel Consistent Penalizing Field (CPF) Loss for zero-shot image retrieval. The proposed method has a single consistent penalizing field for all classes, resulting in similar decision boundaries across classes. By penalizing samples outside the penalizing field, CPF Loss can better utilize the information of samples with highly unbalanced intra-class and inter-class correlations, and improve the discriminative power of DML learning for zero-shot image retrieval. Extensive experiments are conducted on the challenging Shopee Product Matching dataset and other established benchmarks, and the results demonstrate that the proposed method consistently outperforms the state-of-the-art methods. The code is available at https://github.com/cloudlc/CPF.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据