Article

A framework to enhance generalization of deep metric learning methods using general discriminative feature learning and class adversarial neural networks

Journal

APPLIED INTELLIGENCE
Volume 53, Issue 8, Pages 8693-8711

Publisher

SPRINGER
DOI: 10.1007/s10489-022-03959-6

Keywords

Deep metric learning; Similarity embedding; Zero-shot learning; General discriminative feature learning; Adversarial neural network


This paper presents a framework to enhance the generalization ability of existing deep metric learning methods in zero-shot learning tasks. By employing general yet discriminative representation learning and a class adversarial neural network, the framework addresses the limitations of DML methods in certain applications.
Deep Metric Learning (DML) methods automatically extract features from data and learn a non-linear transformation from the input space to a semantic embedding space. Many DML methods focus on enhancing the discrimination power of the learned metric by proposing novel sampling strategies or loss functions. This approach is very helpful when both the training and test examples are drawn from the same set of categories. However, it is less effective in many applications of DML, such as image retrieval and person re-identification. Here, the DML model should learn general semantic concepts from observed classes and employ them to rank or identify objects from unseen categories. Neglecting the generalization ability of the learned representation and emphasizing only a more discriminative embedding on the observed classes may lead to overfitting. To address this limitation, we propose a framework that enhances the generalization power of existing DML methods in a Zero-Shot Learning (ZSL) setting through general yet discriminative representation learning and a class adversarial neural network. To learn a general representation, we employ feature maps of intermediate layers in a deep neural network and enhance their discrimination power through an attention mechanism. In addition, a class adversarial network is utilized to force the deep model to seek class-invariant features. We evaluate our work on widely used machine vision datasets in a ZSL setting. Extensive experimental results confirm that our framework can improve the generalization of existing DML methods, and it consistently outperforms baseline DML algorithms on unseen classes.
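
The zero-shot setting described in the abstract, where embeddings are judged by how well they rank objects from categories never seen during training, can be illustrated with a small sketch. This is not the authors' evaluation code. The abstract does not name a metric, so Recall@1 with Euclidean distance is an assumption here, and the function names `split_by_class` and `recall_at_1` as well as the toy labels and embeddings are hypothetical.

```python
import math


def split_by_class(labels, seen_classes):
    """Split sample indices so training and test classes are disjoint.

    Samples whose label is in `seen_classes` go to training; all
    remaining samples (the unseen classes) go to testing.
    """
    seen = set(seen_classes)
    train_idx = [i for i, y in enumerate(labels) if y in seen]
    test_idx = [i for i, y in enumerate(labels) if y not in seen]
    return train_idx, test_idx


def recall_at_1(embeddings, labels):
    """Fraction of queries whose nearest other sample shares their label.

    Assumption: Euclidean distance in the embedding space; each sample
    is used as a query against all other samples in the same set.
    """
    hits = 0
    for i, query in enumerate(embeddings):
        best_j, best_d = None, math.inf
        for j, other in enumerate(embeddings):
            if j == i:
                continue  # a query never retrieves itself
            d = math.dist(query, other)
            if d < best_d:
                best_j, best_d = j, d
        hits += labels[best_j] == labels[i]
    return hits / len(embeddings)


# Toy data (hypothetical): "cat" and "dog" are seen, "fox" and "owl" are unseen.
labels = ["cat", "cat", "dog", "dog", "fox", "fox", "owl", "owl"]
embeddings = [
    (0.0, 0.0), (0.1, 0.0),   # cat
    (1.0, 1.0), (1.1, 1.0),   # dog
    (5.0, 5.0), (5.1, 5.0),   # fox
    (5.3, 5.0), (9.0, 9.0),   # owl: one sample sits close to the fox cluster
]

train_idx, test_idx = split_by_class(labels, seen_classes=["cat", "dog"])
unseen_embeddings = [embeddings[i] for i in test_idx]
unseen_labels = [labels[i] for i in test_idx]
print(recall_at_1(unseen_embeddings, unseen_labels))  # 0.75
```

In this toy case, one "owl" sample lies nearer to the "fox" cluster than to the other "owl" sample, so its nearest neighbour has the wrong label and Recall@1 on the unseen classes is 0.75. An embedding that overfits to the seen classes would be expected to show this kind of confusion on unseen ones, which is the failure mode the framework targets.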


