4.7 Article

Adaptive Ladder Loss for Learning Coherent Visual-Semantic Embedding

Journal

IEEE TRANSACTIONS ON MULTIMEDIA
Volume 25, Issue -, Pages 1133-1147

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2021.3139210

Keywords

Measurement; Visualization; Semantics; Training; Loss measurement; User experience; Extraterrestrial measurements; Coherent visual-semantic embedding; adaptive ladder loss; hard-contrastive sampling; coherent score

Ask authors/readers for more resources

This paper introduces a continuous variable model to capture the relevance degree between queries and multiple candidates, and proposes to learn a coherent embedding space where candidates with higher relevance degrees are mapped closer to the query. A new ladder loss is proposed to implement variable push-away margins based on relevance degrees, and a Silhouette score-based method is proposed to adaptively decide the ladder level. Extensive experiments show that the proposed method outperforms existing state-of-the-art methods.
For visual-semantic embedding, the existing methods normally treat the relevance between queries and candidates in a bipolar way - relevant or irrelevant, and all irrelevant candidates are uniformly pushed away from the query by an equal margin in the embedding space, regardless of their various proximity to the query. This practice disregards relatively discriminative information and could lead to suboptimal ranking in the retrieval results and poorer user experience, especially in the long-tail query scenario where a matching candidate may not necessarily exist. In this paper, we introduce a continuous variable to model the relevance degree between queries and multiple candidates, and propose to learn a coherent embedding space, where candidates with higher relevance degrees are mapped closer to the query than those with lower relevance degrees. In particular, the new ladder loss is proposed by extending the triplet loss inequality to a more general inequality chain, which implements variable push-away margins according to respective relevance degrees. To adapt to the varying mini-batch statistics and improve the efficiency of the ladder loss, we also propose a Silhouette score-based method to adaptively decide the ladder level and hence the underlying inequality chain. In addition, a proper Coherent Score metric is proposed to better measure the ranking results including those irrelevant candidates. Extensive experiments on multiple datasets validate the efficacy of our proposed method, which achieves significant improvement over existing state-of-the-art methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available