4.7 Article

Visual-Semantic Aligned Bidirectional Network for Zero-Shot Learning

Journal

IEEE TRANSACTIONS ON MULTIMEDIA
Volume 25, Issue -, Pages 1649-1664

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2022.3145666

Keywords

Bidirectional network; generative model; zero-shot learning

Ask authors/readers for more resources

Zero-shot learning aims to recognize unknown categories that are not available during training. Generative models have shown potential to address this problem by synthesizing unseen features based on semantic embeddings. We propose a visual-semantic aligned bidirectional network with cycle consistency to bridge the gap between visual and semantic spaces and generate high-quality unseen features. Two carefully designed strategies are incorporated to improve the overall ZSL performance by enhancing intra-domain class divergence and mitigating inter-domain shift.
Zero-shot learning (ZSL) aims to recognize unknown categories that are unavailable during training. Recently, generative models have shown the potential to address this challenging problem by synthesizing unseen features conditioned on semantic embeddings such as attributes. However, unidirectional generative models cannot guarantee the effective coupling between visual and semantic spaces. To this end, we propose a visual-semantic aligned bidirectional network with cycle consistency to alleviate the gap between these two spaces, generating unseen features of high quality. More importantly, we incorporate two carefully designed strategies into our bidirectional framework to improve the overall ZSL performance. Specifically, we enhance the intra-domain class divergence in both visual and semantic spaces, and in the meantime, mitigate the inter-domain shift to preserve seen-unseen domain discrimination. Experimental results on four standard benchmarks show the superiority of our framework over existing state-of-the-art methods under both conventional and generalized ZSL settings.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available