Journal
IEEE TRANSACTIONS ON MULTIMEDIA
Volume 25, Issue -, Pages 1649-1664Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2022.3145666
Keywords
Bidirectional network; generative model; zero-shot learning
Ask authors/readers for more resources
Zero-shot learning aims to recognize unknown categories that are not available during training. Generative models have shown potential to address this problem by synthesizing unseen features based on semantic embeddings. We propose a visual-semantic aligned bidirectional network with cycle consistency to bridge the gap between visual and semantic spaces and generate high-quality unseen features. Two carefully designed strategies are incorporated to improve the overall ZSL performance by enhancing intra-domain class divergence and mitigating inter-domain shift.
Zero-shot learning (ZSL) aims to recognize unknown categories that are unavailable during training. Recently, generative models have shown the potential to address this challenging problem by synthesizing unseen features conditioned on semantic embeddings such as attributes. However, unidirectional generative models cannot guarantee the effective coupling between visual and semantic spaces. To this end, we propose a visual-semantic aligned bidirectional network with cycle consistency to alleviate the gap between these two spaces, generating unseen features of high quality. More importantly, we incorporate two carefully designed strategies into our bidirectional framework to improve the overall ZSL performance. Specifically, we enhance the intra-domain class divergence in both visual and semantic spaces, and in the meantime, mitigate the inter-domain shift to preserve seen-unseen domain discrimination. Experimental results on four standard benchmarks show the superiority of our framework over existing state-of-the-art methods under both conventional and generalized ZSL settings.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available