What type of study is this?

This is a Experimental Study study.

September 16, 2025

Zero-shot Image Recognition via Learning Dual Prototype Accordance Across Meta-domains

Key Points

BPRN enhances zero-shot learning by refining dual prototypes for better alignment.
Improvements of 2.1% to 7.3% on benchmark datasets indicate its effectiveness over existing methods.
The proposed framework addresses semantic gaps by disentangling class-level semantics for prototype generation.
Extensive experiments validate BPRN's superior performance and efficiency through ablation studies and visualizations.

Abstract

Zero-shot learning (ZSL) aims to recognize unseen classes by transferring semantic knowledge from seen categories. However, existing methods often struggle with the persistent semantic gap caused by limited semantic descriptors and rigid visual feature modeling. In particular, modeling pre-defined class-level attribute descriptions as ground truth hinders effective semantic-to-visual alignment to some extent. To mitigate these issues, we propose the Bilateral-guided Prototype Refinement Network(BPRN), a novel ZSL framework designed to refine dual prototypes across meta-domains of varying scales. Specifically, we first disentangle the relationships among class-level semantics and use them to generate corresponding pseudo-visual prototypes. Then, by leveraging distribution information across dual prototypes in different meta-domains, BPRN achieves bidirectional calibration between visual-to-semantic and semantic-to-visual modalities. Finally, a synthesized class-level representation derived from the refined dual prototypes is employed for inference, instead of relying on a single prototype. Extensive experiments conducted on five widely-used ZSL benchmark datasets demonstrate that BPRN consistently achieves competitive or even superior performance. Specifically, in the GZSL scenario, BPRN shows improvements of 2.1%, 7.3%, 6.1%, and 4.8% on AWA1, AWA2, SUN, and aPY, respectively, compared to existing embedding-based ZSL methods. Ablation studies and visualization analyses further validate the effectiveness of the proposed components.

Mark Helpful

Bookmark

Relay