What type of study is this?

September 10, 2025

Multi-level Contextual Prototype Modulation for Compositional Zero-shot Learning

Key Points

MCPM improves performance in compositional zero-shot learning tasks by enhancing feature discrimination.
Using contrastive learning, the method increases visual embedding quality across different attribute-object compositions.
The introduction of a subclass-driven modulator captures nuanced interactions between attributes and objects.
MCPM's minority attribute enhancement strategy synthesizes samples to address data imbalance issues effectively.

Abstract

Compositional Zero-Shot Learning (CZSL) aims to recognize unseen attribute-object compositions by leveraging prior knowledge of known primitives. However, real-world visual features of attributes and objects are often entangled, causing distribution shifts between seen and unseen combinations. Existing methods often ignore intrinsic variations and interactions among primitives, leading to poor feature discrimination and biased predictions. To address these challenges, we propose Multi-level Contextual Prototype Modulation (MCPM), a transformer-based framework with a hierarchical structure that effectively integrates attributes and objects to generate richer visual embeddings. At the feature level, we apply contrastive learning to improve discriminability across compositional tasks. At the prototype level, a subclass-driven modulator captures fine-grained attribute-object interactions, enabling better adaptation to long-tail distributions. Additionally, we introduce a Minority Attribute Enhancement (MAE) strategy that synthesizes virtual samples by mixing attribute classes, further mitigating data imbalance. Experiments on four benchmark datasets (MIT-States, C-GQA, UT-Zappos, and VAW-CZSL) show that MCPM brings significant performance improvements, verifying its effectiveness in complex composition scenes.

Bookmark

Cite This Study

Liu et al. (Wed,) studied this question.

synapsesocial.com/papers/68c19f9c54b1d3bfb60db2c5 https://doi.org/https://doi.org/10.1109/tip.2025.3592560

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Bookmark