What question did this study set out to answer?

The aim is to improve recognition of new object-state compositions using partially labeled data in pCZSL.

February 23, 2026

Partially Supervised Compositional Zero-Shot Learning by Class-Balanced Distribution Alignment

Key Points

The aim is to improve recognition of new object-state compositions using partially labeled data in pCZSL.
Developed an architecture based on a swin transformer to extract features from state and object combinations.
Utilized a Hierarchical Feature Extractor to capture semantic interactions across different object scales.
Implemented a Discriminative Context Aggregation module to analyze features at corresponding scales.
Introduced strongly and weakly augmented image inputs to enhance learning.
Employed a class-specific distribution alignment loss to manage class imbalance.
Showed improved performance on benchmark datasets compared to existing methods.
Demonstrated that the proposed approach effectively manages data imbalance within partially supervised settings.

Abstract

The partially supervised Compositional Zero-Shot Learning (pCZSL) recognizes new compositions of states and objects, where for every image in the training set either the state or the object annotation is available. In pCZSL, features of a state vary depending on the object in the composition (e.g. the features of state ripe are different for ripe banana and ripe apple). Understanding the variation in features across scales of objects is also a key challenge. In the proposed architecture, a swin transformer based Hierarchical Feature Extractor (HFE) captures the large range of semantic interactions between state and object features. The Discriminative Context Aggregation module utilizes features from the intermediate layers of the HFE to understand the features of object at their corresponding scales. To leverage the partially labeled data in pCZSL, we pass strongly and weakly augmented versions of the input image to the proposed architecture. The predicted class probabilities for strongly and weakly augmented images are encouraged to be similar, minimizing a distribution alignment loss. This loss incorporates class specific re-weighting approach to alleviate the effect of data imbalance for pCZSL. Extensive experiments on three benchmark datasets demonstrate the superiority of the proposed approach.

Bookmark

Partially Supervised Compositional Zero-Shot Learning by Class-Balanced Distribution Alignment

Key Points

Abstract

Cite This Study