For image generation, denoising diffusion probabilistic models (DDPMs) have shown strong performance. Nevertheless, under class-imbalanced training data, many existing models tend to overfit head classes, which degrades image quality for tail classes. To mitigate this issue, we propose a new generation method, PD-CBDM (perceptual distinguish loss–class-balancing diffusion models). As a first step, PD-CBDM revises the target-label distribution used for label sampling in the baseline pipeline, so tail classes are sampled more frequently during training; this improves the diversity of generated images while keeping fidelity high. Next, we introduce a perceptual distinguish loss that enlarges the separation (measured by the KL divergence in the reverse process) between the data distributions of head and tail classes, which helps suppress head-class overfitting and improves generation quality across classes. Additionally, we propose a timestep-dependent Self-Attention (TSA) module that injects timestep cues into the self-attention mechanism to model temporal and spatial dependencies together, thereby enhancing noise estimation accuracy and image generation quality. Experiments show that PD-CBDM improves FID from 5.81 to 4.96 on CIFAR100-LT and from 5.46 to 5.03 on CIFAR10-LT, and it is competitive with representative recent methods such as BPA and NoisyTwins.
Building similarity graph...
Analyzing shared references across papers
Loading...
Junyan Hu
Wei Luo
Tong Chen
Mathematics
Chang'an University
Xi’an University of Posts and Telecommunications
Building similarity graph...
Analyzing shared references across papers
Loading...
Hu et al. (Thu,) studied this question.
www.synapsesocial.com/papers/69fecfafb9154b0b82876ac9 — DOI: https://doi.org/10.3390/math14101576
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: