What question did this study set out to answer?

This analysis aims to compare the effectiveness of UNet, CGAN, and Swin-Transformer in removing shadows from images.

February 28, 2026Open Access

Performance Comparison of AI Models for Image Shadow Removal: UNet, CGAN, and Swin-Transformer with a Note on Diffusion Models

Key Points

This analysis aims to compare the effectiveness of UNet, CGAN, and Swin-Transformer in removing shadows from images.
Comparison of UNet, CGAN, and Swin-Transformer models on the ISTD benchmark dataset.
Evaluation using quantitative metrics: PSNR, SSIM, RMSE, MAE.
Qualitative visual assessment of shadow removal performance.
Swin-Transformer outperforms other models in detail preservation and artifact reduction.
CGAN demonstrates enhanced perceptual realism.
UNet offers a computationally efficient baseline for image shadow removal applications.

Abstract

This study conducts a comprehensive performance comparison of three prominent deep learning architectures—UNet, Conditional Generative Adversarial Network (CGAN), and Swin-Transformer—for the task of single-image shadow removal, with additional theoretical consideration given to Denoising Diffusion Probabilistic Models (DDPM). Evaluated on the ISTD benchmark dataset using quantitative metrics (PSNR, SSIM, RMSE, MAE) and qualitative visual assessment, the results establish a clear performance hierarchy. The Swin-Transformer model consistently achieves superior results, excelling in detail preservation, artifact reduction, and maintaining global illumination consistency, attributed to its hierarchical structure and shifted-window self-attention mechanism. The CGAN model demonstrates enhanced perceptual realism through adversarial training, while the UNet provides a computationally efficient baseline. The findings offer practical guidance for model selection based on specific application requirements and highlight the impact of architectural design. This analysis concludes by suggesting future research pathways, including the exploration of hybrid models and the empirical application of diffusion models for high-fidelity image restoration tasks.

Bookmark

View Full Paper

Bookmark

View Full Paper

Performance Comparison of AI Models for Image Shadow Removal: UNet, CGAN, and Swin-Transformer with a Note on Diffusion Models

Key Points

Abstract

Cite This Study