What question did this study set out to answer?

This research aims to enhance image quality and improve object detection in low-light traffic scenarios using a novel network architecture.

May 1, 2026 | Open Access

PGT-Net: A Physics-Guided Transformer–CNN Hybrid Network for Low-Light Image Enhancement and Object Detection in Traffic Scenes

Key points

Developed a physics-guided transformer-CNN hybrid network (PGT-Net) for image enhancement and object detection.
Integrated atmospheric scattering physical models with a dual-branch enhancement backbone comprising a CNN and a transformer.
Conducted experiments on multiple public datasets, including ExDark and BDD100K-night, to benchmark performance.
PGT-Net achieved significantly higher PSNR and SSIM values compared to traditional methods, indicating improved image quality.
Demonstrated higher object detection accuracy (mAP) compared to mainstream methods like RetinexNet and KinD.
Maintained high inference efficiency, showcasing practical applicability for real-time traffic systems.

Abstract

In autonomous driving and intelligent transportation systems, image degradation under low-light conditions severely reduces the reliability of downstream object detection. Existing enhancement methods are predominantly data-driven deep learning models; they often lack physical interpretability and struggle to remain robust in traffic scenes with complex, varying illumination. To address this, this paper proposes a Physics-Guided Transformer–CNN Hybrid Network (PGT-Net) for end-to-end joint optimization of low-light enhancement and object detection. PGT-Net integrates the atmospheric scattering physical model with a deep learning architecture in three stages. First, a learnable physical guidance branch estimates the scene's atmospheric illumination map and transmittance map, providing explicit physical priors for the network. Second, a dual-branch enhancement backbone processes the image: a local CNN branch (based on an improved UNet) restores fine textures, while a global Transformer branch (based on Swin Transformer) models long-range dependencies to correct globally uneven illumination. A Physical Fusion Module adaptively combines the features of the two branches so that the enhanced result remains consistent with the physical model while retaining rich visual features. Finally, the enhanced images are fed directly into a lightweight detection head (e.g., YOLOv7) for joint training and optimization. Experiments on public datasets (ExDark, BDD100K-night, etc.) show that PGT-Net significantly outperforms mainstream methods (e.g., RetinexNet, KinD, Zero-DCE) in both enhancement quality (PSNR/SSIM) and object detection accuracy (mAP), while maintaining high inference efficiency. The result is an interpretable, high-performance approach to visual perception under adverse lighting conditions, with both theoretical significance and practical value.
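
To make the physical prior concrete, the sketch below inverts the standard atmospheric scattering model, I = J * t + A * (1 - t), where I is the observed image, J the scene radiance, t the transmittance map, and A the atmospheric illumination. The abstract does not give the exact formulation PGT-Net uses, so this is an assumption: the function name `recover_scene`, the `t_min` floor, and the synthetic data are hypothetical and illustrate only the general model, not the authors' implementation. In PGT-Net the maps t and A would be predicted by the learnable physical guidance branch rather than supplied by hand.

```python
import numpy as np


def recover_scene(observed, transmission, airlight, t_min=0.1):
    """Invert I = J * t + A * (1 - t) for the scene radiance J.

    observed:     (H, W, 3) image with values in [0, 1]
    transmission: (H, W) transmittance map with values in (0, 1]
    airlight:     per-channel vector (3,) or a map broadcastable to (H, W, 3)
    t_min:        hypothetical floor that avoids dividing by near-zero t
    """
    # Clamp the transmittance and add a channel axis for broadcasting.
    t = np.clip(transmission, t_min, 1.0)[..., None]
    restored = (observed - airlight) / t + airlight
    return np.clip(restored, 0.0, 1.0)


# Synthetic round trip: degrade a known scene, then invert the model.
rng = np.random.default_rng(0)
scene = rng.uniform(0.2, 0.8, size=(4, 4, 3))
transmission = rng.uniform(0.3, 0.9, size=(4, 4))
airlight = np.array([0.05, 0.05, 0.08])

t3 = transmission[..., None]
observed = scene * t3 + airlight * (1.0 - t3)
restored = recover_scene(observed, transmission, airlight)
```

With exact maps the inversion reproduces the original scene up to floating-point error. In practice the estimated maps are imperfect, which is one reason to pair the physical prior with learned CNN and Transformer features, as the abstract describes, rather than rely on the inversion alone.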
