What question did this study set out to answer?

This work aims to improve detection and segmentation accuracy for driving perception systems under low-light conditions by relocating enhancement to the FPGA front end.

March 18, 2026Open Access

FPGA-Based Front-End Low-Light Enhancement for Deterministic Vision-Only Driving Perception

Key Points

This work aims to improve detection and segmentation accuracy for driving perception systems under low-light conditions by relocating enhancement to the FPGA front end.
Designed a hardware-accelerated visual pipeline on FPGA
Performed color space conversion and fixed-point convolutional enhancement
Executed multi-channel fusion prior to perception on an ARM processor
Compared latency and throughput with software-based enhancement approaches
Achieved only 13 ms additional processing latency with FPGA enhancement
Maintained a sustained throughput of 58 fps for real-time operations
Showed zero net overhead on the end-to-end processing pipeline compared to serial software approaches

Abstract

Vision-only driving perception systems are highly sensitive to illumination variations, particularly under low-light conditions where reduced contrast and structural degradation impair detection and segmentation accuracy. Rather than treating enhancement as a post-processing step, this work investigates the system-level impact of relocating low-light enhancement to the FPGA-based front end within a heterogeneous FPGA–ARM architecture. A hardware-accelerated visual pipeline is designed to perform color space conversion, fixed-point convolutional enhancement, and multi-channel fusion prior to high-level perception on the ARM processor. Experimental results demonstrate that the proposed FPGA-based front-end enhancement introduces only 13 ms of additional processing latency, which executes in parallel with the preceding frame’s neural network inference and therefore imposes zero net overhead on the end-to-end pipeline. In contrast, an equivalent software-based back-end enhancement approach would add its full processing time serially to the inference stage, increasing total system latency proportionally. The system achieves a sustained throughput of 58 fps while supporting real-time multi-task perception including lane detection (YOLOPv2, 539 ms per frame), object detection and emergency braking (YOLOv5, 432 ms per frame), and hardware-level multi-camera synchronization.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Xie et al. (Sun,) studied this question.

synapsesocial.com/papers/69ba43f74e9516ffd37a5c17 https://doi.org/https://doi.org/10.3390/electronics15061224

Bookmark

View Full Paper