What question did this study set out to answer?

The study aims to improve road damage detection using UAV imagery through a novel object detection architecture.

March 2, 2026Open Access

Road damage detection method based on UAV imagery and YOLO-SCX

Key Points

The study aims to improve road damage detection using UAV imagery through a novel object detection architecture.
Developed YOLO-SCX based on YOLOv5n with structural optimizations.
Implemented Convolutional Block Attention Module to reduce background noise.
Introduced Grouped SPPCSPC module for enhanced multi-scale feature fusion.
Employed a Decoupled Head for optimized classification and regression tasks.
Used a dataset of 1,500 images segmented for training, validation, and testing.
Achieved a mean Average Precision of 66.3% and Precision of 79.2%.
Showed improvements of 5.8% and 6.0% over the baseline in key performance metrics.
Maintained an inference speed of 185 FPS, suitable for real-time applications.
Model size consists of 8.7 million parameters, making it efficient compared to YOLOv7 and YOLOv8.

Abstract

Automated road damage detection using Unmanned Aerial Vehicle (UAV) imagery is technically constrained by small target dimensions and complex environmental backgrounds. To address these issues within a low-computational budget, this study proposes YOLO-SCX, a computationally efficient object detection architecture based on the YOLOv5n baseline. The methodological novelty of this work lies in the systematic integration of three structural optimizations designed for aerial sensing: (1) the Convolutional Block Attention Module (CBAM) to suppress background noise; (2) a Grouped SPPCSPC module to strengthen multi-scale feature fusion; and (3) a Decoupled Head to independently optimize classification and regression tasks. The research utilizes a composite dataset of 1,500 images derived from UAV-RDD and CrackForest sources, rigorously partitioned into training (1,000), validation (250), and testing (250) sets. Experimental results on the held-out test set demonstrate that YOLO-SCX achieves a mean Average Precision (mAP@0.5) of 66.3% and Precision of 79.2%, representing absolute improvements of 5.8% and 6.0% respectively over the baseline. Furthermore, the model maintains an inference speed of 185 FPS with 8.7 million parameters, confirming its suitability for real-time edge deployment compared to heavier architectures like YOLOv7 and YOLOv8.

Mark Helpful

Bookmark

Relay

View Full Paper