Unmanned aerial vehicle (UAV) imagery offers a promising alternative to manual and vehicle-based inspection for highway pavement distress detection, but the high-angle perspective reduces the relative size and feature richness of small distresses and amplifies aliasing during downsampling, limiting the accuracy of existing detectors. To address these problems, this paper proposes an improved YOLOv8 algorithm with four coordinated modifications: (i) a Feature-Focusing Diffusion Pyramid Network (FFDPN) that replaces the conventional PAN to strengthen multi-scale feature fusion and preserve fine-grained details; (ii) an Information Interaction Detection Head (IIDH) that replaces the decoupled dual-branch head, sharing interaction features between the classification and regression branches via deformable convolution (DCNv2) to reduce parameters while improving task synergy; (iii) an Edge Information Extraction Module (EIEM) placed at the front of the backbone, which uses Sobel-based gradient response plus max-pooling to inject low-level edge priors; and (iv) a WaveletPool downsampling operator that decomposes features into LL/LH/HL/HH sub-bands to suppress aliasing of small-scale distresses. Experiments on 3408 UAV images of four distress categories (transverse, longitudinal, and alligator cracks and potholes) show that the improved model reaches 93.7% Precision, 89.6% Recall, and 96.0% mAP@0.50—a 12.2 percentage-point gain over YOLOv8n—while using only 2.41 × 106 parameters and outperforming Faster R-CNN, DETR, YOLOv7-tiny, YOLOv9, YOLOv10n, YOLOv11n, and YOLO-World on the same benchmark. The model eliminates the duplicate and missed detections observed in baselines, at a moderate cost in FPS (30.3 vs. 57.1 for YOLOv8n).
Building similarity graph...
Analyzing shared references across papers
Loading...
Dudu Guo
Wenxing Cai
Hongbo Shuai
Remote Sensing
Xinjiang University
Research Institute of Highway
Xinjiang Institute of Engineering
Building similarity graph...
Analyzing shared references across papers
Loading...
Guo et al. (Thu,) studied this question.
www.synapsesocial.com/papers/69fed10fb9154b0b82878498 — DOI: https://doi.org/10.3390/rs18101461