For vehicle driving scenarios in complex backgrounds, road sign detection faces challenges such as multi-scale targets, long-distances, and low-resolution. To address these challenges, a detection method based on an improved YOLO11n network is proposed. Firstly, to accommodate the multi-scale characteristics of the targets and improve the network’s ability to detect low-resolution objects and details, a Multi-path Gated Aggregation (MGA) Module is proposed, achieving these objectives via multi-dimensional feature extraction. Secondly, the Neck is improved by designing a network structure that incorporates high-resolution information from the Backbone, thereby enhancing the detection capabilities for small and blurry targets. Finally, an enhanced Spatial Pyramid Pooling—Fast (SPPF) module is proposed, wherein a Group Convolution-Layer Normalization-SiLU structure is integrated across various stages of information passing. By fusing adjacent channel information, it effectively suppresses complex background noise across multiple scales and amplifies road marking features, which consequently boosts the model’s discriminability for distant and obscured targets. Experimental results on a multi-type road sign dataset show that the improved model achieves an mAP@0.5 of 96.96%, which is 1.42% higher than the original model. The mAP@0.5–0.95 and Recall rates are 83.94% and 92.94%, respectively, while the inference speed remains at 134 FPS. Research demonstrates that via targeted modular designs, the proposed approach strikes a superior balance between detection accuracy and real-time efficiency. Consequently, it provides robust technical support for the reliable operation of intelligent vehicle perception systems under complex conditions.
Fu et al. (Mon,) studied this question.