Small object detection in UAV aerial imagery poses significant challenges due to sparse pixel representation and ambiguous object boundaries. Through systematic analysis, we identify three critical degradation stages during forward propagation in deep detection networks: edge attenuation during feature extraction, semantic conflict during feature fusion, and detail loss during feature reconstruction. Existing methods address these stages in isolation or implicitly, lacking collaborative and stage-aware repair strategies. To address this issue, we propose EAGLE-DET, a novel detection framework based on sparse multi-scale attention and refined transformation. Specifically, the framework comprises three core modules: (1) the Cross-stage Multi-resolution Edge Enhancement Network (CMENet), which preserves small object edge representations via adaptive high-low frequency decomposition; (2) the Attention-guided Multi-scale Feature Fusion Network (AMFFN), which resolves cross-scale semantic conflicts through pyramidal sparse attention and multi-scale spatial decoupling; (3) the Enhanced Upsampling with Channel Bridging and Spatial Coordination module (EUCBSC), which recovers spatial detail fidelity via bidirectional channel shift mixing. Extensive experiments on three benchmark datasets—VisDrone-2019, UAVDT, and DOTA1.0—demonstrate the effectiveness of EAGLE-DET, which achieves improvements of 4.5% AP50 and 2.9% AP50:95 on VisDrone-2019 over the baseline, while maintaining inference at 71.7 FPS, achieving an optimal accuracy–efficiency trade-off.
Building similarity graph...
Analyzing shared references across papers
Loading...
Yimeng Tao
Beijing Institute of Technology
Yan Ding
Beijing Institute of Technology
Bo Mo
Beijing Institute of Technology
Sensors
Beijing Institute of Technology
Beijing Agricultural Machinery Research Institute
Beijing Institute of Power Machinery (China)
Building similarity graph...
Analyzing shared references across papers
Loading...
Tao et al. (Wed,) studied this question.
synapsesocial.com/papers/6a22692e763171746d547b75 — DOI: https://doi.org/10.3390/s26113554