Object detection in UAV scenarios is frequently compromised by drastic scale fluctuations and pervasive background clutter. We propose Aero-DETR, featuring a Multi-Scale Perception Stem (MSPS) for early feature adaptation, Global-Spatial Synergistic Attention (GSSA) to suppress noise, and Partitioned Spatial-Adaptive Fusion (PSAF) to mitigate information decay. These modules synergistically enhance foreground saliency and restore geometric details lost during pyramid aggregation. Experiments on VisDrone and SIMD datasets show that Aero-DETR achieves superior detection accuracy while increasing parameters, GFLOPs, and inference latency, indicating an explicit accuracy-efficiency trade-off.
Building similarity graph...
Analyzing shared references across papers
Loading...
Haoyan Zhang
Zhejiang Sci-Tech University
Wentao Lyu
Zhejiang Sci-Tech University
Zhijiang Deng
Foxconn (United States)
IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences
Zhejiang Sci-Tech University
Chongqing University of Posts and Telecommunications
Foxconn (United States)
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhang et al. (Thu,) studied this question.
synapsesocial.com/papers/6a250a717def13d035e1a9d6 — DOI: https://doi.org/10.1587/transfun.2026eal2026