Unmanned Aerial Vehicle imagery often suffers from severe spatial detail degradation due to sensor limitations and motion blur, hindering downstream vision tasks. To address this, we propose a lightweight super-resolution framework leveraging a Tiny-Transformer backbone enhanced by a multi-domain feature fusion strategy. Specifically, we jointly model spatial structural semantics and frequency domain texture priors via a cross-domain fusion attention mechanism, enabling coordinated restoration of global consistency and local details. Extensive experiments demonstrate that our method outperforms state-of-the-art approaches on standard benchmarks, achieving significant gains in Peak Signal-to-Noise Ratio and structural similarity index while maintaining low computational cost. Notably, the model exhibits superior robustness in reconstructing high-frequency textures common in aerial scenes. This work provides an efficient, deployable solution for enhancing visual fidelity in resource-constrained applications such as urban planning and precision agriculture.
Man et al. (Sat,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: