The proliferation of Unmanned Aerial Vehicles (UAVs) has significantly advanced large-scale 3D data acquisition, paving the way for Digital Twin applications. While Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS) now enable exceptional photorealism, editing these vast neural representations remains a significant challenge. This survey systematically analyzes large-scale scene editing from UAV imagery, specifically focusing on the architectural dichotomy between implicit (NeRF) and explicit (3DGS) representations. We introduce novel taxonomies to deconstruct their core mechanisms and critically evaluate why explicit representations are currently emerging as the more viable path for interactive, city-scale manipulation. Furthermore, we identify key bottlenecks—such as handling oblique views and variable atmospheric conditions—and outline future research directions.A comprehensive and regularly updated collection of the surveyed papers and resources is available at our project website: https://github.com/ruigong335-creator/A-Survey-on-Aerial-Scene-Editing .
Gong et al. (Mon,) studied this question.