We present a novel deep learning framework for high-resolution semantic segmentation, designed to interpret complex visual environments such as cities, rural areas, and natural landscapes. Our method integrates conic geometric embeddings, which is a mathematical approach for capturing spatial relationships, with belief-aware learning, a strategy that adapts model predictions when input data are uncertain or change over time. A multi-scale refinement process further improves boundary accuracy and detail preservation. The proposed model, built on a hybrid Vision Transformer (ViT) backbone and trained end-to-end using adaptive optimization, is evaluated on four benchmark datasets including EDEN, OpenEarthMap, Cityscapes, and iSAID. It achieves 88.94% Accuracy and R2 of 0.859 on EDEN, while surpassing 85.3% Accuracy on Cityscapes. Ablation studies demonstrate that removing Conic Output Embeddings causes drops in Accuracy of up to 2.77% and increases in RMSE, emphasizing their contribution to frequency-aware generalization across diverse conditions. On OpenEarthMap, our model achieves a mean IoU of 73.21%, outperforming previous baselines by 2.9%, and on iSAID, it reaches 80.75% mIoU with improved boundary adherence. Beyond technical performance, the framework enables practical applications such as automated landscape analysis, urban growth monitoring, and sustainable environmental planning. Its consistent results across three independent runs demonstrate both robustness and reproducibility, offering a reliable tool for large-scale geospatial and environmental modeling.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhang et al. (Mon,) studied this question.
synapsesocial.com/papers/68bb4d196d6d5674bcd009e5 — DOI: https://doi.org/10.3390/math13172806
Junlei Zhang
Kunming University of Science and Technology
Gao Chi
Jiangsu University
Mathematics
Huazhong Agricultural University
Building similarity graph...
Analyzing shared references across papers
Loading...