Cropland constitutes a foundational resource for global food security and agricultural sustainability, and its accurate extraction from high-resolution remote sensing imagery is essential for agricultural monitoring and land management. However, existing deep learning-based segmentation methods often struggle to balance global contextual modeling and fine-grained boundary representation, leading to boundary blurring and omission of small cropland parcels. To address these challenges, this study proposes a novel CNN–Transformer dual-branch fusion network, named CAFM-Net, which integrates a convolution and attention fusion module (CAFM) and an edge-assisted supervision head (EH) to jointly enhance global–local feature interaction and boundary delineation capability. Experiments were conducted on a self-built PlanetScope cropland dataset from Suihua City, China, and the GID public dataset to evaluate the effectiveness and generalization ability of the proposed model. On the self-built dataset, CAFM-Net achieved an overall accuracy (OA) of 96.75%, an F1-score of 96.80%, and an Intersection over Union (IoU) of 93.79%, outperforming mainstream models such as UNet, DeepLabV3+, TransUNet, and Swin Transformer by a clear margin. On the GID public dataset, CAFM-Net obtained an OA of 94.58%, an F1-score of 94.19%, and an IoU of 89.02%, demonstrating strong robustness across different data sources. Ablation experiments further confirm that the CAFM contributes most significantly to performance improvement, while the EH module effectively enhances boundary accuracy. Overall, the proposed CAFM-Net provides a quantitatively validated and robust solution for fine-grained cropland segmentation from high-resolution remote sensing imagery, with clear advantages in boundary precision and small-parcel detection.
Ren et al. (Thu,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: