Accurate extraction of crop planting structures is important for crop area and yield estimation, but complex and fragmented cropping patterns with overlapping phenology in the Hetao Irrigation District hinder reliable crop discrimination. This study proposes a hierarchical workflow that integrates vegetation masking with multi-source feature optimization for crop mapping. First, dual-temporal Sentinel-2 imagery (May and August) is used to generate a vegetation region-of-interest(ROI) mask via Otsu thresholding applied to the Normalized Difference Vegetation Index (NDVI), combined with pixel-wise maximum-value fusion to reduce phenology-driven omissions and background interference. Second, within the vegetation mask, Sentinel-2 spectral, vegetation-index, and texture features are combined with Sentinel-1 synthetic aperture radar (SAR) backscatter and SAR texture features to construct a multi-source feature set. Random Forest(RF) feature-importance ranking is used to select an effective feature subset, and four classifiers (RF, support vector machine (SVM), eXtreme Gradient Boosting (XGBoost), and convolutional neural network (CNN)) are compared under the same training/validation setting. The vegetation extraction achieves an overall accuracy of 91% (Kappa = 0.80). Using Sentinel-2 features only, the optimized subset with CNN attains the best performance (overall accuracy = 95%, Kappa = 0.93). Adding Sentinel-1 SAR texture features provides an additional improvement (overall accuracy = 96%, Kappa = 0.94), particularly for classes prone to confusion in fragmented plots. Area proportions derived from the final map are consistent with statistical yearbook data (percentage errors: maize 3.45%, sunflower 2.66%, wheat 0.11%, tomato 0.92%) under the study conditions. This workflow supports practical crop-structure monitoring in complex irrigation districts.
Yu et al. (Thu,) studied this question.