The precise, real-time delineation of rice lodging areas constitutes a fundamental prerequisite for the adaptive operation of unmanned combine harvesters. However, existing deep learning methods struggle to resolve a critical limitation: achieving an optimal equilibrium between robust regional morphological perception—which is crucial for irregular lodging patterns—and the ultra-low computational overhead demanded by resource-constrained edge terminals. To address this specific constraint, StarNet-RiceSeg is proposed as a lightweight semantic segmentation network explicitly tailored for unmanned harvesters. Initially, the architecture incorporates the minimalist StarNet as its backbone. By leveraging the unique “Star Operation,” it implicitly maps features into a high-dimensional nonlinear space, thereby significantly augmenting feature discriminability while drastically curtailing computational overhead. Furthermore, to mitigate the misdetection issues stemming from the textural similarity between lodged and upright rice, the Rice Spatial Attention (RSA) module was designed. By intensifying feature interaction within the spatial dimension, this module steers the network to focus on the cohesive morphology of lodged regions while effectively suppressing background noise. Experiments conducted on a self-constructed high-resolution rice lodging dataset demonstrate that StarNet-RiceSeg achieves a mIoU of 94.42%, significantly outperforming mainstream models such as U-Net, DeepLabV3+, SegNet and HRNet. Notably, the model maintains a compact footprint with only 8.01 million parameters and a computational load as low as 9.32 GFLOPs. Following optimization with TensorRT, the system achieved a real-time inference speed of 32.51 FPS on the NVIDIA Jetson Xavier NX embedded platform. These results indicate that StarNet-RiceSeg provides a high-precision, low-latency solution for perceiving rice lodging areas in complex field environments, facilitating unmanned precision harvesting.
Liu et al. (Tue,) studied this question.