What question did this study set out to answer?

The aim is to enhance real-time detection of rice lodging areas through a lightweight deep learning model.

April 4, 2026Open Access

StarNet-RiceSeg: An Efficient High-Dimensional Feature Mapping Network with Spatial Attention for Real-Time Rice Lodging Detection

Key Points

The aim is to enhance real-time detection of rice lodging areas through a lightweight deep learning model.
Proposed StarNet-RiceSeg as a lightweight semantic segmentation model.
Utilized the Star Operation for high-dimensional feature mapping.
Incorporated a Rice Spatial Attention module to enhance feature interaction and suppress background noise.
Evaluated the model on a self-constructed high-resolution rice lodging dataset.
Achieved a mIoU of 94.42%, outperforming existing models like U-Net and DeepLabV3+.
Maintained a model size of only 8.01 million parameters.
Real-time inference speed of 32.51 FPS on NVIDIA Jetson Xavier NX platform.

Abstract

The precise, real-time delineation of rice lodging areas constitutes a fundamental prerequisite for the adaptive operation of unmanned combine harvesters. However, existing deep learning methods struggle to resolve a critical limitation: achieving an optimal equilibrium between robust regional morphological perception—which is crucial for irregular lodging patterns—and the ultra-low computational overhead demanded by resource-constrained edge terminals. To address this specific constraint, StarNet-RiceSeg is proposed as a lightweight semantic segmentation network explicitly tailored for unmanned harvesters. Initially, the architecture incorporates the minimalist StarNet as its backbone. By leveraging the unique “Star Operation,” it implicitly maps features into a high-dimensional nonlinear space, thereby significantly augmenting feature discriminability while drastically curtailing computational overhead. Furthermore, to mitigate the misdetection issues stemming from the textural similarity between lodged and upright rice, the Rice Spatial Attention (RSA) module was designed. By intensifying feature interaction within the spatial dimension, this module steers the network to focus on the cohesive morphology of lodged regions while effectively suppressing background noise. Experiments conducted on a self-constructed high-resolution rice lodging dataset demonstrate that StarNet-RiceSeg achieves a mIoU of 94.42%, significantly outperforming mainstream models such as U-Net, DeepLabV3+, SegNet and HRNet. Notably, the model maintains a compact footprint with only 8.01 million parameters and a computational load as low as 9.32 GFLOPs. Following optimization with TensorRT, the system achieved a real-time inference speed of 32.51 FPS on the NVIDIA Jetson Xavier NX embedded platform. These results indicate that StarNet-RiceSeg provides a high-precision, low-latency solution for perceiving rice lodging areas in complex field environments, facilitating unmanned precision harvesting.

StarNet-RiceSeg: An Efficient High-Dimensional Feature Mapping Network with Spatial Attention for Real-Time Rice Lodging Detection

Key Points

Abstract

Cite This Study