What question did this study set out to answer?

The aim is to enhance the detection of trellised watermelons in complex agricultural environments despite challenges like occlusion and scale variation.

March 13, 2026Open Access

Target Detection of Trellised Watermelons in Complex Agricultural Scenes Based on Improved RT-DETR

Key Points

The aim is to enhance the detection of trellised watermelons in complex agricultural environments despite challenges like occlusion and scale variation.
Developed an improved RT-DETR model called RT-DETR-Watermelon.
Embedded a context-guided module into the backbone network.
Added a P2 detection layer for better sensitivity to small objects.
Introduced scale sequence feature fusion and a triple feature encoder for multi-scale detection.
Replaced original bounding box regression loss with MPDIoU loss for improved localization.
The RT-DETR-Watermelon model increased precision by 0.4 percentage points.
Achieved a recall increase of 1.8 percentage points.
Improved mean Average Precision (mAP@0.5) by 1.0 percentage points.
Reduced model parameters by 53.5%, computational cost by 23.5%, and model size by 53.2%.

Abstract

To address the problems of severe fruit occlusion, large variations in target scale, and many small-scale goals being overlooked in the recognition of trellised watermelons under complex agricultural scenarios, this study proposes an improved RT-DETR-based detection model, termed RT-DETR-Watermelon. A context-guided (CG) module is embedded into the backbone network. A dedicated P2 detection layer is added to enhance the model’s sensitivity to small objects. A scale sequence feature fusion (SSFF) module and a triple feature encoder (TFE) module are introduced into the model to improve the model’s capability to detect targets at multiple scales. The original bounding box regression loss is replaced with MPDIoU (Multiple Path Distance Intersection over Union) loss, which accelerates model convergence and improves localization precision. Finally, the number of channels is adjusted to reduce parameter count, computational complexity, and storage size. The experimental results show that, compared with the original RT-DETR model, the proposed RT-DETR-Watermelon model increases precision, recall, and mean Average Precision (mAP@0.5) by 0.4, 1.8, and 1.0 percentage points, while reducing the number of parameters, computational cost, and model size by 53.5%, 23.5%, and 53.2%, respectively.

Target Detection of Trellised Watermelons in Complex Agricultural Scenes Based on Improved RT-DETR

Key Points

Abstract

Cite This Study