What does this research mean for the field?

The proposed Region-Overlap Detection method using the Minimum Convoluted YOLOv7 architecture achieves a mean Average Precision (mAP) of 73.1% and a recall of 70.2% for detecting small floating objects in dynamic river environments. Novelty: ClaimNovelty.METHODOLOGICAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

The aim is to improve the detection accuracy of small floating objects in dynamic river environments using advanced computer vision techniques.

March 3, 2026Open Access

Small target detection of floating objects in river channels based on improved YOLOv7

Key Points

The aim is to improve the detection accuracy of small floating objects in dynamic river environments using advanced computer vision techniques.
Introduced Region-Overlap Detection (ROD) method using Minimum Convoluted YOLOv7 (MCY) architecture.
Utilized YOLO classifier to identify the largest overlap area in multiple overlapping regions.
Extracted bounding boxes with minimal convolution from the final training layer of the neural network.
Achieved a mean Average Precision (mAP) of 73.1% for small floating object detection.
Obtained a recall rate of 70.2% in detecting small targets in dynamic river settings.

Abstract

Computer vision-aided small target detection in moving streams, such as rivers/ roads, requires a fast-converging outcome as the frame requirements are high. The bounding box varies for the multiple frames generated, resulting in low object detection precision. To address the problem of floating object detection, this article introduces a Region-Overlap Detection (ROD) method using the Minimum Convoluted YOLOv7 (MCY) architecture. First, the typical YOLO classifier identifies the largest overlap area from multiple overlapping regions. The second method extracts the largest bounding box in an area with minimal convolution in the neural network's final training layer. Both techniques accurately identify small objects in flowing streams with high mean accuracy. The YOLO architecture trains its convolutional layers using the largest overlap area, shared by many bounding box regions. The intersecting areas are removed from convolutional layers to expedite convergence and increase mAP. The proposed method achieves a high mean Average Precision (mAP) of 73.1% and a recall of 70.2% for small floating object detection in dynamic river environments.

Bookmark

View Full Paper

Bookmark

View Full Paper

Small target detection of floating objects in river channels based on improved YOLOv7

Key Points

Abstract

Cite This Study