Inventory counting of logistics boxes in complex scenarios has always been a core task in intelligent logistics systems. To solve the problems of a high leakage rate and low computational efficiency caused by stacking, occlusion, and rotation in box detection against complex backgrounds in logistics environments, this paper proposes a lightweight, rotated object detection model: REW-YOLO (RepViT-Block YOLO with Efficient Local Attention and Wise-IoU). By integrating structural reparameterization techniques, the C2f-RVB module was designed to reduce computational redundancy in traditional convolutions. Additionally, the ELA-HSFPN multi-scale feature fusion network was constructed to enhance edge feature extraction for occluded boxes and improve detection accuracy in densely packed scenarios. A rotation angle regression branch and a dynamic Wise-IoU loss function were introduced to further refine localization and balance sample quality. Experimental results on the self-constructed BOX-data dataset demonstrate that the REW-YOLO achieves 90.2% mAP50 and 130.8 FPS, with a parameter count of only 2.18 M, surpassing YOLOv8n by 2.9% in accuracy while reducing computational cost by 28%. These improvements provide an efficient solution for automated box detection in logistics applications.
Wang et al. (Mon,) studied this question.