What question did this study set out to answer?

The aim is to enhance commodity recognition in smart retail cabinets using an improved YOLOv11 algorithm.

April 11, 2026Open Access

A lightweight YOLOv11-based commodity detector for dense smart retail cabinets

Key Points

The aim is to enhance commodity recognition in smart retail cabinets using an improved YOLOv11 algorithm.
Developed an upgraded YOLOv11 framework with C3K2-MKFENet for better feature extraction.
Replaced the original sampling component with the CARAFE module.
Introduced a fuseModule for merging high-resolution and deep semantic features.
Integrated the CBAM module to focus on important features for occlusion reduction.
Implemented XIoU loss for improved CIoU efficiency.
Achieved a 0.6% increase in mAP50 and a 2.3% increase in mAP50-95 on the YZGOODS dataset.
Demonstrated an efficient trade-off between accuracy and speed on Jetson Nano for embedded devices.

Abstract

The development of smart retail cabinets is tightly intertwined with machine vision technology, yet most confront the bottleneck of limited commodity variety. This is primarily attributed to the vast diversity of commodities, which gives rise to multiple recognition challenges–including small target detection, occlusion, and particularly the identification of similar items. To address these critical engineering issues, this paper proposes an improved lightweight YOLO11 commodity recognition algorithm based on a YOLOv11-based lightweight detection framework with structural refinement and stabilized IoU regression. First, the original C3K2 module is upgraded to C3K2-MKFENet, enhancing feature extraction via partial convolution and multi-scale strategies. Second, the CARAFE module replaces the original sampling component, and a fuseModule is introduced to fuse high-resolution shallow features with deep semantic features, strengthening small target fitting. Third, the CBAM module is integrated to prioritize key features for occlusion mitigation. The XIoU loss is also introduced to improve CIoU efficiency. Experiments on the YZGOODS dataset show mAP50 and mAP50-95 increase by 0.6% and 2.3% respectively; we report GPU reference speed under a unified measurement protocol, and we additionally report an embedded-device benchmark on Jetson Nano, showing a practical accuracy–efficiency trade-off under resource constraints.

اسأل الذكاء الاصطناعي

Bookmark

View Full Paper