What does this research mean for the field?

The proposed hybrid framework for sheep face recognition achieves 98.97% accuracy and operates at nearly 100 FPS on edge devices, significantly enhancing individual sheep identification in complex farming environments.

What question did this study set out to answer?

This research aims to improve sheep face recognition by addressing challenges in complex farming environments.

March 12, 2026 | Open Access

Robust Sheep Face Recognition in Complex Environments: A Hybrid Approach Combining Wavelet-Aware RT-DETR and Adaptive MobileViT

Key Points

This research aims to improve sheep face recognition by addressing challenges in complex farming environments.
Developed a cascaded framework combining WRT-DETR for detection and LG-MobileViT for identification.
Implemented multi-scale wavelet residual modeling in WRT-DETR to handle complex backgrounds.
Utilized local-global collaborative modeling in LG-MobileViT for fine-grained feature recognition.
Conducted experiments on a dataset comprising 400 sheep and 20,000 images.
WRT-DETR achieved 92.5% mean Average Precision (mAP50) in detection tasks.
LG-MobileViT reached 98.97% recognition accuracy with a compact 4.57 MB parameter size.
The integrated system achieved nearly 100 frames per second (FPS) on edge computing platforms.
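
To make the headline metrics above concrete, here is a minimal sketch of what they measure. It assumes the standard definitions: under mAP50, a predicted box counts as a correct detection when its Intersection over Union (IoU) with a ground-truth box is at least 0.5, and a throughput of N FPS leaves a budget of 1000/N milliseconds per frame. The function names are illustrative and do not come from the paper.

```python
def iou(box_a, box_b):
    """Intersection over Union for two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap region (zero if the boxes do not overlap).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def is_match_at_50(pred_box, gt_box, threshold=0.5):
    """True if the prediction overlaps the ground truth enough to count under mAP50."""
    return iou(pred_box, gt_box) >= threshold


def frame_budget_ms(fps):
    """Average time available per frame, in milliseconds, at a given throughput."""
    return 1000.0 / fps
```

At roughly 100 FPS, the combined detection and identification stages have about 10 ms per frame on average. How the paper measures that figure (batching, preprocessing included or not) is not stated in this summary.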

Abstract

Deep learning-based sheep face recognition technology significantly enhances the automation of individual sheep identification, providing critical technical support for smart livestock farming and precision agriculture. However, in real farming environments, factors such as complex backgrounds, illumination variations, and the high visual similarity of sheep faces severely constrain the comprehensive performance of recognition systems regarding accuracy and real-time capability. To address these challenges, we propose a cascaded framework comprising the WRT-DETR model for detection and LG-MobileViT for identification. WRT-DETR integrates multi-scale wavelet residual modeling and adaptive feature interaction into the RT-DETR architecture to effectively handle complex backgrounds. Subsequently, LG-MobileViT utilizes local–global collaborative modeling to distinguish fine-grained features while maintaining a lightweight footprint suitable for edge devices. Experiments conducted on a dataset of 400 individuals and 20,000 images demonstrate that WRT-DETR achieves 92.5% mAP50 in detection tasks. Furthermore, LG-MobileViT attains 98.97% recognition accuracy with a parameter size of only 4.57 MB. On edge computing platforms, the integrated system reaches an inference speed approaching 100 FPS. These results confirm that the proposed framework offers an efficient, reliable technical solution for non-contact, precise sheep identification in practical precision agriculture scenarios.
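
The cascaded design described in the abstract (detect faces first, then identify each cropped face) can be sketched roughly as follows. This is an illustrative outline, not the authors' implementation: `detect` and `identify` are hypothetical callables standing in for WRT-DETR and LG-MobileViT, the image is a plain nested list rather than a real tensor, and the `min_det_score` threshold is an assumption.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]   # (x1, y1, x2, y2) in pixels
Image = Sequence[Sequence[int]]   # rows of pixel values; stand-in for a real image array


@dataclass
class Identification:
    box: Box
    sheep_id: str
    score: float


def crop(image: Image, box: Box) -> List[List[int]]:
    """Cut the region covered by `box` out of the image."""
    x1, y1, x2, y2 = box
    return [list(row[x1:x2]) for row in image[y1:y2]]


def run_cascade(
    image: Image,
    detect: Callable[[Image], List[Tuple[Box, float]]],        # hypothetical detector stage
    identify: Callable[[List[List[int]]], Tuple[str, float]],  # hypothetical identifier stage
    min_det_score: float = 0.5,                                # assumed confidence cutoff
) -> List[Identification]:
    """Stage 1 proposes face boxes; stage 2 assigns an identity to each kept crop."""
    results = []
    for box, det_score in detect(image):
        if det_score < min_det_score:
            continue  # drop low-confidence detections before the identification stage
        sheep_id, id_score = identify(crop(image, box))
        results.append(Identification(box, sheep_id, id_score))
    return results
```

Splitting the work this way means the identification model only ever sees tight face crops, which is one plausible reason a lightweight classifier can remain accurate against cluttered backgrounds.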
