What type of study is this?

This is a Experimental Study study.

September 23, 2025

Robust OCR Pipeline for Digit Display Recognition Using TrOCR, YOLOv8, and Multi-Layered Fallbacks

Key Points

The proposed OCR pipeline achieves a 97% success rate on real-world digit displays, surpassing standalone OCR systems.
Combining TrOCR with YOLOv8 and fallback mechanisms proves effective in addressing segmentation complexity and lighting variations.
Custom decimal correction enhances output accuracy, tackling challenges associated with inconsistent formatting.
Performance analysis presents insights into failure modes from prior CNN segmentation methods, guiding future improvements.

Abstract

The variable lighting conditions, segmentation complexity, and inconsistent formatting of digital displays, like those found on utility meters and fuel pumps, present ongoing challenges for optical character recognition (OCR) systems. For ROI detection, we suggest a reliable, multi-model OCR pipeline that combines a refined TrOCR model with YOLOv8 and is enhanced with fallback mechanisms utilizing Tesseract and EasyOCR. Numerical output integrity is improved by a custom decimal correction procedure. After post-processing, our suggested approach outperforms standalone OCR engines by achieving a 97% success rate on real-world digit displays. We examine failure cases from previous CNN-based segmentation attempts, present comparative performance analysis, and describe upcoming work for wider deployment.

Bookmark

Cite This Study

D. Menezes (Tue,) studied this question.

synapsesocial.com/papers/68d4757f31b076d99fa6ccb8 https://doi.org/https://doi.org/10.22214/ijraset.2025.74251

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Bookmark