May 5, 2026 · Open Access

An Intelligent OCR Approach for Arabic Handwriting Using a Fine-Tuned TrOCR Model

Key Points

Aimed to develop an Optical Character Recognition (OCR) system tailored to Arabic handwritten text.
Used a fine-tuned Transformer-based model (TrOCR) that combines a Vision Transformer (ViT) encoder with a Transformer-based decoder.
Trained on benchmark datasets such as IFN/ENIT and AHCD, together with synthetically generated data.
Included a preprocessing pipeline (denoising, binarization, normalization, skew correction) that handles scanned and camera-captured images and produces editable text and PDF output.
Reported expected results of approximately 4.1% Character Error Rate (CER) and 8.5% Word Error Rate (WER).
Reported that these results outperform traditional OCR systems such as Tesseract on Arabic handwriting.
Highlighted the effectiveness of Transformer-based approaches for handwriting recognition tasks.
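
The two metrics named above can be made concrete with a short sketch. CER and WER are conventionally computed as the Levenshtein (edit) distance between the reference transcription and the OCR output, divided by the reference length in characters or words. The code below is a minimal illustration under that standard definition; the function names are hypothetical, and the study's own evaluation code and any text normalization it applied (for example, handling of Arabic diacritics) are not described in the source.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences.

    Insertions, deletions, and substitutions each cost 1.
    Works on strings (characters) and on lists (words).
    """
    prev = list(range(len(hyp) + 1))  # distances for the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion from ref
                            curr[j - 1] + 1,      # insertion into ref
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def cer(reference, hypothesis):
    """Character Error Rate: character edits / reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


def wer(reference, hypothesis):
    """Word Error Rate: word edits / number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


if __name__ == "__main__":
    reference = "هذا كتاب جديد"
    hypothesis = "هذا كتب جديد"   # one character dropped in the second word
    print(f"CER = {cer(reference, hypothesis):.3f}")
    print(f"WER = {wer(reference, hypothesis):.3f}")
```

Because Python strings are sequences of Unicode code points, Arabic diacritics written as combining marks count as separate characters here; whether they are stripped before scoring is an evaluation choice that changes the reported CER.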

Abstract

This report presents the development of an intelligent Optical Character Recognition (OCR) system designed specifically for Arabic handwritten text using a fine-tuned Transformer-based model (TrOCR). Arabic handwriting recognition is particularly challenging due to the cursive nature of the script, context-dependent letter forms, and high variability in individual writing styles. To address these challenges, the proposed system leverages a state-of-the-art vision-language architecture that combines a Vision Transformer (ViT) encoder with a Transformer-based decoder, fine-tuned on benchmark datasets such as IFN/ENIT and AHCD, along with synthetically generated data to improve generalization and class balance. The system includes a robust preprocessing pipeline consisting of denoising, binarization, normalization, and skew correction, enabling it to process both scanned and camera-captured images and generate editable outputs in text and PDF formats. Performance is evaluated using standard OCR metrics, including Character Error Rate (CER) and Word Error Rate (WER), with expected results of approximately 4.1% CER and 8.5% WER, outperforming traditional OCR systems like Tesseract. This work highlights the effectiveness of Transformer-based approaches for complex handwriting recognition tasks and contributes to the advancement of Arabic document digitization, with potential applications in domains such as government, education, healthcare, and digital archiving. This work was conducted at Arab International University (AIU), Syria. The official website of the university is: https://www.aiu.edu.sy
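
The abstract lists the preprocessing steps but does not say which algorithms implement them. As an illustration of the binarization step only, the sketch below uses Otsu's method, a common global-thresholding technique that picks the gray level maximizing the between-class variance of ink and background. Choosing Otsu is an assumption made here for illustration, not the author's stated method, and the function names and the list-of-rows image representation are hypothetical.

```python
def otsu_threshold(pixels):
    """Return the gray level t (0-255) that maximizes between-class variance.

    `pixels` is a flat iterable of integer intensities in 0..255.
    Pixels <= t form one class (dark), pixels > t the other (light).
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0      # number of pixels in the dark class so far
    sum0 = 0    # sum of intensities in the dark class so far
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue            # dark class still empty
        w1 = total - w0
        if w1 == 0:
            break               # light class empty; no further splits possible
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        var_between = w0 * w1 * (m0 - m1) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(image):
    """Map a grayscale image (list of rows) to 1 = ink, 0 = background.

    Assumes dark ink on a light page, so pixels at or below the
    Otsu threshold are treated as ink.
    """
    t = otsu_threshold(p for row in image for p in row)
    return [[1 if p <= t else 0 for p in row] for row in image]


if __name__ == "__main__":
    # Tiny synthetic patch: a dark diagonal stroke on a light background.
    patch = [
        [250, 245, 30],
        [240, 20, 235],
        [25, 250, 248],
    ]
    for row in binarize(patch):
        print(row)
```

A single global threshold works best on evenly lit scans. Camera-captured pages with shadows or uneven lighting often need a locally adaptive threshold instead, which is one reason denoising and normalization usually precede this step.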

Authors

Zena alkodaimi

Institutions

Arab International University
