What question did this study set out to answer?

The research aims to enhance handwritten text classification using a structure based on TrOCR and a large language model (LLM).

February 6, 2026 | Open Access

A structure based on TrOCR transformer and large language model for classification of handwritten texts

Key Points

The research aims to enhance handwritten text classification using a structure based on TrOCR and a large language model (LLM).
Utilized TrOCR for optical character recognition (OCR) of handwritten texts.
Employed fine-tuning techniques on pre-trained models with various datasets.
Implemented a BART model to classify the extracted text by subject and content.
Demonstrated improved accuracy in text extraction from handwritten images.
Classified extracted texts effectively into relevant categories.
Provided a robust framework for future applications in handwritten text analysis.

Abstract

Processing, classifying, and analyzing the content of handwritten texts are among the most important problems in text analysis. Microsoft has released pre-trained TrOCR models for printed and handwritten text. Because of their prior pre-training, these models are a better starting point for image processing. To use TrOCR for recognizing printed and handwritten text, the pre-trained model can be fine-tuned on different datasets. This process helps the model better learn the specific features of the images and of handwritten or semi-handwritten text. TrOCR uses transformer models for OCR, and fine-tuning it on specialized datasets, especially handwritten ones, is a common task. This research proposes a structure based on TrOCR and an LLM. TrOCR extracts the handwritten text from the images in a dataset (an English handwritten line dataset) and converts it to text data. This text is then given to the LLM as input so that the extracted texts can be classified (using a BART model) by subject and content.
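
The two-stage structure described in the abstract (TrOCR for text extraction, then a BART model for classification) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the checkpoint names `microsoft/trocr-base-handwritten` and `facebook/bart-large-mnli`, the use of zero-shot classification, and the helper names are all assumptions, since the abstract does not say which checkpoints were used or how BART was applied. It assumes the `transformers`, `torch`, and `Pillow` packages are installed.

```python
from typing import Callable, Dict, List

# Assumed checkpoints, chosen only for illustration; the abstract does not
# name the TrOCR or BART checkpoints the authors used.
TROCR_CHECKPOINT = "microsoft/trocr-base-handwritten"
BART_CHECKPOINT = "facebook/bart-large-mnli"


def build_trocr_recognizer(checkpoint: str = TROCR_CHECKPOINT) -> Callable[[str], str]:
    """Return a function that maps an image path to the recognized text line."""
    # Heavy imports stay local so the module can be imported without them.
    from PIL import Image
    from transformers import TrOCRProcessor, VisionEncoderDecoderModel

    processor = TrOCRProcessor.from_pretrained(checkpoint)
    model = VisionEncoderDecoderModel.from_pretrained(checkpoint)

    def recognize(image_path: str) -> str:
        image = Image.open(image_path).convert("RGB")
        pixel_values = processor(images=image, return_tensors="pt").pixel_values
        generated_ids = model.generate(pixel_values)
        return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]

    return recognize


def build_bart_classifier(
    labels: List[str], checkpoint: str = BART_CHECKPOINT
) -> Callable[[str], str]:
    """Return a function that maps a text to its top label (zero-shot, assumed)."""
    from transformers import pipeline

    zero_shot = pipeline("zero-shot-classification", model=checkpoint)

    def classify(text: str) -> str:
        result = zero_shot(text, candidate_labels=labels)
        # The pipeline returns labels sorted by descending score.
        return result["labels"][0]

    return classify


def classify_handwritten_lines(
    image_paths: List[str],
    recognize: Callable[[str], str],
    classify: Callable[[str], str],
) -> List[Dict[str, str]]:
    """Run OCR on each image, then classify the extracted text."""
    results = []
    for path in image_paths:
        text = recognize(path).strip()
        results.append({"image": path, "text": text, "label": classify(text)})
    return results
```

With real models, a call might look like `classify_handwritten_lines(paths, build_trocr_recognizer(), build_bart_classifier(["science", "history", "sports"]))`, where `paths` and the label list are hypothetical. Passing the two stages in as plain callables keeps the OCR model swappable, for example for a fine-tuned TrOCR checkpoint as the abstract suggests.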

Authors

Hossein KardanMoghaddam

Adel Akbarimajd

Mohammad Ranjbarpour

Journals

Facta universitatis - series Electronics and Energetics

Institutions

University of Mohaghegh Ardabili
