April 18, 2024

An Intelligent Invoice Processing System Using Tesseract OCR

Key Points

Key points are not available for this paper at this time.

Abstract

Invoice processing is a time-consuming and tedious task that can be automated using optical character recognition (OCR) technology. Tesseract is a popular open-source OCR engine that can be used to extract text from scanned invoices. In this paper, we propose a method for invoice processing using Tesseract OCR. The method involves pre-processing the image of the invoice to remove noise and improve the quality of the text. The pre-processed image is then passed to Tesseract OCR to extract the text. The extracted text is then parsed to extract the relevant invoice information. The results showed that the method was able to extract the invoice information with high accuracy.

Bookmark

Cite This Study

Deepa et al. (Thu,) studied this question.

synapsesocial.com/papers/68e6e8bdb6db6435876642da https://doi.org/https://doi.org/10.1109/adics58448.2024.10533509

Bookmark