Pixel-Wise Method for Enhanced Tesseract OCR Accuracy Using Colour and Spatial Distances
DOI:
https://doi.org/10.70594/brain/16.2/29Keywords:
OCR, noise reduction, Tesseract, image preprocessing, image enhancement, OCR accuracy improvement, contrast correctionAbstract
Digital images often contain noise introduced during acquisition, storage, or transmission, which can hinder the performance of Optical Character Recognition systems. Effective noise reduction is essential for improving the accuracy of these systems, as noise can obscure text and reduce recognition rates. The problem of removing noise from images is widely studied in computer vision but remains challenging due to the variety of noise types and the risk of introducing artifacts or blurring. In this work, we propose a new preprocessing algorithm that is used in conjunction with the Tesseract engine, in order to improve its overall accuracy. We test this method against the SmartDoc dataset, which contains images taken from mobile devices, and obtain an improvement over the original accuracy of 6.5%. The method is also compared to several other classical algorithms such as Mean Filter, Median Filter, Bilateral Filter, Adaptive Smoothing, and others showing improved results over each individual one.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Mihai-Lucian Voncilă, Nicolae Tarbă, Cosmin-Dumitru Oprea, Costin Anton Boiangiu, Nicolae Goga (Author)

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.