International Journal of Innovative Research in                 Electrical, Electronics, Instrumentation and Control Engineering

A monthly Peer-reviewed / Refereed journal

ISSN Online 2321-2004
ISSN Print 2321-5526

Since 2013

Abstract: “OCR Based Text Extraction From Images” is based on text recognition from image and text-to-speech conversion. It converts the text within an image into speech format and reads it out. Image has text characters which is the main source of information for content-based indexing. The goal of text recognition is to recognize the text from printed hardcopy documents to the desired format.However, these text characters are difficult to be detected and recognized due to their varying sizes and complex backgrounds. In the segmentation step, we model the distribution of grayscale values of pixels. Finally, they are processed by OCR. OCR is the technology that is the answer for extracting data from the images and any documents and convert into computer-readable forms which can be helpful for editing or searching.Images are converted to text files that will be further converted to audio files.

Keywords: OCR,TTS,Text detection, preprocessing, Gaussian blur, spell correction


PDF | DOI: 10.17148/IJIREEICE.2022.10572

Open chat