Image Text to Speech Conversion Using Optical Character Recognition

S. Priyadharshini, P. Alaguvathana, S. Muthulakshmi, C. Akalya, V. Akilaa, V. Gayathri and C.R. Lavanya

Abstract:

Nowadays, digital storage is preferred to paper storage. Documents are scanned and stored as image files. Text recognition is used to retrieve an image from a large collection. The text in an image can be in any language and may be handwritten. Image processing is applied to extract the text, which is then converted to audio format to avoid ambiguity in handwritten files, since a person's handwriting can be difficult to read. A few existing automated machine learning methods have failed to provide accurate results. In this work, the input image is preprocessed using Long Short-Term Memory (LSTM) in a Recurrent Neural Network (RNN), a deep learning algorithm. In addition, Optical Character Recognition (OCR) uses Otsu’s method for image binarization and segmentation, and the recognized text is then converted into audio format with better accuracy and clarity.
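The abstract names Otsu’s method as the binarization step. As an illustration only, the following is a minimal sketch of that technique in plain Python, assuming 8-bit grayscale pixel values supplied as a flat list. The function names `otsu_threshold` and `binarize` are hypothetical and are not taken from the paper; this is not the authors' implementation.

```python
def otsu_threshold(pixels, levels=256):
    """Return the gray level t that maximizes between-class variance.

    Pixels with value <= t form one class, pixels > t form the other.
    `pixels` is assumed to be a flat list of integers in [0, levels).
    """
    # Build the gray-level histogram.
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0      # number of pixels in the lower class
    sum0 = 0    # sum of gray values in the lower class
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue          # lower class still empty
        w1 = total - w0
        if w1 == 0:
            break             # upper class empty from here on
        mu0 = sum0 / w0
        mu1 = (sum_all - sum0) / w1
        # Between-class variance, up to a constant factor of 1/total**2.
        var_between = w0 * w1 * (mu0 - mu1) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(pixels, threshold):
    """Map each pixel to 1 if it is above the threshold, else 0."""
    return [1 if p > threshold else 0 for p in pixels]


if __name__ == "__main__":
    # Toy "image": dark ink values mixed with bright paper values.
    image = [12, 15, 10, 14, 240, 235, 250, 245, 11, 238]
    t = otsu_threshold(image)
    print(t, binarize(image, t))
```

In practice a scanned page would be loaded as a 2-D array and flattened before the histogram step; the thresholding logic itself is unchanged.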

Keywords:

Recurrent Neural Network (RNN), Optical Character Recognition (OCR), Long Short-Term Memory (LSTM), Otsu’s Method.

Paper Details
Month: 4
Year: 2020
Volume: 24
Issue: 5
Pages: 4199-4205