Image Text to Speech Conversion Using Optical Character Recognition

S. Priyadharshini, P. Alaguvathana, S. Muthulakshmi, C. Akalya, V. Akilaa, V. Gayathri and C.R. Lavanya

Image Text to Speech Conversion Using Optical Character Recognition

¹S. Priyadharshini, P. Alaguvathana, S. Muthulakshmi, C. Akalya, V. Akilaa, V. Gayathri and C.R. Lavanya

222 Views

48 Downloads

download article

Abstract:

Now a days the digital storage is preferred to paper storage. The data are scanned and stored in form of image files. To retrieve an image from large data, text recognition is done. The data in that image can be in any language and also handwritten. Image processing is done to extract text and those texts are converted to audio format in order to avoid ambiguity in handwritten data files as the handwriting of a person is difficult to understand. There are few automated methods in machine learning algorithms which failed to provide accurate results. In this preprocessing the input image using Long Short-Term Memory in Recurrent Neural Network (RNN), a deep learning algorithm is done with addiction to that, Optical Character Recognition (OCR) uses OTSU’s method for image binarization and segmentation then converts texts into audio format with better accuracy and clarity.

Keywords:

Recurrent Neural Network (RNN), Optical Character Recognition (OCR), Long Short-Term Memory (LSTM), OTSU’s Method.

Paper Details

D.O.I10.37200/V24I5/20111

Month4

Year2020

Volume24

IssueIssue 5

Pages4199-4205

Image Text to Speech Conversion Using Optical Character Recognition

1S. Priyadharshini, P. Alaguvathana, S. Muthulakshmi, C. Akalya, V. Akilaa, V. Gayathri and C.R. Lavanya

¹S. Priyadharshini, P. Alaguvathana, S. Muthulakshmi, C. Akalya, V. Akilaa, V. Gayathri and C.R. Lavanya