Comparison between handwritten word and speech record in real-time using CNN architectures

Javier O. Pinzón-Arenas; Robinson Jiménez-Moreno

Journal Article (Open Access)

International Journal of Electrical and Computer Engineering (2020) 10(4) 4313-4321

DOI: 10.11591/ijece.v10i4.pp4313-4321

Abstract

This paper presents a system that compares spoken and handwritten words using deep learning techniques. Ten words are captured as audio recordings, and the same words are written by hand and captured with a webcam; the system then verifies whether the two inputs match and indicates whether they correspond to the required word. A different CNN architecture was used for each task: for voice recognition, a CNN identifies complete words from features obtained with mel-frequency cepstral coefficients (MFCC), while for handwriting, a Faster R-CNN both locates and identifies the captured word. To implement the system, an easy-to-use graphical interface was developed that integrates the two neural networks. Real-time tests yielded an overall accuracy of 95.24%, demonstrating the good performance of the implemented system, with the comparison completing in less than 200 ms.
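
The comparison step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not list the ten words, the network output formats, or any confidence threshold, so the vocabulary `WORDS`, the `Detection` record, the `min_score` value, and all function names below are assumptions made for illustration. The sketch takes the class probabilities a speech CNN might produce and the labelled boxes a Faster R-CNN style detector might return, then reports whether the two agree and whether they correspond to the required word.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple

# Hypothetical vocabulary: the abstract mentions 10 words but does not name them.
WORDS = [f"word_{i}" for i in range(10)]


@dataclass
class Detection:
    """One hypothetical output of the handwriting detector."""
    label: str                        # predicted word class
    score: float                      # detector confidence in [0, 1]
    box: Tuple[int, int, int, int]    # (x, y, width, height) in pixels


def top_speech_word(probs: Sequence[float],
                    words: Sequence[str] = WORDS) -> Tuple[str, float]:
    """Return the most probable spoken word and its probability."""
    idx = max(range(len(probs)), key=lambda i: probs[i])
    return words[idx], probs[idx]


def best_detection(detections: List[Detection],
                   min_score: float = 0.5) -> Optional[Detection]:
    """Return the highest-scoring detection above min_score, or None."""
    kept = [d for d in detections if d.score >= min_score]
    return max(kept, key=lambda d: d.score) if kept else None


def compare(required: str,
            speech_probs: Sequence[float],
            detections: List[Detection],
            min_score: float = 0.5) -> dict:
    """Check whether spoken and written words match each other and the required word."""
    spoken, _ = top_speech_word(speech_probs)
    det = best_detection(detections, min_score)
    written = det.label if det is not None else None
    match = written is not None and spoken == written
    return {
        "spoken": spoken,
        "written": written,
        "match": match,
        "is_required": match and spoken == required,
    }
```

For example, with a speech output that peaks at index 3 and a confident detection labelled `word_3`, `compare("word_3", probs, detections)` reports both `match` and `is_required` as true; with no detection above the threshold, `written` is `None` and both flags are false.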

Citation (APA)

Pinzón-Arenas, J. O., & Jiménez-Moreno, R. (2020). Comparison between handwritten word and speech record in real-time using CNN architectures. International Journal of Electrical and Computer Engineering, 10(4), 4313–4321. https://doi.org/10.11591/ijece.v10i4.pp4313-4321
