Lip Reading Using Convolutional Neural Networks with and without Pre-Trained Models

  • OZCAN T
  • BASTURK A

Abstract

Lip reading has recently become a popular topic, and there is an extensive literature on lip reading within human action recognition. Deep learning methods are frequently used in this area. In this paper, lip reading from video data is performed using self-designed convolutional neural networks (CNNs). For this purpose, both the standard and an augmented version of the AvLetters dataset are used in the training and test stages. To optimize network performance, the mini-batch size parameter is also tuned and its effect is investigated. Additionally, experiments are performed using the pre-trained CNNs AlexNet and GoogleNet. Detailed experimental results are presented.

Cite

OZCAN, T., & BASTURK, A. (2019). Lip Reading Using Convolutional Neural Networks with and without Pre-Trained Models. Balkan Journal of Electrical and Computer Engineering, 7(2), 195–201. https://doi.org/10.17694/bajece.479891
