This paper investigated the speaker independent vowel recognition for Malay children using the Time Delay Neural Network (TDNN). Due to less research done on the children speech recognition, the temporal structure of the children speech was not fully understood. This 2 hidden layers TDNN was proposed to discriminate 6 Malay vowels: /a/, /e/, /inverted e sign/, /i/, /o/ and /u/. The speech database consisted of vowel sounds from 360 children speakers. Cepstral coefficient was normalized for the input of TDNN. The frame rate of the TDNN was tested with 10ms, 20ms, and 30ms. It was found out that the 30ms frame rate produced the highest vowel recognition accuracy with 81.92%. The TDNN also showed higher speech recognition rate compared to the previous studies that used Multilayer Perceptron. © 2011 Springer-Verlag.
CITATION STYLE
Yong, B. F., & Ting, H. N. (2011). Speaker-independent vowel recognition for Malay children using time-delay neural network. In IFMBE Proceedings (Vol. 35 IFMBE, pp. 565–568). https://doi.org/10.1007/978-3-642-21729-6_141
Mendeley helps you to discover research relevant for your work.