A time-delay neural network (TDNN) architecture is used for speaker independent recognition of the long vowel sounds. A brief introduction to the TDNN architecture and a description of the data used for the simulation are given. Previously published work is extended by training the network with the linear predictive coding (LPC) coefficients of speech along with fast Fourier transform bin energies and by allowing longer and variable length utterances. The training has been performed with multiple speakers using English rather than Japanese speech. With these modifications, 100% recognition of all vowels spoken by two speakers was obtained.
CITATION STYLE
Mitchell, R. A., & Shaw, A. (1990). Vowel recognition with a time-delay neural network (pp. 637–640). Publ by IEEE. https://doi.org/10.1109/icsyse.1990.203238
Mendeley helps you to discover research relevant for your work.