A neural clustering algorithm for estimating visible articulatory trajectory

Abstract

The bimodal acoustic-visual nature of speech establishes sound correlations between its audio component and the corresponding articulatory information associated with the time-varying geometry of the vocal tract. In this paper we propose an estimation structure consisting of a simplified Time-Delay Neural Network (TDNN) operating on 4-5 dimensional cepstrum trajectories provided by a preceding clustering layer based on a Self-Organizing Map (SOM). This pre-processing layer allows an effective non-linear clustering of cepstrum vectors, reducing the complexity of the resulting system by one order of magnitude while leaving the overall estimation performance unchanged. The results are reported in terms of estimation precision and robustness, with reference to previously published results.
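As a rough illustration of the two-stage structure described in the abstract, the following is a minimal sketch, not the authors' implementation. It trains a small SOM on cepstral frames, maps each frame to the lattice coordinates of its best-matching unit to obtain a low-dimensional trajectory, and stacks consecutive frames into the windows a time-delay input layer would see. The 12-dimensional synthetic cepstra, the 4-dimensional 3x3x3x3 lattice, the learning schedule, and all function names are assumptions made for the example; the abstract does not specify them, and the TDNN itself is not included.

```python
import numpy as np

def train_som(data, grid_shape=(3, 3, 3, 3), epochs=20, lr0=0.5, sigma0=1.5, seed=0):
    """Train a small SOM; returns (unit weights, unit lattice coordinates)."""
    rng = np.random.default_rng(seed)
    # Lattice coordinates of every unit, shape (n_units, len(grid_shape)).
    coords = np.indices(grid_shape).reshape(len(grid_shape), -1).T.astype(float)
    n_units = coords.shape[0]
    # Initialise each unit from a randomly chosen training vector.
    weights = data[rng.integers(0, len(data), n_units)].astype(float)
    n_steps = epochs * len(data)
    step = 0
    for _ in range(epochs):
        for x in data[rng.permutation(len(data))]:
            frac = step / n_steps
            lr = lr0 * (1.0 - frac)                  # linearly decaying learning rate
            sigma = sigma0 * (1.0 - frac) + 1e-3     # linearly shrinking neighbourhood
            bmu = np.argmin(np.sum((weights - x) ** 2, axis=1))
            d2 = np.sum((coords - coords[bmu]) ** 2, axis=1)
            h = np.exp(-d2 / (2.0 * sigma ** 2))     # Gaussian neighbourhood on the lattice
            weights += lr * h[:, None] * (x - weights)
            step += 1
    return weights, coords

def project(frames, weights, coords):
    """Map each cepstral frame to the lattice coordinates of its best-matching unit."""
    d2 = ((frames[:, None, :] - weights[None, :, :]) ** 2).sum(axis=2)
    return coords[np.argmin(d2, axis=1)]

def delay_windows(traj, n_delays=3):
    """Stack n_delays consecutive frames into one input vector per time step."""
    n_windows = len(traj) - n_delays + 1
    return np.stack([traj[t:t + n_delays].ravel() for t in range(n_windows)])

# Synthetic stand-in for a sequence of 12-dimensional cepstral frames (assumption).
rng = np.random.default_rng(1)
cepstra = rng.normal(size=(200, 12))

weights, coords = train_som(cepstra, epochs=5)
traj = project(cepstra, weights, coords)      # 4-dimensional trajectory, one row per frame
windows = delay_windows(traj, n_delays=3)     # inputs for a time-delay layer
```

In this sketch the time-delay stage receives 4 values per frame instead of 12, which is the kind of input-size reduction the clustering layer is meant to provide; a regression network trained on `windows` would stand in for the simplified TDNN.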

Cite

Vignoli, F., Curinga, S., & Lavagetto, F. (1996). A neural clustering algorithm for estimating visible articulatory trajectory. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1112 LNCS, pp. 863–868). Springer Verlag. https://doi.org/10.1007/3-540-61510-5_145
