Improving articulatory feature and phoneme recognition using multitask learning

Abstract

Speech sounds can be characterized by articulatory features. Articulatory features are typically estimated using a set of multilayer perceptrons (MLPs), i.e., a separate MLP is trained for each articulatory feature. In this paper, we investigate a multitask learning (MTL) approach for the joint estimation of articulatory features, with and without phoneme classification as a subtask. Our studies show that an MTL MLP can estimate articulatory features compactly and efficiently by learning the inter-feature dependencies through a common hidden-layer representation. Furthermore, adding phoneme classification as a subtask while estimating articulatory features improves both articulatory feature estimation and phoneme recognition. On the TIMIT phoneme recognition task, articulatory feature posterior probabilities obtained by the MTL MLP achieve a phoneme recognition accuracy of 73.2%, while the phoneme posterior probabilities achieve an accuracy of 74.0%. © 2011 Springer-Verlag.
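
The shared-hidden-layer architecture the abstract describes can be sketched in a few lines. The following is a minimal illustration, not the authors' implementation: the layer sizes, articulatory feature-group names, and phoneme inventory size are assumptions chosen for readability.

```python
# Minimal sketch (assumptions, not the paper's code): a multitask MLP with
# one shared hidden layer and a separate softmax head per articulatory
# feature group, plus an optional phoneme-classification head as a subtask.
import torch
import torch.nn as nn


class MultitaskMLP(nn.Module):
    def __init__(self, input_dim=351, hidden_dim=1024,
                 feature_dims=None, num_phonemes=39):
        super().__init__()
        # Hypothetical articulatory feature groups and class counts.
        feature_dims = feature_dims or {"manner": 6, "place": 8, "voicing": 3}
        # Common hidden layer: all tasks share this representation,
        # which is how the network captures inter-feature dependencies.
        self.shared = nn.Sequential(nn.Linear(input_dim, hidden_dim),
                                    nn.Sigmoid())
        # One linear (softmax) head per articulatory feature group.
        self.feature_heads = nn.ModuleDict(
            {name: nn.Linear(hidden_dim, dim)
             for name, dim in feature_dims.items()}
        )
        # Phoneme classification added as an auxiliary subtask.
        self.phoneme_head = nn.Linear(hidden_dim, num_phonemes)

    def forward(self, x):
        h = self.shared(x)
        logits = {name: head(h) for name, head in self.feature_heads.items()}
        logits["phoneme"] = self.phoneme_head(h)
        return logits


def mtl_loss(logits, targets):
    # Training would minimize the sum of the per-task cross-entropy losses.
    ce = nn.CrossEntropyLoss()
    return sum(ce(logits[task], targets[task]) for task in targets)
```

In this sketch, posterior probabilities for each task are obtained by applying a softmax to the corresponding head's logits; a single shared network replaces the set of per-feature MLPs.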

Citation (APA)

Rasipuram, R., & Magimai-Doss, M. (2011). Improving articulatory feature and phoneme recognition using multitask learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6791 LNCS, pp. 299–306). https://doi.org/10.1007/978-3-642-21735-7_37
