Phonetic question generation using misrecognition

Supphanat Kanokphara; Julie Carson-Berndsen

Conference Proceedings

Phonetic question generation using misrecognition

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4188 LNCS 407-414

DOI: 10.1007/11846406_51

0Citations

3Readers

Get full text

Abstract

Most automatic speech recognition systems are currently based on tied state triphones. These tied states are usually determined by a decision tree. Decision trees can automatically cluster triphone states into many classes according to data available allowing each class to be trained efficiently. In order to achieve higher accuracy, this clustering is constrained by manually generated phonetic questions. Moreover, the tree generated from these phonetic questions can be used to synthesize unseen triphones. The quality of decision trees therefore depends on the quality of the phonetic questions. Unfortunately, manual creation of phonetic questions requires a lot of time and resources. To overcome this problem, this paper is concerned with an alternative method for generating these phonetic questions automatically from misrecognition items. These questions are tested using the standard TIMIT phone recognition task. © Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Kanokphara, S., & Carson-Berndsen, J. (2006). Phonetic question generation using misrecognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4188 LNCS, pp. 407–414). Springer Verlag. https://doi.org/10.1007/11846406_51

Phonetic question generation using misrecognition

Abstract

Cite

Register to see more suggestions