To detect and describe categories in a given set of utterances without supervision, one may apply clustering to a vector space in which the utterances are represented. This paper compares hard and fuzzy word clustering approaches applied to 'almost' unsupervised utterance categorization for a technical support dialog system. Here, 'almost' means that only one sample utterance is given per category, which allows the performance of the clustering techniques to be evaluated objectively. For this purpose, the categorization accuracy of the respective techniques is measured against a manually annotated test corpus of more than 3000 utterances. © 2008 Springer-Verlag Berlin Heidelberg.
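The contrast between hard and fuzzy assignment can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not specify the algorithms, and the paper clusters words rather than whole utterances. The sketch assumes bag-of-words utterance vectors, Euclidean distance, one hypothetical seed utterance per category as a fixed centroid, and the standard fuzzy c-means membership formula with fuzzifier `m`. All utterances, category names, and function names are invented for illustration.

```python
import math
from collections import Counter


def bag_of_words(utterance, vocabulary):
    """Represent an utterance as a word-count vector over a fixed vocabulary."""
    counts = Counter(utterance.lower().split())
    return [float(counts[word]) for word in vocabulary]


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def hard_assign(vec, centroids):
    """Hard assignment: index of the single nearest centroid."""
    distances = [euclidean(vec, c) for c in centroids]
    return distances.index(min(distances))


def fuzzy_memberships(vec, centroids, m=2.0):
    """Fuzzy assignment: fuzzy c-means membership degrees that sum to 1."""
    distances = [euclidean(vec, c) for c in centroids]
    # A vector that coincides with a centroid belongs fully to it.
    zeros = [i for i, d in enumerate(distances) if d == 0.0]
    if zeros:
        return [1.0 / len(zeros) if i in zeros else 0.0
                for i in range(len(centroids))]
    exponent = 2.0 / (m - 1.0)
    inverse = [d ** -exponent for d in distances]
    total = sum(inverse)
    return [value / total for value in inverse]


# Hypothetical seed utterances, one per category (assumption for illustration).
seeds = {
    "connectivity": "internet connection is down",
    "email": "cannot send email",
}
categories = list(seeds)
vocabulary = sorted({word for text in seeds.values() for word in text.split()})
centroids = [bag_of_words(seeds[name], vocabulary) for name in categories]

# Hypothetical test utterance that shares words with both categories.
utterance_vec = bag_of_words("internet is down cannot send", vocabulary)

hard_label = categories[hard_assign(utterance_vec, centroids)]
memberships = fuzzy_memberships(utterance_vec, centroids)

print(hard_label)
print(dict(zip(categories, memberships)))
```

The hard variant commits the utterance to exactly one category, whereas the fuzzy variant keeps a graded degree of membership in each, which preserves the information that the utterance overlaps with both seeds.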
CITATION STYLE
Albalate, A., & Suendermann, D. (2008). Hard vs. Fuzzy clustering for speech utterance categorization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5078 LNCS, pp. 88–98). https://doi.org/10.1007/978-3-540-69369-7_11