Prosody Modeling for Automatic Speech Recognition and Understanding

  • Shriberg E
  • Stolcke A
N/ACitations
Citations of this article
33Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper summarizes statistical modeling approaches for the use of prosody (the rhythm and melody of speech) in automatic recognition and understanding of speech. We outline effective prosodic feature extraction, model architectures, and techniques to combine prosodic with lexical (word-based) information. We then survey a number of applications of the framework, and give results for automatic sentence segmentation and disfluency detection, topic segmentation, dialog act labeling, and word recognition.

Cite

CITATION STYLE

APA

Shriberg, E., & Stolcke, A. (2004). Prosody Modeling for Automatic Speech Recognition and Understanding (pp. 105–114). https://doi.org/10.1007/978-1-4419-9017-4_5

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free