Assigning intonational features in synthesized spoken directions

24Citations
Citations of this article
83Readers
Mendeley users who have this article in their library.

Abstract

Speakers convey much of the information hearers use to interpret discourse by varying prosodic features such as PHRASING, PITCH ACCENT placement, TUNE, and PITCH RANGE. The ability to emulate such variation is crucial to effective (synthetic) speech generation. While text-to-speech synthesis must rely primarily upon structural information to determine appropriate intonational features, speech synthesized from an abstract representation of the message to be conveyed may employ much richer sources. The implementation of an intonation assignment component for Direction Assistance, a program which generates spoken directions, provides a first approximation of how recent models of discourse structure can be used to control intonational variation in ways that build upon recent research in intonational meaning. The implementation further suggests ways in which these discourse models might be augmented to permit the assignment of appropriate intonational features.

Cite

CITATION STYLE

APA

Davis, J. R., & Hirschberg, J. (1988). Assigning intonational features in synthesized spoken directions. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1988-June, pp. 187–193). Association for Computational Linguistics (ACL). https://doi.org/10.3115/982023.982046

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free