Speakers convey much of the information hearers use to interpret discourse by varying prosodic features such as PHRASING, PITCH ACCENT placement, TUNE, and PITCH RANGE. The ability to emulate such variation is crucial to effective (synthetic) speech generation. While text-to-speech synthesis must rely primarily upon structural information to determine appropriate intonational features, speech synthesized from an abstract representation of the message to be conveyed may employ much richer sources. The implementation of an intonation assignment component for Direction Assistance, a program which generates spoken directions, provides a first approximation of how recent models of discourse structure can be used to control intonational variation in ways that build upon recent research in intonational meaning. The implementation further suggests ways in which these discourse models might be augmented to permit the assignment of appropriate intonational features.
CITATION STYLE
Davis, J. R., & Hirschberg, J. (1988). Assigning intonational features in synthesized spoken directions. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1988-June, pp. 187–193). Association for Computational Linguistics (ACL). https://doi.org/10.3115/982023.982046
Mendeley helps you to discover research relevant for your work.