Diction based prosody modeling in table-to-speech synthesis

17Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Transferring a structure from the visual modality to the aural one presents a difficult challenge. In this work we are experimenting with prosody modeling for the synthesized speech representation of tabulated structures. This is achieved by analyzing naturally spoken descriptions of data tables and a following feedback by blind and sighted users. The derived prosodic phrase accent and pause break placement and values are examined in terms of successfully conveying semantically important visual information through prosody control in Table-to-Speech synthesis. Finally, the quality of the information provision of synthesized tables when utilizing the proposed prosody specification is studied against plain synthesis. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Spiliotopoulos, D., Xydas, G., & Kouroupetroglou, G. (2005). Diction based prosody modeling in table-to-speech synthesis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3658 LNAI, pp. 294–301). Springer Verlag. https://doi.org/10.1007/11551874_38

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free