Diction based prosody modeling in table-to-speech synthesis

Dimitris Spiliotopoulos; Gerasimos Xydas; Georgios Kouroupetroglou

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2005) 3658 LNAI 294-301

DOI: 10.1007/11551874_38

17 Citations

11 Readers

Abstract

Transferring a structure from the visual modality to the aural one is a difficult challenge. In this work we experiment with prosody modeling for the synthesized speech representation of tabulated structures. The model is derived by analyzing naturally spoken descriptions of data tables and subsequent feedback from blind and sighted users. The derived prosodic phrase accent and pause break placements and values are examined in terms of how successfully they convey semantically important visual information through prosody control in Table-to-Speech synthesis. Finally, the quality of information provision of synthesized tables using the proposed prosody specification is compared against plain synthesis. © Springer-Verlag Berlin Heidelberg 2005.

Cite

APA

Spiliotopoulos, D., Xydas, G., & Kouroupetroglou, G. (2005). Diction based prosody modeling in table-to-speech synthesis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3658 LNAI, pp. 294–301). Springer Verlag. https://doi.org/10.1007/11551874_38
