Perception of synthetic speech produced automatically by rule: Intelligibility of eight text-to-speech systems

Beth G. Greene; John S. Logan; David B. Pisoni

Journal Article (Open Access)

Behavior Research Methods, Instruments, & Computers (1986) 18(2) 100-107

DOI: 10.3758/BF03201008

Abstract

We present the results of studies designed to measure the segmental intelligibility of eight text-to-speech systems and a natural speech control, using the Modified Rhyme Test (MRT). Results indicated that the voices tested could be grouped into four categories: natural speech, high-quality synthetic speech, moderate-quality synthetic speech, and low-quality synthetic speech. The overall performance of the best synthesis system, DECtalk-Paul, was equivalent to natural speech only in terms of performance on initial consonants. The findings are discussed in terms of recent work investigating the perception of synthetic speech under more severe conditions. Suggestions for future research on improving the quality of synthetic speech are also considered. © 1986 Psychonomic Society, Inc.

Citation (APA)

Greene, B. G., Logan, J. S., & Pisoni, D. B. (1986). Perception of synthetic speech produced automatically by rule: Intelligibility of eight text-to-speech systems. Behavior Research Methods, Instruments, & Computers, 18(2), 100–107. https://doi.org/10.3758/BF03201008
