Enhancing voice quality in vocal tract rehabilitation device

Abstract

The assistive devices used for vocal rehabilitation by patients after laryngectomy produce distinctly robotic-sounding speech. This study aims to introduce human-like qualities into the synthetically generated voices. A simplified source-filter model, linear predictive coding (LPC) coefficients and line spectral frequencies were used to characterize the vocal tract and manipulate the acoustic properties of speech. Two different mapping functions were employed: a Gaussian mixture model (GMM) and a linear regression model (LR). Objective and subjective testing showed that both mapping functions produced significant changes in the re-synthesized speech, with the LR mapping producing slightly better results. However, the subjective listening tests indicated that the re-synthesized voices improved on the synthetic voice but still lacked human quality. This may imply that the vocal tract model contains only part of the information that determines the subjective perception of artificiality in speech. Future work will investigate a more elaborate model that includes the excitation and radiation signals of speech production.
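
As an illustration of the kind of processing the abstract describes, the following is a minimal sketch, not the authors' implementation. It estimates LPC coefficients for one speech frame with the autocorrelation method, converts them to line spectral frequencies (LSFs), and fits a plain least-squares linear map between source and target LSF frames. The function names, the Hamming window, the model order, and the assumption that source and target frames are already time-aligned are all illustrative choices that the abstract does not specify.

```python
import numpy as np

def lpc(frame, order):
    """LPC polynomial [1, a1, ..., ap] via the autocorrelation method (assumed)."""
    w = frame * np.hamming(len(frame))
    n = len(w)
    # Autocorrelation at lags 0..order
    r = np.correlate(w, w, mode="full")[n - 1 : n + order]
    # Toeplitz normal equations: R a = -r[1:]
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R, -r[1 : order + 1])
    return np.concatenate(([1.0], a))

def lpc_to_lsf(a):
    """Line spectral frequencies (radians, ascending) of an LPC polynomial."""
    a_ext = np.concatenate((a, [0.0]))
    p_poly = a_ext + a_ext[::-1]  # symmetric polynomial P(z)
    q_poly = a_ext - a_ext[::-1]  # antisymmetric polynomial Q(z)
    angles = np.angle(np.concatenate((np.roots(p_poly), np.roots(q_poly))))
    eps = 1e-6
    # Keep one angle per conjugate pair; drop the trivial roots at z = 1 and z = -1
    return np.sort(angles[(angles > eps) & (angles < np.pi - eps)])

def fit_linear_map(src, tgt):
    """Least-squares weights (with bias row) mapping source LSF frames to target."""
    X = np.hstack([src, np.ones((len(src), 1))])
    W, *_ = np.linalg.lstsq(X, tgt, rcond=None)
    return W

def apply_linear_map(W, src):
    """Apply the fitted map to an array of source LSF frames."""
    return np.hstack([src, np.ones((len(src), 1))]) @ W
```

Mapped LSF vectors are not guaranteed to stay in ascending order, so a practical system would likely need to sort or otherwise constrain them before converting back to a synthesis filter. A GMM-based mapping, the other function named in the abstract, is not sketched here.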

Cite

Sutcliffe, B., Wiggins, L., Rubin, D. M., & Aharonson, V. (2019). Enhancing voice quality in vocal tract rehabilitation device. In Advances in Intelligent Systems and Computing (Vol. 794, pp. 1006–1013). Springer Verlag. https://doi.org/10.1007/978-3-319-94947-5_99
