Punctuation-generation-inspired linguistic features for Mandarin prosody generation

2Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

This paper proposes two novel linguistic features extracted from text input for prosody generation in a Mandarin text-to-speech system. The first feature is the punctuation confidence (PC), which measures the likelihood that a major punctuation mark (MPM) can be inserted at a word boundary. The second feature is the quotation confidence (QC), which measures the likelihood that a word string is quoted as a meaningful or emphasized unit. The proposed PC and QC features are influenced by the properties of automatic Chinese punctuation generation and linguistic characteristic of the Chinese punctuation system. Because MPMs are highly correlated with prosodic–acoustic features and quoted word strings serve crucial roles in human language understanding, the two features could potentially provide useful information for prosody generation. This idea was realized by employing conditional random-field-based models for predicting MPMs, quoted word string locations, and their associated confidences—that is, PC and QC—for each word boundary. The predicted punctuations and their confidences were then combined with traditional linguistic features to predict prosodic–acoustic features for performing speech synthesis using multilayer perceptrons. Both objective and subjective tests demonstrated that the prosody generated with the proposed linguistic features was superior to that generated without the proposed features. Therefore, the proposed PC and QC are identified as promising features for Mandarin prosody generation.

References Powered by Scopus

Long Short-Term Memory

76776Citations
N/AReaders
Get full text

Speech parameter generation algorithms for HMM-based speech synthesis

866Citations
N/AReaders
Get full text

An RNN-based prosodie information synthesizer for mandarin text-to-speech

158Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Cycle consistent network for end-to-end style transfer TTS training

17Citations
N/AReaders
Get full text

Superposition of functional contours based prosodic feature extraction for speech processing

3Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Chiang, C. Y., Hung, Y. P., Yeh, H. Y., Liao, I. B., & Pan, C. M. (2019). Punctuation-generation-inspired linguistic features for Mandarin prosody generation. Eurasip Journal on Audio, Speech, and Music Processing, 2019(1). https://doi.org/10.1186/s13636-019-0147-y

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 2

50%

Professor / Associate Prof. 1

25%

Lecturer / Post doc 1

25%

Readers' Discipline

Tooltip

Linguistics 2

40%

Engineering 2

40%

Computer Science 1

20%

Save time finding and organizing research with Mendeley

Sign up for free