Improving quality of voice conversion systems

M. Farhid; M. A. Tinati

Conference Proceedings

Communications in Computer and Information Science (2008), vol. 6 CCIS, pp. 880–883

DOI: 10.1007/978-3-540-89985-3_124

Abstract

New improvement schemes for voice conversion are proposed in this paper. We take human factor cepstral coefficients (HFCC), a modification of MFCC that uses the known relationship between center frequency and critical bandwidth from human psychoacoustics to decouple filter bandwidth from filter spacing, as the basic feature. We propose a U/V (unvoiced/voiced) decision rule such that two sets of codebooks are used to capture the difference between the unvoiced and voiced segments of the source speaker. Moreover, we apply three schemes to refine the synthesized voice: pitch refinement, energy equalization, and frame concatenation. The acceptable performance of the voice conversion system can be verified through an ABX listening test and MOS grades. © 2008 Springer-Verlag.
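
To illustrate what "decoupling filter bandwidth from filter spacing" can mean in practice, the following is a minimal sketch and not the authors' implementation. It assumes mel-spaced center frequencies and the Moore-Glasberg equivalent rectangular bandwidth (ERB) approximation commonly associated with HFCC. The function names, the default frequency range, the filter count, and the `e_factor` scaling parameter are all hypothetical choices made for illustration.

```python
import math


def hz_to_mel(f_hz):
    """Convert frequency in Hz to the mel scale (common 2595*log10 form)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def erb_hz(fc_hz):
    """Moore-Glasberg ERB approximation in Hz; the polynomial takes fc in kHz."""
    f_khz = fc_hz / 1000.0
    return 6.23 * f_khz ** 2 + 93.39 * f_khz + 28.52


def mel_spaced_centers(f_low, f_high, n_filters):
    """Center frequencies equally spaced on the mel scale (a simplification)."""
    m_low, m_high = hz_to_mel(f_low), hz_to_mel(f_high)
    step = (m_high - m_low) / (n_filters + 1)
    return [mel_to_hz(m_low + step * (i + 1)) for i in range(n_filters)]


def filter_layout(f_low=0.0, f_high=4000.0, n_filters=20, e_factor=1.0):
    """Return (center_hz, bandwidth_hz) pairs.

    The bandwidth depends only on the center frequency (via the ERB curve)
    and on e_factor, an assumed scaling knob. It does not depend on how far
    away the neighboring filters are, unlike a standard MFCC filter bank.
    """
    centers = mel_spaced_centers(f_low, f_high, n_filters)
    return [(fc, e_factor * erb_hz(fc)) for fc in centers]


if __name__ == "__main__":
    for fc, bw in filter_layout(n_filters=8):
        print(f"center = {fc:7.1f} Hz   bandwidth = {bw:6.1f} Hz")
```

In this sketch, changing `n_filters` moves the centers but leaves the bandwidth at any given center frequency unchanged, which is the decoupling the abstract refers to. In a standard MFCC filter bank the width of each triangular filter is instead fixed by the positions of its neighbors.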

Cite

Farhid, M., & Tinati, M. A. (2008). Improving quality of voice conversion systems. In Communications in Computer and Information Science (Vol. 6 CCIS, pp. 880–883). https://doi.org/10.1007/978-3-540-89985-3_124
