The Mongolian Text-to-Speech (TTS) Challenge under Low-Resource Scenario is a special session of the National Conference on Man-Machine Speech Communication 2022 (NCMMSC2022), referred to as NCMMSC2022-MTTSC. A Mongolian TTS dataset was provided to participants this year, and a low-resource Mongolian TTS task was designed. Specifically, the task is to synthesize high-quality Mongolian speech from given Mongolian scripts. Thirteen teams submitted their results for final evaluation. Mean opinion score (MOS) listening tests were conducted online to measure the naturalness and intelligibility of the synthetic speech. In addition, the word error rate (WER) of automatic speech recognition was used as an objective metric for intelligibility. The evaluation results show that the top system achieved naturalness and intelligibility comparable to those of the ground-truth speech.
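As an illustration of the objective metric mentioned above, WER is conventionally the word-level edit distance (substitutions, deletions, and insertions) between a reference transcript and the recognizer output, divided by the number of reference words. The following is a minimal sketch of that standard definition, not the challenge's actual scoring script; the function name `word_error_rate` and the whitespace tokenization are assumptions made for the example.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: words are separated by whitespace. The tokenization and
    ASR system used in the challenge are not described in the abstract.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution and one deletion against four reference words.
    print(word_error_rate("a b c d", "a x c"))
```

Because insertions are counted, WER under this definition can exceed 1.0 when the recognizer output is much longer than the reference.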
CITATION STYLE
Liu, R., Ling, Z. H., Hu, Y. F., Zhang, H., & Gao, G. L. (2023). Mongolian Text-to-Speech Challenge Under Low-Resource Scenario for NCMMSC2022. In Communications in Computer and Information Science (Vol. 1765 CCIS, pp. 221–226). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-99-2401-1_20