PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit

Abstract

PaddleSpeech is an open-source all-in-one speech toolkit. It aims to facilitate the development and research of speech processing technologies by providing an easy-to-use command-line interface and a simple code structure. This paper describes the design philosophy and core architecture of PaddleSpeech, which support several essential speech-to-text and text-to-speech tasks. PaddleSpeech achieves competitive or state-of-the-art performance on various speech datasets and implements the most popular methods. It also provides recipes and pretrained models to quickly reproduce the experimental results in this paper. PaddleSpeech is publicly available at https://github.com/PaddlePaddle/PaddleSpeech.

Cite

Zhang, H., Yuan, T., Chen, J., Li, X., Zheng, R., Huang, Y., … Huang, L. (2022). PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit. In NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Demonstrations Session (pp. 114–123). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.naacl-demo.12
