The evolution of data science and the constant challenge of carrying out different processes using a few resources with simultaneous personalization has promoted interest in the development of voice cloning. Nowadays, different machine learning techniques are used, given their efficiency in generating relationships across multiple parameters. In this regard, we evaluated the best-performing models and the different process optimization strategies within this sector, where through neural network models separated modularly by their functionality, it is possible to generate independent processes taking into account the most significant number of linguistic factors in the generation of the voice, thus obtaining significant results of a clear improvement in the whole process of synthesizing the voice of a target speaker.
CITATION STYLE
Fura-Mendoza, M., Moscol-Albañil, I., Rodriguez, C., Lezama, P., Rodriguez, D., & Pomachagua, Y. (2023). Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview. In Lecture Notes in Networks and Systems (Vol. 685 LNNS, pp. 229–237). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-99-1912-3_21
Mendeley helps you to discover research relevant for your work.