An automatic speech recognition system has modules that depend on the language and, while there are many public resources for some languages (e. g., English and Japanese), the resources for Brazilian Portuguese (BP) are still limited. This work describes the development of resources and free tools for BP speech recognition, consisting of text and audio corpora, phonetic dictionary, grapheme-to-phone converter, language and acoustic models. All of them are publicly available and, together with a proposed application programming interface, have been used for the development of several new applications, including a speech module for the OpenOffice suite. Performance tests are presented, comparing the developed BP system with a commercial software. The paper also describes an application that uses synthesis and speech recognition together with a natural language processing module dedicated to statistical machine translation. This application allows the translation of spoken conversations from BP to English and vice versa. The resources make easier the adoption of BP speech technologies by other academic groups and industry. © 2010 The Brazilian Computer Society.
CITATION STYLE
Neto, N., Patrick, C., Klautau, A., & Trancoso, I. (2011). Free tools and resources for Brazilian Portuguese speech recognition. Journal of the Brazilian Computer Society, 17(1), 53–68. https://doi.org/10.1007/s13173-010-0023-1
Mendeley helps you to discover research relevant for your work.