Nevisa, a Persian continuous speech recognition system

13Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this paper we have reviewed Nevisa Persian speech recognition engine. Nevisa is an HMM-based, large vocabulary speaker-independent continuous speech recognition system. Like most successful recognition systems, MFCC with some modification has been used as speech signal features. It also utilizes a VAD based on signal energy and zero-crossing rate. Maximum likelihood estimation criterion the core of which are the classical segmental k-means and Baum-Welsh algorithms is used for training the acoustic models. The system is based on phoneme modeling and utilizes synchronous beam search based on lexicon tree for decoding the acoustic utterances. Language modeling for Persian has been implemented in two statistical (n-gram) and grammatical forms. Nevisa is equipped with out-of-vocabulary capability for applications with small size vocabulary. In order to compensate the effect of accuracy reduction in noisy environments, powerful robustness methods are utilized. Model-based approaches like PMC, MLLR, and MAP, feature robustness methods like CMS, PCA, RCC, and VTLN, and speech enhancement methods like spectral subtraction and Wiener filtering were investigated. Some of these methods were modified to achieve higher robustness. For training Nevisa, Farsdat database was used. To evaluate the system accuracy, a clean test set was selected from Farsdat and four noisy tasks with different noise types were recorded in different real environments. By taking the advantages of the robustness methods, performance of Nevisa in real environments is similar to clean condition. © 2008 Springer-Verlag.

Cite

CITATION STYLE

APA

Sameti, H., Veisi, H., Bahrani, M., Babaali, B., & Hosseinzadeh, K. (2008). Nevisa, a Persian continuous speech recognition system. In Communications in Computer and Information Science (Vol. 6 CCIS, pp. 485–492). https://doi.org/10.1007/978-3-540-89985-3_60

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free