Exploring the Effect of Dialect Mismatched Language Models in Telugu Automatic Speech Recognition

Aditya Yadavalli; Ganesh S. Mirishkar; Anil Kumar Vuppala

Conference Proceedings

Exploring the Effect of Dialect Mismatched Language Models in Telugu Automatic Speech Recognition

NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Student Research Workshop (2022) 292-301

DOI: 10.18653/v1/2022.naacl-srw.36

2Citations

29Readers

Get full text

Abstract

Previous research has found that Acoustic Models (AM) of an Automatic Speech Recognition (ASR) system are susceptible to dialect variations within a language, thereby adversely affecting the ASR. To counter this, researchers have proposed to build a dialect-specific AM while keeping the Language Model (LM) constant for all the dialects. This study explores the effect of dialect mismatched LM by considering three different Telugu regional dialects: Telangana, Coastal Andhra, and Rayalaseema. We show that dialect variations that surface in the form of a different lexicon, grammar, and occasionally semantics can significantly degrade the performance of the LM under mismatched conditions. Therefore, this degradation has an adverse effect on the ASR even when dialect-specific AM is used. We show a degradation of up to 13.13 perplexity points when LM is used under mismatched conditions. Furthermore, we show a degradation of over 9% and over 15% in Character Error Rate (CER) and Word Error Rate (WER), respectively, in the ASR systems when using mismatched LMs over matched LMs.

Cite

CITATION STYLE

APA

Yadavalli, A., Mirishkar, G. S., & Vuppala, A. K. (2022). Exploring the Effect of Dialect Mismatched Language Models in Telugu Automatic Speech Recognition. In NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Student Research Workshop (pp. 292–301). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.naacl-srw.36

Exploring the Effect of Dialect Mismatched Language Models in Telugu Automatic Speech Recognition

Abstract

Cite

Register to see more suggestions