The use of several language models and its impact on word insertion penalty in LVCSR

Abstract

This paper investigates the influence of hypothesis length in N-best list rescoring. It is theoretically explained why language models prefer shorter hypotheses. This bias affects the word insertion penalty used in continuous speech recognition. The theoretical findings are confirmed by experiments. Parameter optimization performed on the Slovene Broadcast News database showed why optimal word insertion penalties tend to be greater when two language models are used in speech recognition. This paper also presents a two-pass speech recognition algorithm. Two types of language models were used: a standard trigram word-based language model and a trigram model of morpho-syntactic description tags. A relative decrease of 2.02% in word error rate was achieved after parameter optimization. Statistical tests were performed to confirm the significance of the word error rate decrease. © 2013 Springer International Publishing.
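The rescoring described in the abstract can be sketched as follows. This is a minimal illustration only, assuming the usual log-linear combination of acoustic score, language-model scores, and a per-word insertion bonus; the function name, weights, and data layout are hypothetical and not taken from the paper:

```python
def rescore(nbest, lm1_weight=1.0, lm2_weight=1.0, wip=0.0):
    """Pick the best hypothesis from an N-best list.

    nbest: list of tuples
        (words, acoustic_logprob, lm1_logprob, lm2_logprob)
    """
    best, best_score = None, float("-inf")
    for words, ac, lm1, lm2 in nbest:
        # Every added word lowers each LM log-probability, so language
        # models favour short hypotheses. The word insertion penalty
        # `wip` adds a per-word bonus to counteract this bias; with two
        # LMs the bias is stronger, hence a larger optimal `wip`.
        score = ac + lm1_weight * lm1 + lm2_weight * lm2 + wip * len(words)
        if score > best_score:
            best, best_score = words, score
    return best
```

For example, a two-entry N-best list where the longer hypothesis loses at `wip=0` can win once `wip` is raised, mirroring the paper's observation that adding a second language model shifts the optimal penalty upward.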

CITATION STYLE

APA

Donaj, G., & Kačič, Z. (2013). The use of several language models and its impact on word insertion penalty in LVCSR. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8113 LNAI, pp. 354–361). https://doi.org/10.1007/978-3-319-01931-4_47
