Performance of a SCFG-based language model with training data sets of increasing size

Joan Andreu Sánchez; José Miguel Benedí; Diego Linares

Conference Proceedings

Performance of a SCFG-based language model with training data sets of increasing size

Lecture Notes in Computer Science (2005) 3523(II) 586-594

DOI: 10.1007/11492542_72

1Citations

3Readers

Get full text

Abstract

In this paper, a hybrid language model which combines a word-based n-gram and a category-based Stochastic Context-Free Grammar (SCFG) is evaluated for training data sets of increasing size. Different estimation algorithms for learning SCFGs in General Format and in Chomsky Normal Form are considered. Experiments on the UPenn Treebank corpus are reported. These experiments have been carried out in terms of the test set perplexity and the word error rate in a speech recognition experiment. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Sánchez, J. A., Benedí, J. M., & Linares, D. (2005). Performance of a SCFG-based language model with training data sets of increasing size. In Lecture Notes in Computer Science (Vol. 3523, pp. 586–594). Springer Verlag. https://doi.org/10.1007/11492542_72

Performance of a SCFG-based language model with training data sets of increasing size

Abstract

Cite

Register to see more suggestions