Approximation lasso methods for language modeling

Jianfeng Gao; Hisami Suzuki; Bin Yu

Conference Proceedings

COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (2006) 1 225-232

DOI: 10.3115/1220175.1220204

11 Citations

90 Readers

Abstract

Lasso is a regularization method for parameter estimation in linear models. It optimizes the model parameters with respect to a loss function subject to model complexities. This paper explores the use of lasso for statistical language modeling for text input. Owing to the very large number of parameters, directly optimizing the penalized lasso loss function is impossible. Therefore, we investigate two approximation methods, the boosted lasso (BLasso) and the forward stagewise linear regression (FSLR). Both methods, when used with the exponential loss function, bear strong resemblance to the boosting algorithm which has been used as a discriminative training method for language modeling. Evaluations on the task of Japanese text input show that BLasso is able to produce the best approximation to the lasso solution, and leads to a significant improvement, in terms of character error rate, over boosting and the traditional maximum likelihood estimation. © 2006 Association for Computational Linguistics.
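As a rough illustration of the stagewise idea named in the abstract, the following minimal sketch runs incremental forward stagewise regression on a toy problem. It assumes squared loss and a small dense feature matrix, which is not the paper's setting (the abstract describes exponential loss over language model features). The function name `forward_stagewise` and the parameters `step` and `n_iters` are hypothetical, chosen only for this example.

```python
def forward_stagewise(X, y, step=0.01, n_iters=2000):
    """Incremental forward stagewise regression under squared loss.

    X is a list of rows (lists of floats); y is a list of floats.
    `step` and `n_iters` are illustrative values, not taken from the paper.
    """
    n_features = len(X[0])
    weights = [0.0] * n_features
    residual = list(y)  # equals y - X @ weights while all weights are zero

    for _ in range(n_iters):
        # Correlation of each feature column with the current residual.
        corr = [
            sum(row[j] * r for row, r in zip(X, residual))
            for j in range(n_features)
        ]
        # Pick the feature most correlated with the residual.
        best = max(range(n_features), key=lambda j: abs(corr[j]))
        if corr[best] == 0.0:
            break  # residual is orthogonal to every feature
        # Move that single weight by a small fixed amount in the helpful direction.
        delta = step if corr[best] > 0 else -step
        weights[best] += delta
        residual = [r - delta * row[best] for row, r in zip(X, residual)]

    return weights


# Toy data generated by y = 2 * x0, so only the first feature matters.
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
y = [2.0, 0.0, 2.0]
weights = forward_stagewise(X, y)
```

Because each iteration changes only one weight by a fixed small step, weights of unhelpful features tend to stay at zero, which is why stagewise procedures of this kind are often described as tracing out sparse, lasso-like solutions. BLasso, as described in the abstract, is a separate approximation method and is not shown here.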

Citation (APA)

Gao, J., Suzuki, H., & Yu, B. (2006). Approximation lasso methods for language modeling. In COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Vol. 1, pp. 225–232). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1220175.1220204
