We describe a speedup for training conditional maximum entropy models. The algorithmisasimple variation on Generalized Iterative Scaling, but converges roughly an order of magnitude faster, depending on the number of constraints, and the way speed is measured. Rather than attempting to train all model parameters simultaneously, the algorithm trains them sequentially. The algorithm is easy to implement, typically uses only slightly more memory, and will lead to improvements for most maximum entropy problems.
CITATION STYLE
Goodman, J. (2002). Sequential conditional generalized iterative scaling. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 2002-July, pp. 9–16). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1073083.1073086
Mendeley helps you to discover research relevant for your work.