Fast Newton-CG Method for Batch Learning of Conditional Random Fields

1Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.

Abstract

We propose a fast batch learning method for linear-chain Conditional Random Fields (CRFs) based on Newton-CG methods. Newton-CG methods are a variant of Newton method for high-dimensional problems. They only require the Hessian-vector products instead of the full Hessian matrices. To speed up Newton-CG methods for the CRF learning, we derive a novel dynamic programming procedure for the Hessian-vector products of the CRF objective function. The proposed procedure can reuse the byproducts of the time-consuming gradient computation for the Hessian-vector products to drastically reduce the total computation time of the Newton-CG methods. In experiments with tasks in natural language processing, the proposed method outperforms a conventional quasi-Newton method. Remarkably, the proposed method is competitive with online learning algorithms that are fast but unstable.

Cite

CITATION STYLE

APA

Tsuboi, Y., Unno, Y., Kashima, H., & Okazaki, N. (2011). Fast Newton-CG Method for Batch Learning of Conditional Random Fields. In Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011 (pp. 489–494). AAAI Press. https://doi.org/10.1609/aaai.v25i1.7894

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free