Improving sequence segmentation learning by predicting trigrams

10Citations
Citations of this article
97Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Symbolic machine-learning classifiers are known to suffer from near-sightedness when performing sequence segmentation (chunking) tasks in natural language processing: without special architectural additions they are oblivious of the decisions they made earlier when making new ones. We introduce a new pointwise-prediction single-classifier method that predicts trigrams of class labels on the basis of windowed input sequences, and uses a simple voting mechanism to decide on the labels in the final output sequence. We apply the method to maximum-entropy, sparsewinnow, and memory-based classifiers using three different sentence-level chunking tasks, and show that the method is able to boost generalization performance in most experiments, attaining error reductions of up to 51%. We compare and combine the method with two known alternative methods to combat near-sightedness, viz. a feedback-loop method and a stacking method, using the memory-based classifier. The combination with a feedback loop suffers from the label bias problem, while the combination with a stacking method produces the best overall results. © 2005 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

van den Bosch, A., & Daelemans, W. (2005). Improving sequence segmentation learning by predicting trigrams. In CoNLL 2005 - Proceedings of the Ninth Conference on Computational Natural Language Learning (pp. 80–87). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1706543.1706557

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free