Improving sequence segmentation learning by predicting trigrams

Antal van den Bosch; Walter Daelemans

Conference Proceedings

Improving sequence segmentation learning by predicting trigrams

CoNLL 2005 - Proceedings of the Ninth Conference on Computational Natural Language Learning (2005) 80-87

DOI: 10.3115/1706543.1706557

10Citations

97Readers

Get full text

Abstract

Symbolic machine-learning classifiers are known to suffer from near-sightedness when performing sequence segmentation (chunking) tasks in natural language processing: without special architectural additions they are oblivious of the decisions they made earlier when making new ones. We introduce a new pointwise-prediction single-classifier method that predicts trigrams of class labels on the basis of windowed input sequences, and uses a simple voting mechanism to decide on the labels in the final output sequence. We apply the method to maximum-entropy, sparsewinnow, and memory-based classifiers using three different sentence-level chunking tasks, and show that the method is able to boost generalization performance in most experiments, attaining error reductions of up to 51%. We compare and combine the method with two known alternative methods to combat near-sightedness, viz. a feedback-loop method and a stacking method, using the memory-based classifier. The combination with a feedback loop suffers from the label bias problem, while the combination with a stacking method produces the best overall results. © 2005 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

van den Bosch, A., & Daelemans, W. (2005). Improving sequence segmentation learning by predicting trigrams. In CoNLL 2005 - Proceedings of the Ninth Conference on Computational Natural Language Learning (pp. 80–87). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1706543.1706557

Improving sequence segmentation learning by predicting trigrams

Abstract

Cite

Register to see more suggestions