An analysis of active learning strategies for sequence labeling tasks

Burr Settles; Mark Craven

Conference Proceedings

An analysis of active learning strategies for sequence labeling tasks

EMNLP 2008 - 2008 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference: A Meeting of SIGDAT, a Special Interest Group of the ACL (2008) 1070-1079

DOI: 10.3115/1613715.1613855

884Citations

642Readers

Get full text

Abstract

Active learning is well-suited to many problems in natural language processing, where unlabeled data may be abundant but annotation is slow and expensive. This paper aims to shed light on the best active learning approaches for sequence labeling tasks such as information extraction and document segmentation. We survey previously used query selection strategies for sequence models, and propose several novel algorithms to address their shortcomings. We also conduct a large-scale empirical comparison using multiple corpora, which demonstrates that our proposed methods advance the state of the art. © 2008 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

Settles, B., & Craven, M. (2008). An analysis of active learning strategies for sequence labeling tasks. In EMNLP 2008 - 2008 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference: A Meeting of SIGDAT, a Special Interest Group of the ACL (pp. 1070–1079). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1613715.1613855

An analysis of active learning strategies for sequence labeling tasks

Abstract

Cite

Register to see more suggestions