Segment-based hidden Markov models for Information Extraction

Zhenmei Gu; Nick Cercone

Conference Proceedings

Segment-based hidden Markov models for Information Extraction

COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (2006) 1 481-488

DOI: 10.3115/1220175.1220236

14Citations

90Readers

Get full text

Abstract

Hidden Markov models (HMMs) are powerful statistical models that have found successful applications in Information Extraction (IE). In current approaches to applying HMMs to IE, an HMM is used to model text at the document level. This modelling might cause undesired redundancy in extraction in the sense that more than one filler is identified and extracted. We propose to use HMMs to model text at the segment level, in which the extraction process consists of two steps: a segment retrieval step followed by an extraction step. In order to retrieve extractionrelevant segments from documents, we introduce a method to use HMMs to model and retrieve segments. Our experimental results show that the resulting segment HMM IE system not only achieves near zero extraction redundancy, but also has better overall extraction performance than traditional document HMM IE systems. © 2006 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

Gu, Z., & Cercone, N. (2006). Segment-based hidden Markov models for Information Extraction. In COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Vol. 1, pp. 481–488). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1220175.1220236

Segment-based hidden Markov models for Information Extraction

Abstract

Cite

Register to see more suggestions