A Hybrid Machine Learning Approach for Information Extraction from Free Text

  • Neumann G
N/ACitations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We present a hybrid machine learning approach for information extraction from unstructured documents by integrating a learned classifier based on the Maximum Entropy Modeling (MEM), and a classifier based on our work on Data-Oriented Parsing (DOP). The hybrid behavior is achieved through a voting mechanism applied by an iterative tag-insertion algorithm. We have tested the method on a corpus of German newspaper articles about company turnover, and achieved 85.2% F-measure using the hybrid approach, compared to 79.3% for MEM and 51.9% for DOP when running them in isolation.

Cite

CITATION STYLE

APA

Neumann, G. (2006). A Hybrid Machine Learning Approach for Information Extraction from Free Text. In From Data and Information Analysis to Knowledge Engineering (pp. 390–397). Springer-Verlag. https://doi.org/10.1007/3-540-31314-1_47

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free