A Hybrid Machine Learning Approach for Information Extraction from Free Text

Günter Neumann

Book Chapter

Springer-Verlag, (2006), 390-397

DOI: 10.1007/3-540-31314-1_47

Abstract

We present a hybrid machine learning approach for information extraction from unstructured documents that integrates a learned classifier based on Maximum Entropy Modeling (MEM) with a classifier based on our work on Data-Oriented Parsing (DOP). The hybrid behavior is achieved through a voting mechanism applied by an iterative tag-insertion algorithm. We tested the method on a corpus of German newspaper articles about company turnover and achieved an F-measure of 85.2% with the hybrid approach, compared to 79.3% for MEM and 51.9% for DOP when each is run in isolation.
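
The abstract does not spell out the voting mechanism or the tag-insertion loop, so the following is only a minimal sketch of how such a combination could look, not the chapter's algorithm. It assumes each classifier returns proposals of the form (token index, tag, confidence), that votes are combined by summing confidences, and that one tag is inserted per iteration until no proposal reaches a threshold. The names `vote`, `hybrid_tag`, `Proposal` and the `threshold` parameter are hypothetical.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical proposal format: (token_index, tag, confidence).
Proposal = Tuple[int, str, float]
# A classifier sees the tokens and the tags inserted so far (assumed interface).
Classifier = Callable[[List[str], List[Optional[str]]], List[Proposal]]


def vote(proposals: List[Proposal]) -> Optional[Proposal]:
    """Sum confidences per (position, tag) and return the best-supported pair."""
    totals: Dict[Tuple[int, str], float] = {}
    for index, tag, confidence in proposals:
        totals[(index, tag)] = totals.get((index, tag), 0.0) + confidence
    if not totals:
        return None
    (index, tag), score = max(totals.items(), key=lambda item: item[1])
    return index, tag, score


def hybrid_tag(
    tokens: List[str],
    classifiers: List[Classifier],
    threshold: float = 0.5,  # assumed cut-off, not taken from the chapter
) -> List[Optional[str]]:
    """Insert one winning tag per iteration until no proposal is strong enough."""
    tags: List[Optional[str]] = [None] * len(tokens)
    while True:
        # Collect proposals from all classifiers for still-untagged positions.
        proposals = [
            p for clf in classifiers for p in clf(tokens, tags) if tags[p[0]] is None
        ]
        best = vote(proposals)
        if best is None or best[2] < threshold:
            return tags
        tags[best[0]] = best[1]
```

Because every iteration fills one previously untagged position, the loop ends after at most one pass per token. In this sketch the MEM and DOP components would be two entries in `classifiers`; how the chapter actually weights or orders their votes is not stated in the abstract.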

Cite

Neumann, G. (2006). A Hybrid Machine Learning Approach for Information Extraction from Free Text. In From Data and Information Analysis to Knowledge Engineering (pp. 390–397). Springer-Verlag. https://doi.org/10.1007/3-540-31314-1_47
