Bottom-up relational learning of pattern matching rules for information extraction

141Citations
Citations of this article
154Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Information extraction is a form of shallow text processing that locates a specified set of relevant items in a natural-language document. Systems for this task require significant domain-specific knowledge and are time-consuming and difficult to build by hand, making them a good application for machine learning. We present an algorithm, RAPIER, that uses pairs of sample documents and filled templates to induce pattern-match rules that directly extract fillers for the slots in the template. RAPIER is a bottom-up learning algorithm that incorporates techniques from several inductive logic programming systems. We have implemented the algorithm in a system that allows patterns to have constraints on the words, part-of-speech tags, and semantic classes present in the filler and the surrounding text. We present encouraging experimental results on two domains.

Cite

CITATION STYLE

APA

Califf, M. E., & Mooney, R. J. (2004). Bottom-up relational learning of pattern matching rules for information extraction. Journal of Machine Learning Research, 4(2), 177–210. https://doi.org/10.1162/153244304322972685

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free