Mining information extraction rules from datasheets without linguistic parsing

Rakesh Agrawal; Howard Ho; François Jacquenet; Marielle Jacquenet

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2005) 3533 LNAI 510-520

DOI: 10.1007/11504894_69

Abstract

In the context of the Pangea project at IBM, we needed to design an information extraction module to extract information from datasheets. Unlike several machine-learning-based information extraction systems, which require linguistic parsing of the documents, we propose a hybrid approach based on association rule mining and decision tree learning that does not require any linguistic processing. The system can be parameterized in various ways that influence the efficiency of the discovered information extraction rules. The experiments show that the system does not need a large training set to perform well. © Springer-Verlag Berlin Heidelberg 2005.

Citation (APA)

Agrawal, R., Ho, H., Jacquenet, F., & Jacquenet, M. (2005). Mining information extraction rules from datasheets without linguistic parsing. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3533 LNAI, pp. 510–520). Springer Verlag. https://doi.org/10.1007/11504894_69
