An improvement of pattern-based information extraction using intuitionistic fuzzy sets

0Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Multi-slot information extraction (IE) is a task that identify several related entities simultaneously. Most researches on this task are concerned with applying IE patterns (rules) to extract related entities from unstructured documents. An important obstacle for the success in this task is unknowingness where text portions containing interested information are. This problem is more complicated when involving languages with sentence boundary ambiguity, e.g. the Thai language. Applying IE rules to all reasonable text portions can degrade the effect of the obstacle, but it raises another problem that is incorrect (unwanted) extractions. This paper aims to present amethod for removing incorrect extractions. In the method, extractions are represented as intuitionistic fuzzy sets (IFSs), and a similarity measure for IFSs is used to calculate distance between IFS of an unclassified extraction and that of each already-classified extraction. The concept of k nearest neighbor is adopted to design whether the unclassified extraction is correct of not. From the preliminary experiment on a medical domain, the proposed technique improves extraction precision while satisfactorily preserving recall.

Cite

CITATION STYLE

APA

Intarapaiboon, P., & Theeramunkong, T. (2016). An improvement of pattern-based information extraction using intuitionistic fuzzy sets. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10053 LNAI, pp. 63–75). Springer Verlag. https://doi.org/10.1007/978-3-319-49397-8_6

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free