Utilizing WordNet and regular expressions for instance-based schema matching

Abstract

Instance-based matching is the process of finding correspondences between schema elements by comparing the data stored in different data sources. It serves as an alternative when matching on the schema elements themselves fails. Instance-based matching is applied in many areas, such as website creation and management, schema evolution and migration, data warehousing, database design, and data integration. Schema information (element name, description, data type, etc.) is sometimes unavailable, or it cannot produce the correct match, especially when element names are abbreviations. When schema-level matching fails, the next step is to examine the values stored in the schemas. For these reasons, many recent approaches focus on instance-based matching. In this study, we propose an approach that combines pattern recognition using regular expressions for the numerical domain with WordNet for the string domain, producing a similarity coefficient in the range [0,1]. In the previous approach, regular expressions achieved good accuracy for numerical instances only and were not applied to string instances, because deciding whether two strings match requires knowing their meaning. Using WordNet-based measures for string instances is expected to improve effectiveness in terms of Precision (P), Recall (R), and F-measure (F). The approach is evaluated on a real dataset, and the results are better than those obtained using only an equality measure for strings, especially when the schemas are disjoint. The approach achieved an F-measure of 95.3%.
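
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the regular expressions in PATTERNS, the tiny SYNSETS table (a hypothetical stand-in for WordNet lookups), and all function names are assumptions made for illustration only. It shows one plausible way to obtain a similarity coefficient in [0,1] for numerical and string columns, and how Precision, Recall, and F-measure are computed from a set of proposed matches.

```python
import re

# Hypothetical patterns for numerical domains; the paper's actual
# regular expressions are not given in the abstract.
PATTERNS = {
    "phone": re.compile(r"\d{3}-\d{3}-\d{4}"),
    "zip": re.compile(r"\d{5}"),
    "year": re.compile(r"(19|20)\d{2}"),
}

# Toy synonym groups standing in for WordNet synsets (assumption).
SYNSETS = [{"car", "automobile", "auto"}, {"city", "town"}]


def pattern_profile(values):
    """Fraction of values that fully match each pattern."""
    if not values:
        return {name: 0.0 for name in PATTERNS}
    n = len(values)
    return {
        name: sum(bool(p.fullmatch(v)) for v in values) / n
        for name, p in PATTERNS.items()
    }


def numeric_similarity(col_a, col_b):
    """Similarity in [0, 1]: best shared pattern coverage of two columns."""
    pa, pb = pattern_profile(col_a), pattern_profile(col_b)
    return max(min(pa[k], pb[k]) for k in PATTERNS)


def word_similarity(a, b):
    """1.0 if equal or in the same toy synonym group, else 0.0."""
    a, b = a.lower(), b.lower()
    if a == b:
        return 1.0
    return 1.0 if any(a in s and b in s for s in SYNSETS) else 0.0


def string_similarity(col_a, col_b):
    """Average best-match score of each value in col_a against col_b."""
    if not col_a or not col_b:
        return 0.0
    return sum(max(word_similarity(a, b) for b in col_b) for a in col_a) / len(col_a)


def precision_recall_f(found, expected):
    """Precision, Recall, and F-measure for a set of proposed matches."""
    found, expected = set(found), set(expected)
    tp = len(found & expected)
    p = tp / len(found) if found else 0.0
    r = tp / len(expected) if expected else 0.0
    f = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f
```

With an equality-only measure, the columns ["car"] and ["automobile"] would score 0; the synonym lookup is what lets disjoint string columns still match, which is the case the abstract highlights. A real implementation would replace SYNSETS with a WordNet-based similarity measure.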

Cite

Mahdi, A. M., & Tiun, S. (2014). Utilizing WordNet and regular expressions for instance-based schema matching. Research Journal of Applied Sciences, Engineering and Technology, 8(4), 460–470. https://doi.org/10.19026/rjaset.8.994
