An Optimized Approach of Modified BAT Algorithm to Record Deduplication

  • Banu.A F
  • C C

Abstract

The task of identifying, in a data warehouse, records that refer to the same real-world entity despite misspelled words, typos, different writing styles, or even different schema representations or data types is called record deduplication. Existing research proposed a genetic programming (GP) approach to record deduplication. That approach combines several different pieces of evidence extracted from the data content to produce a deduplication function that can identify whether two or more entries in a repository are duplicates. Because record deduplication is a time-consuming task even for small repositories, its aim is to provide a method that finds a proper combination of the best pieces of evidence, thus yielding a deduplication function that maximises performance using a small representative portion of the corresponding data for training purposes; however, the optimisation of the process remains limited. Our research addresses these issues with a novel technique, a modified bat algorithm for record deduplication. The motivation is to produce a flexible and effective method that employs data mining algorithms. The scheme shares many similarities with evolutionary computation techniques such as the genetic programming approach: it is initialised with a population of random solutions and searches for optima by updating the bats over successive generations. However, unlike GP, the modified bat algorithm has no evolution operators such as crossover and mutation. We also compare the proposed algorithm experimentally with other existing algorithms, including GP.
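
The abstract does not specify the modification, so the following is only a minimal sketch of a basic bat algorithm, not the authors' method. It assumes the deduplication function is a weighted average of similarity scores compared against a fixed 0.5 threshold, and it keeps loudness and pulse rate constant for brevity. The names `bat_search`, `errors`, and `PAIRS`, the toy data, and all parameter values are hypothetical.

```python
import random

# Hypothetical toy training set: each pair of records is described by three
# similarity scores (the "pieces of evidence") and a duplicate/non-duplicate label.
PAIRS = [
    ((0.9, 0.8, 0.2), True),
    ((0.8, 0.9, 0.9), True),
    ((0.2, 0.1, 0.9), False),
    ((0.3, 0.2, 0.8), False),
]

def errors(weights):
    """Fitness: number of training pairs misclassified by the weighted evidence."""
    total = sum(weights) or 1e-9  # avoid division by zero for all-zero weights
    wrong = 0
    for scores, is_dup in PAIRS:
        combined = sum(w * s for w, s in zip(weights, scores)) / total
        wrong += (combined >= 0.5) != is_dup
    return wrong

def bat_search(fitness, dim, n_bats=20, n_iter=100, f_min=0.0, f_max=2.0,
               loudness=0.9, pulse_rate=0.5, lo=0.0, hi=1.0, seed=0):
    """Minimise `fitness` over [lo, hi]^dim with a basic bat algorithm."""
    rng = random.Random(seed)

    def clip(x):
        return max(lo, min(hi, x))

    # Start from a population of random solutions, as the abstract describes.
    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_bats)]
    vel = [[0.0] * dim for _ in range(n_bats)]
    fit = [fitness(p) for p in pos]
    best_i = min(range(n_bats), key=fit.__getitem__)
    best, best_fit = pos[best_i][:], fit[best_i]

    for _ in range(n_iter):
        for i in range(n_bats):
            # Frequency-tuned velocity and position update; no crossover or mutation.
            freq = f_min + (f_max - f_min) * rng.random()
            vel[i] = [v + (x - b) * freq for v, x, b in zip(vel[i], pos[i], best)]
            cand = [clip(x + v) for x, v in zip(pos[i], vel[i])]
            if rng.random() > pulse_rate:
                # Local random walk around the current best solution.
                cand = [clip(b + 0.05 * rng.gauss(0, 1)) for b in best]
            cand_fit = fitness(cand)
            # Accept the candidate with a probability given by the loudness.
            if cand_fit <= fit[i] and rng.random() < loudness:
                pos[i], fit[i] = cand, cand_fit
            if cand_fit < best_fit:
                best, best_fit = cand[:], cand_fit
    return best, best_fit

if __name__ == "__main__":
    weights, n_errors = bat_search(errors, dim=3)
    print("weights:", weights, "training errors:", n_errors)
```

In this toy setup the third similarity score is misleading, so a good weight vector should favour the first two; a real study would evaluate the learned function on held-out record pairs rather than on the training pairs.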

Cite

Banu.A, F., & C, Chandrasekar. (2013). An Optimized Approach of Modified BAT Algorithm to Record Deduplication. International Journal of Computer Applications, 62(1), 10–15. https://doi.org/10.5120/10043-4627
