Enhancing Data Accuracy and Efficiency: An Overview of Fuzzy Matching Techniques

  • Kalluru J
N/ACitations
Citations of this article
17Readers
Mendeley users who have this article in their library.

Abstract

: Fuzzy matching, also known as approximate string matching, is a powerful technique designed to improve data accuracy and efficiency by identifying and linking strings that exhibit partial similarity. Unlike traditional exact matching, which requires precise character - by - character agreement, fuzzy matching accounts for typographical errors, misspellings, and variations, allowing for a more flexible comparison. This paper presents an overview of fuzzy matching techniques and their applications across diverse domains. We delve into the core concepts of various algorithms, including Levenshtein distance, Jaccard similarity, soundex, and metaphone, exploring how each method quantifies the similarity between strings. The paper highlights their strengths and use cases in data cleaning, deduplication, information retrieval, natural language processing, record linkage, and named entity recognition.

Cite

CITATION STYLE

APA

Kalluru, J. (2023). Enhancing Data Accuracy and Efficiency: An Overview of Fuzzy Matching Techniques. International Journal of Science and Research (IJSR), 12(8), 685–690. https://doi.org/10.21275/sr23805184140

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free