Enhancing Data Accuracy and Efficiency: An Overview of Fuzzy Matching Techniques

Jahnavi Kalluru

Journal ArticleOPEN ACCESS

Enhancing Data Accuracy and Efficiency: An Overview of Fuzzy Matching Techniques

Kalluru J

International Journal of Science and Research (IJSR) (2023) 12(8) 685-690

DOI: 10.21275/sr23805184140

N/ACitations

17Readers

Abstract

: Fuzzy matching, also known as approximate string matching, is a powerful technique designed to improve data accuracy and efficiency by identifying and linking strings that exhibit partial similarity. Unlike traditional exact matching, which requires precise character - by - character agreement, fuzzy matching accounts for typographical errors, misspellings, and variations, allowing for a more flexible comparison. This paper presents an overview of fuzzy matching techniques and their applications across diverse domains. We delve into the core concepts of various algorithms, including Levenshtein distance, Jaccard similarity, soundex, and metaphone, exploring how each method quantifies the similarity between strings. The paper highlights their strengths and use cases in data cleaning, deduplication, information retrieval, natural language processing, record linkage, and named entity recognition.

Cite

CITATION STYLE

APA

Kalluru, J. (2023). Enhancing Data Accuracy and Efficiency: An Overview of Fuzzy Matching Techniques. International Journal of Science and Research (IJSR), 12(8), 685–690. https://doi.org/10.21275/sr23805184140

Enhancing Data Accuracy and Efficiency: An Overview of Fuzzy Matching Techniques

Abstract

Cite

Register to see more suggestions