Jaro-Winkler Distance Improvement for Approximate String Search Using Indexing Data for Multiuser Application

9Citations
Citations of this article
39Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Word searching method has been developed in many ways and named as: Hamming Distance, Jaccard Distance, Jaro Distance, Jaro-Winkler Distance, Levenshtein Distance, etc. Those methods are used for lexicographic comparison to find words according to the similarity of the words which searched. The time needed for searching by using these words distance method can cause overhead as some difference user might try to search the same words all over. If these method is used in a multi user application where the user generally searching for some keywords repeatedly, then the user might have a longer searching time compared to exact search. In spite of this problem, we try to propose a method where the first search result of the previous user, will be recorded to the database for future usage by indexing the search keywords. In order to try this method, we use Jaro-winkler Distance method to search words. From the test result show that combining indexing and similarity word searching by using Jaro-Winkler Distance method can decrease the searching time to 90-92% compared to just using the Jaro-Winkler Distance method only. As the searched data increased, the processing time can be shorten.

Cite

CITATION STYLE

APA

Friendly, F. (2019). Jaro-Winkler Distance Improvement for Approximate String Search Using Indexing Data for Multiuser Application. In Journal of Physics: Conference Series (Vol. 1361). Institute of Physics Publishing. https://doi.org/10.1088/1742-6596/1361/1/012080

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free