Abstract
This paper analyzes existing approaches to approximate string matching: linear search with Levenshtein distance, and the AllScan and CPMerge algorithms using the cosine, Jaccard, and Dice measures. These methods are presented and compared with our approach, which improves indexing time using Locality-Sensitive Hashing. Advantages and drawbacks of each method are identified from theoretical considerations and from empirical evaluations on real-life dictionaries.
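As an illustration of the measures named in the abstract, the following minimal sketch computes Levenshtein distance and the set-based cosine, Jaccard, and Dice coefficients. It assumes strings are represented as sets of padded character n-grams; the helper names (`ngrams`, `jaccard`, `dice`, `cosine`, `levenshtein`), the `$` padding character, and the default n-gram size are illustrative choices, not details taken from the paper.

```python
from math import sqrt


def ngrams(s, n=3):
    """Return the set of character n-grams of s, padded with '$' on both sides.

    Padding and n=3 are assumptions made for this sketch.
    """
    padded = "$" * (n - 1) + s + "$" * (n - 1)
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


# The three set-based coefficients below assume non-empty feature sets,
# which padding guarantees for n >= 2.
def jaccard(a, b):
    return len(a & b) / len(a | b)


def dice(a, b):
    return 2 * len(a & b) / (len(a) + len(b))


def cosine(a, b):
    return len(a & b) / sqrt(len(a) * len(b))


def levenshtein(s, t):
    """Edit distance via the standard dynamic program, keeping one row at a time."""
    prev = list(range(len(t) + 1))
    for i, cs in enumerate(s, 1):
        cur = [i]
        for j, ct in enumerate(t, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (cs != ct)))  # substitution or match
        prev = cur
    return prev[-1]
```

A linear-search baseline would evaluate one of these functions between the query and every dictionary entry, which is the cost that indexed approaches aim to avoid.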
Cite
Boguszewski, A., Szymanski, J., & Draszawka, K. (2016). Towards increasing F-measure of approximate string matching in O(1) complexity. In Proceedings of the 2016 Federated Conference on Computer Science and Information Systems, FedCSIS 2016 (pp. 527–532). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.15439/2016F311