Correcting and standardizing crude drug names in traditional medicine formulae by ensemble of string matching techniques

Abstract

Crude drug names in traditional herbal formulae are commonly recorded with spelling errors, grammatical variants, synonyms, and inconsistent formats. To make these names unambiguous and usable, they need to be corrected and standardized. In this work, crude drug names in various forms were corrected and standardized using string matching techniques. A set of experiments was conducted using crude drug names from the Thai Food and Drug Administration's database of registered traditional medicines as the test set. Two well-known algorithms, similar text and Levenshtein, were investigated. Used individually, each algorithm matched the crude drug names in the test set to those in the standard set only moderately well. To improve on the single algorithms, an ensemble algorithm was proposed. The results show that the ensemble algorithm outperforms the single algorithms at matching crude drug names, especially names that carry a modifier with no significant meaning.
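The abstract does not say how the two measures are combined, so the following Python sketch is only an illustration of the general idea, not the authors' method. It assumes that "similar text" refers to a measure in the style of PHP's `similar_text` (recursive longest common substring), that both scores are normalized to the range 0 to 1, and that the ensemble is a plain average of the two. The function names and the three-entry standard list are hypothetical.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum number of insertions, deletions and substitutions."""
    if len(a) < len(b):
        a, b = b, a
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def levenshtein_similarity(a: str, b: str) -> float:
    """Map edit distance to a 0..1 similarity (1.0 means identical)."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


def common_chars(a: str, b: str) -> int:
    """Count matching characters: take the longest common substring,
    then recurse on the pieces to its left and to its right."""
    best_len = best_i = best_j = 0
    for i in range(len(a)):
        for j in range(len(b)):
            k = 0
            while i + k < len(a) and j + k < len(b) and a[i + k] == b[j + k]:
                k += 1
            if k > best_len:
                best_len, best_i, best_j = k, i, j
    if best_len == 0:
        return 0
    return (best_len
            + common_chars(a[:best_i], b[:best_j])
            + common_chars(a[best_i + best_len:], b[best_j + best_len:]))


def similar_text_ratio(a: str, b: str) -> float:
    """Similarity in the style of PHP's similar_text, scaled to 0..1."""
    total = len(a) + len(b)
    return 1.0 if total == 0 else 2.0 * common_chars(a, b) / total


def ensemble_match(query: str, standard_names: list[str]) -> tuple[str, float]:
    """Return the standard name with the highest averaged score.
    Averaging is an assumption; the source does not specify the combination rule."""
    q = query.strip().lower()

    def score(name: str) -> float:
        n = name.lower()
        return (levenshtein_similarity(q, n) + similar_text_ratio(q, n)) / 2

    best = max(standard_names, key=score)
    return best, score(best)


# Hypothetical standard set, for illustration only.
STANDARD = ["Zingiber officinale", "Curcuma longa", "Piper nigrum"]

if __name__ == "__main__":
    name, score = ensemble_match("Zingiber oficinale", STANDARD)
    print(name, round(score, 3))
```

In practice a minimum score threshold would likely be needed so that names with no good counterpart in the standard set are left unmatched rather than forced onto the nearest entry.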

Cite

Pakdeesattayapong, D., & Lertnattee, V. (2015). Correcting and standardizing crude drug names in traditional medicine formulae by ensemble of string matching techniques. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9226, pp. 237–247). Springer Verlag. https://doi.org/10.1007/978-3-319-22186-1_24
