A comparison of methods for automatic term extraction for domain analysis

3Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Fourteen word frequency metrics were tested to evaluate their effectiveness in identifying vocabulary in a domain. Fifteen domain-engineering projects were examined to measure how closely the vocabularies selected by the fourteen word frequency metrics were to the vocabularies produced by domain engineers. Stemming and stopword removal were also evaluated to measure their impact on selecting proper vocabulary terms. The results of the experiment show that stemming and stopword removal do improve performance and that term frequency is a valuable contributor to performance. Most word frequency metrics gave similar results. A few of the metrics did poorly compared to the others.

Cite

CITATION STYLE

APA

Frakes, W. B., Kulczycki, G., & Tilley, J. (2014). A comparison of methods for automatic term extraction for domain analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8919, pp. 269–281). Springer Verlag. https://doi.org/10.1007/978-3-319-14130-5_19

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free