This paper investigates the role of co-occurrence of low frequent terms in patent classification. A comparison is made between indexing, weighting single term features and multi-term features based on low frequent terms. Three datasets are used for experimentation. An increase of almost 21 percent in classification accuracy is observed through experimentation when multi-term features based on low frequent terms in patents are considered as compared to when all word types are considered.
CITATION STYLE
Khattak, A. S., & Heyer, G. (2014). Exploiting co-occurrence of low frequent terms in patents. In Advances in Intelligent Systems and Computing (Vol. 242, pp. 459–466). Springer Verlag. https://doi.org/10.1007/978-3-319-02309-0_50
Mendeley helps you to discover research relevant for your work.