In this paper we view the automated selection of patent classification codes as a collection selection problem that can be addressed using existing methods which we extend and adapt for the patent domain. Our work exploits the manually assigned International Patent Classification (IPC) codes of patent documents to cluster, distribute and index patents through hundreds or thousands of sub-collections. We examine different collection selection methods (CORI, Bordafuse, ReciRank and multilayer) and compare their effectiveness in selecting relevant IPCs. The multilayer method, in addition to utilizing the topical relevance of IPCs at a specific level (e.g. sub-class), exploits the topical relevance of their ancestors in the IPC hierarchy and aggregates those multiple estimations of relevance to a single estimation. The results show that multilayer outperforms CORI and fusion-based methods in the task of IPC suggestion.
CITATION STYLE
Giachanou, A., & Salampasis, M. (2014). IPC selection using collection selection algorithms. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8849, 41–52. https://doi.org/10.1007/978-3-319-12979-2_4
Mendeley helps you to discover research relevant for your work.