Semantically Aware Text Categorisation for Metadata Annotation

Giulio Carducci; Marco Leontino; Daniele P. Radicioni; Guido Bonino; Enrico Pasini; Paolo Tripodi

Conference Proceedings

Semantically Aware Text Categorisation for Metadata Annotation

Communications in Computer and Information Science (2019) 988 315-330

DOI: 10.1007/978-3-030-11226-4_25

9Citations

6Readers

Get full text

Abstract

In this paper we illustrate a system aimed at solving a long-standing and challenging problem: acquiring a classifier to automatically annotate bibliographic records by starting from a huge set of unbalanced and unlabelled data. We illustrate the main features of the dataset, the learning algorithm adopted, and how it was used to discriminate philosophical documents from documents of other disciplines. One strength of our approach lies in the novel combination of a standard learning approach with a semantic one: the results of the acquired classifier are improved by accessing a semantic network containing conceptual information. We illustrate the experimentation by describing the construction rationale of training and test set, we report and discuss the obtained results and conclude by drawing future work.

Author supplied keywords

Cite

CITATION STYLE

APA

Carducci, G., Leontino, M., Radicioni, D. P., Bonino, G., Pasini, E., & Tripodi, P. (2019). Semantically Aware Text Categorisation for Metadata Annotation. In Communications in Computer and Information Science (Vol. 988, pp. 315–330). Springer Verlag. https://doi.org/10.1007/978-3-030-11226-4_25

Semantically Aware Text Categorisation for Metadata Annotation

Abstract

Author supplied keywords

Cite

Register to see more suggestions