In this paper, we present an approach for classifying documents based on the notion of a semantic similarity and the effective representation of the content of the documents. The content of a document is annotated and the resulting annotation is represented by a labeled tree whose nodes and edges are represented by concepts lying within a domain ontology. A reasoning process may be carried out on annotation trees, allowing the comparison of documents between each others, for classification or information retrieval purposes. An algorithm for classifying documents with respect to semantic similarity and a discussion conclude the paper. © Springer-Verlag Berlin Heidelberg 2006.
CITATION STYLE
Nauer, E., & Napoli, A. (2006). A proposal for annotation, semantic similarity and classification of textual documents. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4183 LNCS, pp. 201–212). Springer Verlag. https://doi.org/10.1007/11861461_22
Mendeley helps you to discover research relevant for your work.