Design and implementation of an ontology algorithm for web documents classification

Guiyi Wei; Jun Yu; Yun Ling; Jun Liu

Conference Proceedings

Design and implementation of an ontology algorithm for web documents classification

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 3983 LNCS 649-658

DOI: 10.1007/11751632_71

3Citations

8Readers

Get full text

Abstract

Traditional methods of documents classification need characteristic abstraction and classifier training. The work of collecting trainable text terms is laborious and time-consuming. Additionally, it is difficult to abstract the characteristics from Chinese documents. In order to solve the problem, this paper proposes an ontology-based approach to improve the efficiency and effectiveness of web documents classification and retrieval. Firstly, the approach establishes an ontology model based on Hownet[6] kownledge base and its method. Then, it creates ontologies for each subclass of the classification system. It uses RDFS to convert Hownet into ontology and to define the relations among ontologies. The web documents classification is performed automatically using the ontology relevance calculating algorithm. Comparing with the method of KNN[2], the results of our experiments indicate that the accuracy of ontologybased approach is close to KNN, its algorithms is more robust than KNN, and its recalling rate is better than KNN. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Wei, G., Yu, J., Ling, Y., & Liu, J. (2006). Design and implementation of an ontology algorithm for web documents classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3983 LNCS, pp. 649–658). Springer Verlag. https://doi.org/10.1007/11751632_71

Design and implementation of an ontology algorithm for web documents classification

Abstract

Cite

Register to see more suggestions