Traditional methods of documents classification need characteristic abstraction and classifier training. The work of collecting trainable text terms is laborious and time-consuming. Additionally, it is difficult to abstract the characteristics from Chinese documents. In order to solve the problem, this paper proposes an ontology-based approach to improve the efficiency and effectiveness of web documents classification and retrieval. Firstly, the approach establishes an ontology model based on Hownet[6] kownledge base and its method. Then, it creates ontologies for each subclass of the classification system. It uses RDFS to convert Hownet into ontology and to define the relations among ontologies. The web documents classification is performed automatically using the ontology relevance calculating algorithm. Comparing with the method of KNN[2], the results of our experiments indicate that the accuracy of ontologybased approach is close to KNN, its algorithms is more robust than KNN, and its recalling rate is better than KNN. © Springer-Verlag Berlin Heidelberg 2006.
CITATION STYLE
Wei, G., Yu, J., Ling, Y., & Liu, J. (2006). Design and implementation of an ontology algorithm for web documents classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3983 LNCS, pp. 649–658). Springer Verlag. https://doi.org/10.1007/11751632_71
Mendeley helps you to discover research relevant for your work.