We present a novel approach to incorporating semantic information to the problems of natural language processing, in particular to the document classification task. The approach builds on the intuition that semantic relatedness of words can be viewed as a non-static property of the words that depends on the particular task at hand. The semantic relatedness information is incorporated using feature transformations, where the transformations are based on a feature ontology and on the particular classification task and data. We demonstrate the approach on the problem of classifying MEDLINE-indexed documents using the MeSH ontology. The results suggest that the method is capable of improving the classification performance on most of the datasets.
CITATION STYLE
Ginter, F., Pyysalo, S., Boberg, J., Järvinen, J., & Salakoski, T. (2004). Ontology-based feature transformations: A data-driven approach. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3230, pp. 279–290). Springer Verlag. https://doi.org/10.1007/978-3-540-30228-5_25
Mendeley helps you to discover research relevant for your work.