The diversities of large-scale semistructured data make the extraction of implicit semantic information have enormous difficulties. This paper proposes an automatic and unsupervised method of text categorization, in which tree-shape structures are used to represent semantic knowledge and to explore implicit information by mining hidden structures without cumbersome lexical analysis. Mining implicit frequent structures in trees can discover both direct and indirect semantic relations, which largely enhances the accuracy of matching and classifying texts. The experimental results show that the proposed algorithm remarkably reduces the time and effort spent in training and classifying, which outperforms established competitors in correctness and effectiveness.
CITATION STYLE
Guo, L., Zuo, W., Peng, T., & Yue, L. (2015). Text Matching and Categorization: Mining Implicit Semantic Knowledge from Tree-Shape Structures. Mathematical Problems in Engineering, 2015. https://doi.org/10.1155/2015/723469
Mendeley helps you to discover research relevant for your work.