Many popular Text Classification (TC) models use simple occurrence or words in a document as features to base their classifications. They commonly assume word occurrences to be statistically independent in their design. Although such assumption does not hold in general. these TC models arc robust and efficient in their task. Some recent studies have shown contextsensitive TC approaches were able to perform better in general. On the other hand, although complex linguistic or semantic features may intuitively be more relevant in TC. studies on their effectiveness have produced mixed and inconclusive results. In this pajier. we present out investigation on the use of some complex linguistic features with two context-sensitive TC methods. Our experimental results show potential advantages of such approach. © Springer-Verlag Berlin Heidelberg 2006.
CITATION STYLE
Wong, A. K. S., Lee, J. W. T., & Yeung, D. S. (2006). Use of linguistic features in context-sensitive text classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3930 LNAI, pp. 701–710). https://doi.org/10.1007/11739685_73
Mendeley helps you to discover research relevant for your work.