GenDesc: A partial generalization of linguistic features for text classification

Guillaume Tisserant; Violaine Prince; Mathieu Roche

Conference Proceedings

GenDesc: A partial generalization of linguistic features for text classification

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 7934 LNCS 343-348

DOI: 10.1007/978-3-642-38824-8_35

0Citations

3Readers

Get full text

Abstract

This paper presents an application that belongs to automatic classification of textual data by supervised learning algorithms. The aim is to study how a better textual data representation can improve the quality of classification. Considering that a word meaning depends on its context, we propose to use features that give important information about word contexts. We present a method named GenDesc, which generalizes (with POS tags) the least relevant words for the classification task. © 2013 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Tisserant, G., Prince, V., & Roche, M. (2013). GenDesc: A partial generalization of linguistic features for text classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7934 LNCS, pp. 343–348). https://doi.org/10.1007/978-3-642-38824-8_35

GenDesc: A partial generalization of linguistic features for text classification

Abstract

Cite

Register to see more suggestions