Why text segment classification based on part of speech feature selection

Iulia Nagy; Katsuyuki Tanaka; Yasuo Ariki

Conference Proceedings

Why text segment classification based on part of speech feature selection

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6332 LNAI 87-101

DOI: 10.1007/978-3-642-16184-1_7

2Citations

3Readers

Get full text

Abstract

The aim of our research is to develop a scalable automatic why question answering system for English based on supervised method that uses part of speech analysis. The prior approach consisted in building a why-classifier using function words. This paper investigates the performance of combining supervised data mining methods with various feature selection strategies in order to obtain a more accurate why classifier.Feature selection was performed a priori on the dataset to extract representative verbs and/or nouns and avoid the dimensionality curse. LogitBoost and SVM were used for the classification process. Three methods of extending the initial "function words only" approach, to handle context-dependent features, are proposed and experimentally evaluated on various datasets. The first considers function words and context-independent adverbs; the second incorporates selected lemmatized verbs; the third contains selected lemmatized verbs & nouns. Experiments on web-extracted datasets showed that all methods performed better than the baseline, with slightly more reliable results for the third one. © 2010 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Nagy, I., Tanaka, K., & Ariki, Y. (2010). Why text segment classification based on part of speech feature selection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6332 LNAI, pp. 87–101). https://doi.org/10.1007/978-3-642-16184-1_7

Why text segment classification based on part of speech feature selection

Abstract

Author supplied keywords

Cite

Register to see more suggestions