Exploiting unlabeled data for question classification

David Tomás; Claudio Giuliano

Conference Proceedings

Exploiting unlabeled data for question classification

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 6716 LNCS 137-144

DOI: 10.1007/978-3-642-22327-3_13

1Citations

2Readers

Get full text

Abstract

In this paper, we introduce a kernel-based approach to question classification. We employed a kernel function based on latent semantic information acquired from Wikipedia. This kernel allows including external semantic knowledge into the supervised learning process. We obtained a highly effective question classifier combining this knowledge with a bag-of-words approach by means of composite kernels. As the semantic information is acquired from unlabeled text, our system can be easily adapted to different languages and domains. We tested it on a parallel corpus of English and Spanish questions. © 2011 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Tomás, D., & Giuliano, C. (2011). Exploiting unlabeled data for question classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6716 LNCS, pp. 137–144). https://doi.org/10.1007/978-3-642-22327-3_13

Exploiting unlabeled data for question classification

Abstract

Author supplied keywords

Cite

Register to see more suggestions