In this paper, we introduce a kernel-based approach to question classification. We employed a kernel function based on latent semantic information acquired from Wikipedia. This kernel allows including external semantic knowledge into the supervised learning process. We obtained a highly effective question classifier combining this knowledge with a bag-of-words approach by means of composite kernels. As the semantic information is acquired from unlabeled text, our system can be easily adapted to different languages and domains. We tested it on a parallel corpus of English and Spanish questions. © 2011 Springer-Verlag.
CITATION STYLE
Tomás, D., & Giuliano, C. (2011). Exploiting unlabeled data for question classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6716 LNCS, pp. 137–144). https://doi.org/10.1007/978-3-642-22327-3_13
Mendeley helps you to discover research relevant for your work.