Abstract
Question classification (QC) is a prime constituent of anautomated question answering system. The work presented here demonstrates that a combination of multiple models achieves better classification performance than those obtained with existing individual models for the QC task in Bengali.We have exploited state-of-the-art multiple model combination techniques, i.e., ensemble, stacking and voting, to increase QC accuracy. Lexical, syntactic and semantic features of Bengali questions are used for four well-known classifiers, namely Naïve Bayes, kernel Naïve Bayes, Rule Induction and Decision Tree, which serve as our base learners. Single-layer question-class taxonomy with 8 coarse-grainedclasses is extended to two-layer taxonomy by adding 69 fine-grained classes. We carried out the experiments both on single-layer and two-layer taxonomies. Experimental results confirmed that classifier combination approaches outperform single-classifier classification approaches by 4.02% for coarse-grained question classes. Overall, the stacking approach produces the best results for fine-grained classification and achieves 87.79% of accuracy. The approach presented here could be used in other Indo-Aryan or Indic languages to develop a question answering system.
Author supplied keywords
Cite
CITATION STYLE
Banerjee, S., Naskar, S. K., Rosso, P., & Bndyopadhyay, S. (2019). Classifier combination approach for question classification for Bengali question answering system. Sadhana - Academy Proceedings in Engineering Sciences, 44(12). https://doi.org/10.1007/s12046-019-1224-8
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.