Abstract
Test-question/answer retrieval task has raised higher requirements in terms of accuracy, coverage and semantic understanding. We design a cascade model with two-stage training processes: The first stage uses 41,532 user test-question click records and 207,660 unclick records, which are collected from a designed test-question-answer experimental platform, to generate 200,000 pairwise training dataset to train a deep learning model, which could improve generalization ability. The second stage combines the output of the first stage with structural knowledge as new features to train a logistic regression for selecting the results from the candidates with higher accuracy, the training dataset is generated by manually annotating 20,000 test-question samples. The structural knowledge is also manually extracted from the samples for generating a small knowledge graph, and on this condition, we design knowledge features. Experimental results show that the proposed model outperforms the state-of-the-art algorithms, among which the cascading model contributes 3% improvement and the knowledge features contribute 1% improvement.
Author supplied keywords
Cite
CITATION STYLE
Wei, Y., Li, D., & Madden, A. D. (2019). A knowledge based two-stage cascade model for test-question/answer retrieval. In Multi Conference on Computer Science and Information Systems, MCCSIS 2019 - Proceedings of the International Conferences on Big Data Analytics, Data Mining and Computational Intelligence 2019 and Theory and Practice in Modern Computing 2019 (pp. 23–30). IADIS Press. https://doi.org/10.33965/bigdaci2019_201907l003
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.