How well do computers solve math word problems? Large-scale dataset construction and evaluation

Danqing Huang; Shuming Shi; Chin Yew Lin; Jian Yin; Wei Ying Ma

Conference Proceedings

How well do computers solve math word problems? Large-scale dataset construction and evaluation

54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers (2016) 2 887-896

DOI: 10.18653/v1/p16-1084

131Citations

165Readers

Get full text

Abstract

Recently a few systems for automatically solving math word problems have reported promising results. However, the datasets used for evaluation have limitations in both scale and diversity. In this paper, we build a large-scale dataset which is more than 9 times the size of previous ones, and contains many more problem types. Problems in the dataset are semiautomatically obtained from community question-answering (CQA) web pages. A ranking SVM model is trained to automatically extract problem answers from the answer text provided by CQA users, which significantly reduces human annotation cost. Experiments conducted on the new dataset lead to interesting and surprising results.

Cite

CITATION STYLE

APA

Huang, D., Shi, S., Lin, C. Y., Yin, J., & Ma, W. Y. (2016). How well do computers solve math word problems? Large-scale dataset construction and evaluation. In 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers (Vol. 2, pp. 887–896). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p16-1084

How well do computers solve math word problems? Large-scale dataset construction and evaluation

Abstract

Cite

Register to see more suggestions