MWP-BERT: Numeracy-Augmented Pre-training for MathWord Problem Solving

65Citations
Citations of this article
66Readers
Mendeley users who have this article in their library.

Abstract

Math word problem (MWP) solving faces a dilemma in number representation learning. In order to avoid the number representation issue and reduce the search space of feasible solutions, existing works striving forMWPsolving usually replace real numbers with symbolic placeholders to focus on logic reasoning. However, different from common symbolic reasoning tasks like program synthesis and knowledge graph reasoning, MWP solving has extra requirements in numerical reasoning. In other words, instead of the number value itself, it is the reusable numerical property that matters more in numerical reasoning. Therefore, we argue that injecting numerical properties into symbolic placeholders with contextualized representation learning schema can provide a way out of the dilemma in the number representation issue here. In this work, we introduce this idea to the popular pre-training language model (PLM) techniques and build MWP-BERT, an effective contextual number representation PLM.We demonstrate the effectiveness of our MWP-BERT on MWP solving and several MWP-specific understanding tasks on both English and Chinese benchmarks.

Cite

CITATION STYLE

APA

Liang, Z., Zhang, J., Wang, L., Qin, W., Lan, Y., Shao, J., & Zhang, X. (2022). MWP-BERT: Numeracy-Augmented Pre-training for MathWord Problem Solving. In Findings of the Association for Computational Linguistics: NAACL 2022 - Findings (pp. 997–1009). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.findings-naacl.74

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free