Enhancing pre-trained language representations with rich knowledge for machine reading comprehension

Abstract

Machine reading comprehension (MRC) is a crucial and challenging task in NLP. Recently, pre-trained language models (LMs), especially BERT, have achieved remarkable success, setting new state-of-the-art results in MRC. In this work, we investigate the potential of leveraging external knowledge bases (KBs) to further improve BERT for MRC. We introduce KT-NET, which employs an attention mechanism to adaptively select the desired knowledge from KBs, and then fuses the selected knowledge with BERT to enable context- and knowledge-aware predictions. We believe this combines the merits of both deep LMs and curated KBs towards better MRC. Experimental results indicate that KT-NET offers significant and consistent improvements over BERT, outperforming competitive baselines on the ReCoRD and SQuAD1.1 benchmarks. Notably, it ranks first on the ReCoRD leaderboard and is also the best single model on the SQuAD1.1 leaderboard at the time of submission (March 4th, 2019).
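
The select-then-fuse idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify the attention form or the fusion operator, so the example assumes plain dot-product attention over candidate KB embeddings and fusion by concatenation. The function names (`select_knowledge`, `fuse`) and the toy vectors are hypothetical, and the token vector and KB embeddings are assumed to share the same dimension.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def select_knowledge(token_vec, kb_vecs):
    """Attention-weighted sum of candidate KB embeddings for one token.

    Assumption: dot-product scoring; the real model may score differently.
    """
    if not kb_vecs:
        # No candidates retrieved: fall back to a zero knowledge vector.
        return [0.0] * len(token_vec)
    weights = softmax([dot(token_vec, k) for k in kb_vecs])
    dim = len(kb_vecs[0])
    return [sum(w * k[i] for w, k in zip(weights, kb_vecs)) for i in range(dim)]

def fuse(token_vec, kb_vecs):
    """Assumed fusion: concatenate the token vector with its selected knowledge."""
    return list(token_vec) + select_knowledge(token_vec, kb_vecs)

# Toy stand-ins for a BERT token representation and two KB concept embeddings.
token = [1.0, 0.0]
candidates = [[1.0, 0.0], [0.0, 1.0]]
fused = fuse(token, candidates)
```

Here `fused` holds the original two token dimensions followed by two knowledge dimensions, and the candidate more similar to the token receives the larger attention weight. A downstream prediction layer would then operate on this knowledge-enriched vector instead of the token vector alone.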

Cite

Yang, A., Wang, Q., Liu, J., Liu, K., Lyu, Y., Wu, H., … Li, S. (2020). Enhancing pre-trained language representations with rich knowledge for machine reading comprehension. In ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (pp. 2346–2357). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p19-1226
