Reference-based Weak Supervision for Answer Sentence Selection using Web Data

Vivek Krishnamurthy; Thuy Vu; Alessandro Moschitti

Conference ProceedingsOPEN ACCESS

Reference-based Weak Supervision for Answer Sentence Selection using Web Data

Findings of the Association for Computational Linguistics, Findings of ACL: EMNLP 2021 (2021) 4294-4299

DOI: 10.18653/v1/2021.findings-emnlp.363

1Citations

44Readers

Abstract

Answer Sentence Selection (AS2) models are core components of efficient retrieval-based Question Answering (QA) systems. We present the Reference-based Weak Supervision (RWS), a fully automatic large-scale data pipeline that harvests high-quality weaklysupervised answer sentences from Web data, only requiring a question-reference pair as input. We evaluated the quality of the RWS-derived data by training TANDA models, which are the state of the art for AS2. Our results show that the data consistently bolsters TANDA on three different datasets. In particular, we set the new state of the art for AS2 to P@1=90.1%, and MAP=92.9%, on WikiQA. We record similar performance gains of RWS on a much larger dataset named Web-based Question Answering (WQA).

Cite

CITATION STYLE

APA

Krishnamurthy, V., Vu, T., & Moschitti, A. (2021). Reference-based Weak Supervision for Answer Sentence Selection using Web Data. In Findings of the Association for Computational Linguistics, Findings of ACL: EMNLP 2021 (pp. 4294–4299). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.findings-emnlp.363

Reference-based Weak Supervision for Answer Sentence Selection using Web Data

Abstract

Cite

Register to see more suggestions