Deep learning based question answering system in Bengali

52Citations
Citations of this article
86Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Recent advances in the field of natural language processing has improved state-of-the-art performances on many tasks including question answering for languages like English. Bengali language is ranked seventh and is spoken by about 300 million people all over the world. But due to lack of data and active research on QA similar progress has not been achieved for Bengali. Unlike English, there is no benchmark large scale QA dataset collected for Bengali, no pretrained language model that can be modified for Bengali question answering and no human baseline score for QA has been established either. In this work we use state-of-the-art transformer models to train QA system on a synthetic reading comprehension dataset translated from one of the most popular benchmark datasets in English called SQuAD 2.0. We collect a smaller human annotated QA dataset from Bengali Wikipedia with popular topics from Bangladeshi culture for evaluating our models. Finally, we compare our models with human children to set up a benchmark score using survey experiments.

Cite

CITATION STYLE

APA

Tahsin Mayeesha, T., Md Sarwar, A., & Rahman, R. M. (2021). Deep learning based question answering system in Bengali. Journal of Information and Telecommunication, 5(2), 145–178. https://doi.org/10.1080/24751839.2020.1833136

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free