Bidirectional Transformer Based Multi-Task Learning for Natural Language Understanding

Abstract

We propose a multi-task learning (MTL) framework for natural language understanding tasks such as sentiment and topic classification. A bidirectional transformer architecture generates encoded representations of the input, which are then passed to task-specific classification layers. The MTL framework trains a set of different tasks in parallel, acting as a form of additional regularization that improves the trained model's generalization compared with training each task individually. We introduce a task-specific auxiliary problem, constructed with the k-means clustering algorithm, that is trained in parallel with the main tasks to reduce the model's generalization error on them; POS tagging is also used as an auxiliary task. In addition, we train on multiple benchmark classification datasets in parallel to improve the effectiveness of our bidirectional transformer network across all of them. Our proposed MTL transformer network improves the state-of-the-art accuracy on the Movie Review (MR), AG News, and Stanford Sentiment Treebank (SST-2) corpora by 6%, 1.4%, and 3.3%, respectively.
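The paper itself does not publish code, but the described setup (a shared bidirectional transformer encoder feeding task-specific classification heads, plus an auxiliary head trained on k-means cluster assignments of the encoded representations) can be sketched in a few lines of PyTorch. Everything below is an illustrative assumption, not the authors' implementation: the class name, hyperparameters, mean pooling, and the use of scikit-learn's KMeans for the auxiliary targets are all choices made for the sketch.

```python
# Minimal sketch (NOT the authors' code) of the MTL setup described above:
# a shared bidirectional transformer encoder with one head per main task,
# plus an auxiliary head supervised by k-means cluster ids.
import torch
import torch.nn as nn
from sklearn.cluster import KMeans


class SharedEncoderMTL(nn.Module):
    """Shared bidirectional transformer encoder with task-specific heads."""

    def __init__(self, vocab_size, d_model=256, nhead=4, num_layers=2,
                 task_num_classes=(2, 4), num_clusters=8):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        # Unmasked self-attention attends in both directions, which is what
        # makes the encoding bidirectional.
        self.encoder = nn.TransformerEncoder(layer, num_layers)
        # One classification head per main task (e.g. sentiment, topic).
        self.task_heads = nn.ModuleList(
            nn.Linear(d_model, n) for n in task_num_classes)
        # Auxiliary head predicting k-means cluster ids (a regularizer).
        self.cluster_head = nn.Linear(d_model, num_clusters)

    def forward(self, token_ids, task_id):
        h = self.encoder(self.embed(token_ids))  # (batch, seq, d_model)
        pooled = h.mean(dim=1)                   # simple mean pooling
        return (self.task_heads[task_id](pooled),
                self.cluster_head(pooled),
                pooled)


# Toy training step: derive auxiliary targets by clustering the pooled
# encodings with k-means, then sum the main and auxiliary losses.
model = SharedEncoderMTL(vocab_size=10000)
tokens = torch.randint(0, 10000, (16, 32))   # batch of 16 toy sequences
labels = torch.randint(0, 2, (16,))          # main-task (binary) labels
logits, aux_logits, pooled = model(tokens, task_id=0)
aux_targets = torch.from_numpy(
    KMeans(n_clusters=8, n_init=10).fit_predict(
        pooled.detach().numpy())).long()
loss = (nn.functional.cross_entropy(logits, labels)
        + nn.functional.cross_entropy(aux_logits, aux_targets))
loss.backward()
```

In this sketch the auxiliary loss simply shares the encoder with the main task, which is the regularization effect the abstract attributes to MTL; training on several datasets in parallel would amount to cycling batches through different `task_id` heads.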

Citation (APA)

Tripathi, S., Singh, C., Kumar, A., Pandey, C., & Jain, N. (2019). Bidirectional Transformer Based Multi-Task Learning for Natural Language Understanding. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11608 LNCS, pp. 54–65). Springer Verlag. https://doi.org/10.1007/978-3-030-23281-8_5
