Fast and accurate deep bidirectional language representations for unsupervised learning

7Citations
Citations of this article
159Readers
Mendeley users who have this article in their library.

Abstract

Even though BERT has achieved successful performance improvements in various supervised learning tasks, BERT is still limited by repetitive inferences on unsupervised tasks for the computation of contextual language representations. To resolve this limitation, we propose a novel deep bidirectional language model called a Transformer-based Text Autoencoder (T-TA). The T-TA computes contextual language representations without repetition and displays the benefits of a deep bidirectional architecture, such as that of BERT. In computation time experiments in a CPU environment, the proposed T-TA performs over six times faster than the BERT-like model on a reranking task and twelve times faster on a semantic similarity task. Furthermore, the T-TA shows competitive or even better accuracies than those of BERT on the above tasks. Code is available at https://github.com/joongbo/tta.

Cite

CITATION STYLE

APA

Shin, J., Lee, Y., Yoon, S., & Jung, K. (2020). Fast and accurate deep bidirectional language representations for unsupervised learning. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 823–835). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.acl-main.76

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free