SoDA: On-device Conversational Slot Extraction

Abstract

We propose a novel on-device neural sequence labeling model that uses embedding-free projections and character information to construct compact word representations, and learns a sequence model by combining a bidirectional LSTM with self-attention and a CRF. Unlike typical dialog models that rely on huge, complex neural network architectures and large-scale pre-trained Transformers to achieve state-of-the-art results, our method achieves results comparable to BERT and even outperforms its smaller variant, DistilBERT, on conversational slot extraction tasks. Our method is faster than BERT models while achieving a significant reduction in model size: it requires 135x and 81x fewer parameters than BERT and DistilBERT, respectively. We conduct experiments on multiple conversational datasets and show significant improvements over existing methods, including recent on-device models. Experimental results and ablation studies also show that our neural models preserve the tiny memory footprint necessary to operate on smart devices while still maintaining high performance.
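
The abstract describes the pipeline only at a high level. The sketch below is a hypothetical PyTorch rendering of that pipeline, not the authors' implementation: the hashing scheme, layer sizes, tag count, and all names (hash_projection, CharEncoder, SodaStyleTagger) are assumptions made for illustration, and a CRF layer (for example, from the pytorch-crf package) would replace the greedy argmax decoding shown at the end.

```python
# Hypothetical sketch of a SoDA-style slot tagger: embedding-free word
# projections via hashing, a character CNN, a BiLSTM with self-attention,
# and per-token tag emissions. Sizes and names are illustrative assumptions.
import zlib
import torch
import torch.nn as nn

PROJ_DIM, CHAR_DIM, HIDDEN, NUM_TAGS = 64, 16, 96, 9  # assumed sizes

def hash_projection(token: str, dim: int = PROJ_DIM) -> torch.Tensor:
    """Embedding-free word representation: derive a fixed {-1, +1} vector
    from salted CRC32 hashes of the token, so no embedding table is stored."""
    bits = [1.0 if zlib.crc32(f"{i}:{token}".encode()) & 1 else -1.0
            for i in range(dim)]
    return torch.tensor(bits)

class CharEncoder(nn.Module):
    """Character-level information: a small char embedding plus a 1-D
    convolution, max-pooled over the characters of each word."""
    def __init__(self, vocab: int = 256, dim: int = CHAR_DIM):
        super().__init__()
        self.emb = nn.Embedding(vocab, dim)
        self.conv = nn.Conv1d(dim, dim, kernel_size=3, padding=1)

    def forward(self, word: str) -> torch.Tensor:
        ids = torch.tensor([min(ord(c), 255) for c in word])
        x = self.emb(ids).t().unsqueeze(0)               # (1, dim, chars)
        return torch.relu(self.conv(x)).max(dim=2).values.squeeze(0)

class SodaStyleTagger(nn.Module):
    """BiLSTM over [projection; char] word vectors, followed by
    self-attention and a linear emission layer over slot tags."""
    def __init__(self):
        super().__init__()
        self.chars = CharEncoder()
        self.lstm = nn.LSTM(PROJ_DIM + CHAR_DIM, HIDDEN // 2,
                            bidirectional=True, batch_first=True)
        self.attn = nn.MultiheadAttention(HIDDEN, num_heads=4,
                                          batch_first=True)
        self.emit = nn.Linear(HIDDEN, NUM_TAGS)

    def forward(self, tokens: list[str]) -> torch.Tensor:
        words = torch.stack([torch.cat([hash_projection(t), self.chars(t)])
                             for t in tokens]).unsqueeze(0)  # (1, T, in)
        h, _ = self.lstm(words)
        a, _ = self.attn(h, h, h)        # self-attention over LSTM states
        return self.emit(a).squeeze(0)   # (T, NUM_TAGS) tag emissions

tagger = SodaStyleTagger()
scores = tagger("book a table for two tonight".split())
print(scores.argmax(dim=-1))  # greedy tags; a CRF would decode jointly
```

The point of the embedding-free projection is that the word vector is computed from hashes on the fly, so no embedding table contributes to the parameter count, which is how projection-based models of this kind keep their on-device footprint small.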

Cite

APA

Ravi, S., & Kozareva, Z. (2021). SoDA: On-device Conversational Slot Extraction. In SIGDIAL 2021 - 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference (pp. 56–65). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.sigdial-1.7
