A progressive model to enable continual learning for semantic slot filling

Yilin Shen; Xiangyu Zeng; Hongxia Jin

Conference Proceedings, Open Access

EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference (2019) 1279-1284

DOI: 10.18653/v1/D19-1126

26 Citations

105 Readers

Abstract

Semantic slot filling is one of the major tasks in spoken language understanding (SLU). After a slot filling model is trained on pre-collected data, it is crucial to keep improving the model after deployment so that it learns users' new expressions. As the amount of data grows, it becomes infeasible either to store all of the data and repeatedly retrain the model on it, or to fine-tune the model only on new data without forgetting old expressions. In this paper, we introduce a novel progressive slot filling model, ProgModel. ProgModel consists of a novel context gate that transfers previously learned knowledge to a small expanded component, while enabling this new component to be trained quickly on new data. As such, ProgModel learns new knowledge using only the new data at each update, while preserving previously learned expressions. Our experiments show that ProgModel needs much less training time and a smaller model size to outperform various model fine-tuning competitors by up to 4.24% and 3.03% on two benchmark datasets.
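The general idea of keeping old parameters fixed while a small gated component learns from new data can be illustrated with a toy sketch. This is not the ProgModel architecture: the abstract does not specify the form of the context gate, and the paper's model is a neural sequence labeler. The class name `ProgressiveBinaryTagger`, the sigmoid gate, the linear scorer, and the toy feature vectors below are all assumptions made purely for illustration.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


class ProgressiveBinaryTagger:
    """Toy stand-in (hypothetical): frozen old weights plus a small gated expansion."""

    def __init__(self, frozen_weights):
        self.frozen = tuple(frozen_weights)  # previously learned knowledge, never updated
        n = len(self.frozen)
        self.gate = [0.0] * n  # assumed gate parameters (trainable)
        self.new = [0.0] * n   # expanded component (trainable)

    def logit(self, x):
        old = dot(self.frozen, x)
        g = sigmoid(dot(self.gate, x))  # how much old knowledge to pass through
        return g * old + dot(self.new, x)

    def predict_proba(self, x):
        return sigmoid(self.logit(x))

    def train_step(self, x, y, lr=0.5):
        """One SGD step on log loss; only the gate and the new component change."""
        old = dot(self.frozen, x)
        g = sigmoid(dot(self.gate, x))
        err = sigmoid(g * old + dot(self.new, x)) - y  # d(loss)/d(logit)
        for i, xi in enumerate(x):
            self.new[i] -= lr * err * xi
            self.gate[i] -= lr * err * old * g * (1.0 - g) * xi


def log_loss(model, data):
    total = 0.0
    for x, y in data:
        p = min(max(model.predict_proba(x), 1e-12), 1.0 - 1e-12)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(data)


# Made-up "new data": binary labels over three toy features.
old_weights = [2.0, -1.0, 0.0]
model = ProgressiveBinaryTagger(old_weights)
new_data = [
    ([0.0, 0.0, 1.0], 1),
    ([0.0, 1.0, 1.0], 1),
    ([1.0, 0.0, 0.0], 1),
    ([0.0, 1.0, 0.0], 0),
]

before = log_loss(model, new_data)
for _ in range(200):
    for x, y in new_data:
        model.train_step(x, y)
after = log_loss(model, new_data)
```

In this sketch only `gate` and `new` are updated, so the old weights are untouched by training on new data. Whether and how that carries over to the paper's context gate is something the abstract alone does not establish.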

Citation (APA)

Shen, Y., Zeng, X., & Jin, H. (2019). A progressive model to enable continual learning for semantic slot filling. In EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference (pp. 1279–1284). Association for Computational Linguistics. https://doi.org/10.18653/v1/D19-1126
