This work investigates spoken language understanding (SLU) systems in the scenario when the semantic information is extracted directly from the speech signal by means of a single end-to-end neural network model. Two SLU tasks are considered: named entity recognition (NER) and semantic slot filling (SF). For these tasks, in order to improve the model performance, we explore various techniques including speaker adaptation, a modification of the connectionist temporal classification (CTC) training criterion, and sequential pretraining.
CITATION STYLE
Tomashenko, N., Caubrière, A., Estève, Y., Laurent, A., & Morin, E. (2019). Recent Advances in End-to-End Spoken Language Understanding. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11816 LNAI, pp. 44–55). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-31372-2_4
Mendeley helps you to discover research relevant for your work.