Abstract
Health-related speech datasets are often small and varied in focus. This makes it difficult to leverage them to effectively support healthcare goals. Robust transfer of linguistic features across different datasets orbiting the same goal carries potential to address this concern. To test this hypothesis, we experiment with domain adaptation (DA) techniques on heterogeneous spoken language data to evaluate generalizability across diverse datasets for a common task: dementia detection. We find that adapted models exhibit better performance across conversational and task-oriented datasets. The feature-augmented DA method achieves a 22% increase in accuracy adapting from a conversational to task-specific dataset compared to a jointly trained baseline. This suggests promising capacity of these techniques to allow for productive use of disparate data for a complex spoken language healthcare task.
Cite
CITATION STYLE
Farzana, S., & Parde, N. (2023). Towards Domain-Agnostic and Domain-Adaptive Dementia Detection from Spoken Language. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 11965–11978). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-long.668
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.