Speech analytics for medical applications

Isabel Trancoso; Joana Correia; Francisco Teixeira; Bhiksha Raj; Alberto Abad

Conference Proceedings

Lecture Notes in Computer Science (2018) 11107 LNAI 26-37

DOI: 10.1007/978-3-030-00794-2_3

Abstract

Speech has the potential to provide a rich bio-marker for health, allowing a non-invasive route to early diagnosis and monitoring of a range of conditions related to human physiology and cognition. With the rise of speech-related machine learning applications over the last decade, there has been growing interest in developing speech-based tools that perform non-invasive diagnosis. This talk covers two aspects of this trend. The first is the collection of large in-the-wild multimodal datasets in which the speech of the subject is affected by certain medical conditions. Our mining effort has focused on video blogs (vlogs), and explores audio, video, text and metadata cues in order to retrieve vlogs that feature a single speaker who, at some point, admits that he or she is currently affected by a given disease. The second aspect is patient privacy. In this context, we explore recent developments in cryptography, in particular in Fully Homomorphic Encryption, to develop an encrypted version of a neural network trained with unencrypted data, in order to produce encrypted predictions of health-related labels. As a proof of concept, we have selected two target conditions, cold and depression, to show our results and discuss these two aspects.

Trancoso, I., Correia, J., Teixeira, F., Raj, B., & Abad, A. (2018). Speech analytics for medical applications. In Lecture Notes in Computer Science (Vol. 11107 LNAI, pp. 26–37). Springer Verlag. https://doi.org/10.1007/978-3-030-00794-2_3
