Non-contact speech recovery technology using a 24 GHz portable auditory radar and webcam

Yue Ma; Hong Hong; Hui Li; Heng Zhao; Yusheng Li; Li Sun; Chen Gu; Xiaohua Zhu

Journal ArticleOPEN ACCESS

Non-contact speech recovery technology using a 24 GHz portable auditory radar and webcam

Remote Sensing (2020) 12(4)

DOI: 10.3390/rs12040653

7Citations

10Readers

Abstract

Language has been one of the most effective ways of human communication and information exchange. To solve the problem of non-contact robust speech recognition, recovery, and surveillance, this paper presents a speech recovery technology based on a 24 GHz portable auditory radar and webcam. The continuous-wave auditory radar is utilized to extract the vocal vibration signal, and the webcam is used to obtain the fitted formant frequency. The traditional formant speech synthesizer is selected to synthesize and recover speech, using the vocal vibration signal as the sound source excitation and the fitted formant frequency as the vocal tract resonance characteristics. Experiments on reading single English characters and words are carried out. Using microphone records as a reference, the effectiveness of the proposed speech recovery technology is verified. Mean opinion scores show a relatively high consistency between the synthesized speech and original acoustic speech.

Author supplied keywords

Cite

CITATION STYLE

APA

Ma, Y., Hong, H., Li, H., Zhao, H., Li, Y., Sun, L., … Zhu, X. (2020). Non-contact speech recovery technology using a 24 GHz portable auditory radar and webcam. Remote Sensing, 12(4). https://doi.org/10.3390/rs12040653

Non-contact speech recovery technology using a 24 GHz portable auditory radar and webcam

Abstract

Author supplied keywords

Cite

Register to see more suggestions