HAVRUS corpus: High-speed recordings of audio-visual Russian speech

Vasilisa Verkhodanova; Alexander Ronzhin; Irina Kipyatkova; Denis Ivanko; Alexey Karpov; Miloš Železnỳ

Conference Proceedings

HAVRUS corpus: High-speed recordings of audio-visual Russian speech

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9811 LNCS 338-345

DOI: 10.1007/978-3-319-43958-7_40

26Citations

10Readers

Get full text

Abstract

In this paper we present a software-hardware complex for collection of audio-visual speech databases with a high-speed camera and a dynamic microphone. We describe the architecture of the developed software as well as some details of the collected database of Russian audio-visual speech HAVRUS. The developed software provides synchronization and fusion of both audio and video channels and makes allowance for and processes the natural factor of human speech - the asynchrony of audio and visual speech modalities. The collected corpus comprises recordings of 20 native speakers of Russian and is meant for further research and experiments on audio-visual Russian speech recognition.

Author supplied keywords

Cite

CITATION STYLE

APA

Verkhodanova, V., Ronzhin, A., Kipyatkova, I., Ivanko, D., Karpov, A., & Železnỳ, M. (2016). HAVRUS corpus: High-speed recordings of audio-visual Russian speech. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9811 LNCS, pp. 338–345). Springer Verlag. https://doi.org/10.1007/978-3-319-43958-7_40

HAVRUS corpus: High-speed recordings of audio-visual Russian speech

Abstract

Author supplied keywords

Cite

Register to see more suggestions