In this paper, we present a novel software framework for recording audio-visual speech corpora with a high-speed video camera (JAI Pulnix RMC-6740) and a dynamic microphone (Oktava MK-012) Architecture of the developed software framework for recording audio-visual Russian speech corpus is described. It provides synchronization and fusion of audio and video data captured by the independent sensors. The software automatically detects voice activity in audio signal and stores only speech fragments discarding non-informative signals. It takes into account and processes natural asynchrony of audio-visual speech modalities as well.
CITATION STYLE
Karpov, A., Kipyatkova, I., & Železný, M. (2014). A framework for recording audio-visual speech corpora with a microphone and a high-speed camera. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8773, pp. 50–57). Springer Verlag. https://doi.org/10.1007/978-3-319-11581-8_6
Mendeley helps you to discover research relevant for your work.