Robust feature combination for speech recognition using linear microphone array in a car

0Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

When speech recognition is performed in a car environment, there are two important robustness issues that should be taken into account. The first robustness is related to the noisy acoustic condition, and it has been one of the most popular research topics of in-vehicle speech recognition. In contrast, the second robustness, which is related to unstable calibration of the audio input, has not attracted much attention. Consequently, the performance of speech recognition would degrade greatly in a real application if the input device such as a microphone array is badly calibrated. We propose robust feature combination in the MFCC domain using speech inputs from a linear microphone array. It realizes robust (from both the noise and the calibration viewpoints) and practical speech recognition applications in car environments. Even a simple MFCC averaging approach is effective, and a new algorithm, hypothesis-based feature combination (HBFC), improves the performance. We also extend cepstral variance normalization as variance re-scaling, which makes the feature combination approach more robust. The advantages of the proposed algorithms are confirmed by the experiments using the data recorded in a moving car. © 2009 Springer US.

Cite

CITATION STYLE

APA

Obuchi, Y., & Hataoka, N. (2009). Robust feature combination for speech recognition using linear microphone array in a car. In In-Vehicle Corpus and Signal Processing for Driver Behavior (pp. 187–196). Springer Science and Business Media, LLC. https://doi.org/10.1007/978-0-387-79582-9_15

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free