Multi-channel i-vector combination for robust speaker verification in multi-room domestic environments

2Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.

Abstract

In this work we address the speaker verification task in domestic environments where multiple rooms are monitored by a set of distributed microphones. In particular, we focus on the mismatch between the training of the total variability feature extraction hyper-parameters, the enrolment stage, which occurs at a fixed position in the home, and the test phase which could happen in any location of the apartment. Building upon a previous work, where a position independent multi-channel verification system was introduced, we investigate different i-vector combination strategies to attenuate the effects of the above mentioned mismatch sources. The proposed methods implicitly select the microphones in the room where the speaker is, without any knowledge about the speaker position. An experimental analysis on a simulated multi-channel multi-room reverberant data-set shows that the proposed solutions are robust against changes in the speaker position and orientation, achieving performance close to an upper-bound based on knowledge about the speaker location.

Cite

CITATION STYLE

APA

Brutti, A., & Abad, A. (2016). Multi-channel i-vector combination for robust speaker verification in multi-room domestic environments. In Odyssey 2016: Speaker and Language Recognition Workshop (pp. 252–258). International Speech Communication Association. https://doi.org/10.21437/Odyssey.2016-36

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free