An improved i-vector extraction algorithm for speaker verification

Abstract

In recent years, the i-vector framework has been shown to provide state-of-the-art performance in speaker verification. Each utterance is projected onto a total factor space and represented by a low-dimensional feature vector. Channel compensation techniques are then applied in this low-dimensional space, and most of them take sets of extracted i-vectors as input. By constructing between-class and within-class covariance matrices, these techniques attempt to minimize the within-class variance, which is caused mainly by channel effects, and to maximize the variance between speakers. In real-world applications, however, enrollment and test data from each user (or speaker) are typically scarce. Although session variability is widely thought to be caused mostly by channel effects, phonetic variability also contributes to it and still needs to be considered. In this paper we propose a new algorithm, which we term component reduction analysis (CRA), for extracting i-vectors from the total factor matrix. The new algorithm contributes to better modelling of session variability in the total factor space. We report results on the male English trials of the core condition of the NIST 2008 Speaker Recognition Evaluation (SRE) dataset. As measured by both the equal error rate and the minimum value of the NIST detection cost function, a 10–15% relative improvement is achieved over a traditional i-vector-based baseline system.
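
To make the covariance-based compensation step concrete, the following is a minimal sketch of how within-class and between-class scatter matrices are commonly computed from labelled i-vectors, as in standard linear discriminant analysis. It is not the CRA algorithm proposed in the paper, and the function names (`scatter_matrices`, `mean`, `outer`, `add_into`) and the toy 2-dimensional vectors are assumptions made for illustration only.

```python
from collections import defaultdict


def mean(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def outer(u, v):
    """Outer product u v^T as a list of rows."""
    return [[a * b for b in v] for a in u]


def add_into(acc, m, scale=1.0):
    """Accumulate scale * m into the matrix acc, in place."""
    for i, row in enumerate(m):
        for j, val in enumerate(row):
            acc[i][j] += scale * val


def scatter_matrices(ivectors, speaker_ids):
    """Return (S_w, S_b): within-speaker and between-speaker scatter.

    S_w sums (x - mu_s)(x - mu_s)^T over every vector x of every speaker s.
    S_b sums n_s (mu_s - mu)(mu_s - mu)^T over speakers, where mu is the
    global mean and n_s is the number of vectors of speaker s.
    """
    dim = len(ivectors[0])
    by_speaker = defaultdict(list)
    for vec, spk in zip(ivectors, speaker_ids):
        by_speaker[spk].append(vec)

    global_mean = mean(ivectors)
    s_w = [[0.0] * dim for _ in range(dim)]
    s_b = [[0.0] * dim for _ in range(dim)]
    for vecs in by_speaker.values():
        mu = mean(vecs)
        for v in vecs:
            d = [a - b for a, b in zip(v, mu)]
            add_into(s_w, outer(d, d))
        d = [a - b for a, b in zip(mu, global_mean)]
        add_into(s_b, outer(d, d), scale=len(vecs))
    return s_w, s_b


# Toy data (hypothetical): two speakers, two sessions each.
toy_ivectors = [[1.0, 0.0], [3.0, 0.0], [0.0, 4.0], [0.0, 6.0]]
toy_speakers = ["a", "a", "b", "b"]
S_w, S_b = scatter_matrices(toy_ivectors, toy_speakers)
```

A compensation method of this kind would then look for a projection that keeps directions with large `S_b` (speaker differences) while suppressing directions with large `S_w` (session and channel differences).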

Li, W., Fu, T., & Zhu, J. (2015). An improved i-vector extraction algorithm for speaker verification. EURASIP Journal on Audio, Speech, and Music Processing, 2015(1), 1–9. https://doi.org/10.1186/s13636-015-0061-x
