Impact of prior channel information for speaker identification

C. Vaquero; N. Scheffer; S. Karajekar

Conference ProceedingsOPEN ACCESS

Impact of prior channel information for speaker identification

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 5558 LNCS 443-453

DOI: 10.1007/978-3-642-01793-3_46

1Citations

8Readers

Abstract

Joint factor analysis (JFA) has been very successful in speaker recognition but its success depends on the choice of development data. In this work, we apply JFA to a very diverse set of recording conditions and conversation modes in NIST 2008 SRE, showing that having channel matched development data will give improvements of about 50% in terms of Equal Error Rate against a Maximum a Posteriori (MAP) system, while not having it will not give significant improvement. To provide robustness to the system, we estimate eigenchannels in two ways. First, we estimate the eigenchannels separately for each condition and stack them. Second, we pool all the relevant development data and obtain a single estimate. Both techniques show good performance, but the former leads to lower performance when working with low-dimension channel subspaces, due to the correlation between those subspaces. © Springer-Verlag Berlin Heidelberg 2009.

Cite

CITATION STYLE

APA

Vaquero, C., Scheffer, N., & Karajekar, S. (2009). Impact of prior channel information for speaker identification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5558 LNCS, pp. 443–453). https://doi.org/10.1007/978-3-642-01793-3_46

Impact of prior channel information for speaker identification

Abstract

Cite

Register to see more suggestions