Deep Learning in Speaker Recognition

1Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

It is supposed in Speaker Recognition (SR) that everyone has a unique voice which could be used as an identity rather than or in addition to other identities such as fingerprint, face, or iris. Even though steps have been taken long ago to apply neural networks in SR, recent advances in computing hardware, new deep learning (DL) architectures and training methods, and access to a large amount of training data have inspired the research community to make use of DL as in a large variety of other signal processing applications. In this chapter, the traditional principle techniques in SR are first briefly reviewed and the potential signal processing aspects of these techniques which can be improved by DL are addressed. Then the recent most successful DL architectures used in SR are introduced and some illustrative experiments from the authors are included.

Cite

CITATION STYLE

APA

Ghahabi, O., Safari, P., & Hernando, J. (2020). Deep Learning in Speaker Recognition. In Studies in Computational Intelligence (Vol. 867, pp. 145–169). Springer Verlag. https://doi.org/10.1007/978-3-030-31764-5_6

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free