Deep Learning in Speaker Recognition

Omid Ghahabi; Pooyan Safari; Javier Hernando

Book Chapter

Deep Learning in Speaker Recognition

Springer Verlag, (2020), 145-169

DOI: 10.1007/978-3-030-31764-5_6

1Citations

4Readers

Get full text

Abstract

It is supposed in Speaker Recognition (SR) that everyone has a unique voice which could be used as an identity rather than or in addition to other identities such as fingerprint, face, or iris. Even though steps have been taken long ago to apply neural networks in SR, recent advances in computing hardware, new deep learning (DL) architectures and training methods, and access to a large amount of training data have inspired the research community to make use of DL as in a large variety of other signal processing applications. In this chapter, the traditional principle techniques in SR are first briefly reviewed and the potential signal processing aspects of these techniques which can be improved by DL are addressed. Then the recent most successful DL architectures used in SR are introduced and some illustrative experiments from the authors are included.

Author supplied keywords

Cite

CITATION STYLE

APA

Ghahabi, O., Safari, P., & Hernando, J. (2020). Deep Learning in Speaker Recognition. In Studies in Computational Intelligence (Vol. 867, pp. 145–169). Springer Verlag. https://doi.org/10.1007/978-3-030-31764-5_6

Deep Learning in Speaker Recognition

Abstract

Author supplied keywords

Cite

Register to see more suggestions