State of the Art speaker recognition methods are mainly based on GMM/UBM based supervector paradigm. Recently, a simple representation of speech based on local binary decision taken on each acoustic frame have been proposed, allowing to represent a speech excerpt as a binary matrix. This article is based on a similar approach. A new temporal block representation of the binary transformed data as well as three simple algorithms to obtain an efficient similarity measure are proposed. The experimental results show a better robustness of the proposed approach and a similar or better overall performance over classical approaches. © 2012 Springer-Verlag.
CITATION STYLE
Hernández-Sierra, G., Bonastre, J. F., & Calvo De Lara, J. R. (2012). Speaker recognition using a binary representation and specificities models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7441 LNCS, pp. 732–739). https://doi.org/10.1007/978-3-642-33275-3_90
Mendeley helps you to discover research relevant for your work.