Discriminating speakers by their voices — A fusion based approach

Halim Sayoud; Siham Ouamour; Zohra Hamadache

Conference Proceedings

Discriminating speakers by their voices — A fusion based approach

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10458 LNAI 322-331

DOI: 10.1007/978-3-319-66429-3_31

0Citations

3Readers

Get full text

Abstract

The task of Speaker Discrimination (SD) consists in checking whether two speech segments belong to the same speaker or not. In this research field, it is often difficult to decide what could be the best classifier in terms of accuracy and robustness. For that purpose, we have implemented 9 classifiers: Support Vector Machines, Linear Discriminant Analysis, Multi-Layer Perceptron, Generalized Linear Model, Self Organizing Map, Adaboost, Second Order Statistical Measures, Linear Regression and Gaussian Mixture Models. Furthermore, a new fusion approach is proposed and experimented in speaker discrimination. Several experiments of speaker discrimination were conducted on Hub4 Broadcast-News with relatively short segments. The obtained results have shown that the best classifier is the SVM and that the proposed fusion approach is quite interesting since it provided the best performances at all.

Author supplied keywords

Cite

CITATION STYLE

APA

Sayoud, H., Ouamour, S., & Hamadache, Z. (2017). Discriminating speakers by their voices — A fusion based approach. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10458 LNAI, pp. 322–331). Springer Verlag. https://doi.org/10.1007/978-3-319-66429-3_31

Discriminating speakers by their voices — A fusion based approach

Abstract

Author supplied keywords

Cite

Register to see more suggestions