The paper presents a speaker detection system based on phoneme specific hidden Markov model in combination with Gaussian mixture model. Our motivation stems from the fact that the phoneme specific HMM system can model temporal variations and provides possibility to ponder the scores of specific phonemes as well as efficient pruning. The performance of the system has been evaluated on speech database which contains utterances in Serbian from 250 speakers (1 0 of them being the target speakers). The proposed model is compared to a system based on Gaussian mixture model - universal background model, and showed a significant improvement in detection performance.
CITATION STYLE
Pakoci, E., Jakovljević, N., Popović, B., Mišković, D., & Pekar, D. (2014). Speaker detection using phoneme specific hidden markov models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8773, pp. 410–417). Springer Verlag. https://doi.org/10.1007/978-3-319-11581-8_51
Mendeley helps you to discover research relevant for your work.