A comparable study on PNCC in speaker diarization for meetings

1Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In speaker diarization, the most commonly used speaker feature is MFCC, which is also most commonly used speech feature in speech recognition. The newly proposed Power Normalized Cepstrum Coefficients (PNCC) achieves impressive improvement in noisy speech recognition compare to MFCC. It consequently expects a proof for speaker diarization use. In this paper, PNCC is evaluated against MFCC in a meeting domain speaker diarization system. The Diarization Error Rate (DER) shows no positive results with PNCC. This is possibly because of their inhibition in high frequency spectrum which is believed to represents the characteristics of human's voice. An initial model training material select strategy is proposed and used in the speaker diarization system in this work. © 2010 IEEE.

Author supplied keywords

Cite

CITATION STYLE

APA

Li, Q., Fan, Q., Xiao, Y., & Ye, W. (2010). A comparable study on PNCC in speaker diarization for meetings. In Proceedings - 2010 1st ACIS International Symposium on Cryptography, and Network Security, Data Mining and Knowledge Discovery, E-Commerce and Its Applications, and Embedded Systems, CDEE 2010 (pp. 157–160). IEEE Computer Society. https://doi.org/10.1109/CDEE.2010.40

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free