Speaker Diarization based on Black-Hole Entropy Fuzzy Clustering using Cepstral Features

  • et al.
N/ACitations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Speaker diarization is the process of identification of the speaker in an audio sequence. This paper proposed a speaker diarization method using the Black-hole entropy fuzzy clustering and multiple kernel weighted Mel frequency cepstral coefficient (MKMFCC) parameterization. Initially, the MKMFCC descriptor extracted the cepstral features from the input audio signal. These features are used for clustering the speakers as groups for which the BHEFC is used. The feature parameter uses the audio signal containing both the high and low energy frame for speaker indexing that resulted in accurate separation of speaker. The performance evaluation of the proposed speaker diarization system is analyzed using the measures, such as F-measure, diarization error rate, and false alarm rate. The proposed MKMFCC with BHEFC obtained a minimum diarization error rate of 0.2447, maximum F-measure of 0.8526 and minimum false alarm rate of 0.4299, respectively while changing the wavelength and obtained a minimum diarization error rate of 0.2447, maximum F-measure of 0.8526 and minimum false alarm rate of 0.4298 when compared to the existing methods for the change in the frame length.

Cite

CITATION STYLE

APA

Ramaiah, V. S., Rao, S. S., & Devaraju, V. S. N. K. (2020). Speaker Diarization based on Black-Hole Entropy Fuzzy Clustering using Cepstral Features. International Journal of Engineering and Advanced Technology, 9(4), 1055–1061. https://doi.org/10.35940/ijeat.d7832.049420

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free