Voice recognition and marking using mel-frequency cepstral coefficients

7Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.

Abstract

A real-time voice recognition and marking system was developed in this study to automatically identify different voices of speakers. A microphone array was installed for audio reception. Pre-emphasis, framing and Hamming window, fast Fourier transform, mel-frequency, and mel-frequency cepstral coefficients with processing times of 0.001, 0.305, 0.205, 0.049, and 0.546 s, respectively, were used in the system. The total processing time was less than 1.5 s. Unique eigenvalues were obtained for each sound. The results indicated that the proposed system, which is an example of intelligent recording, can be used to automatically record speech in meetings or during classes.

Cite

CITATION STYLE

APA

Sheu, J. S., & Chen, C. W. (2020). Voice recognition and marking using mel-frequency cepstral coefficients. Sensors and Materials, 32(10), 3209–3220. https://doi.org/10.18494/SAM.2020.2860

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free