Abstract
The performance of speaker identification process degrades in reverberant environments, as reverberation leads to clear physical effects on the perceived signals. This paper investigates the effect of room reverberation on the identification rate. However, various reverberant environments are simulated, and the impulse response is convolved with dry speech signals. The reverberant speech database is used by the identification engine within the train and the test phases. Then, statistical identification technique using the Gaussian Mixture Model (GMM) is implemented. Three types of features, Mel-Frequency Cepstrum Coefficients (MFCC), Preceptual Linear Predictive Cepstrum Coefficients (PLPCC) and Relative Spectral Preceptual Linear Predictive Cepstrum Coefficients (RASTA-PLPCC) are extracted. Various types of features are integrated and used for the classification problem. Finally, the performance of the recognition process is evaluated while varying the duration of the train and the test signals, the features used for the classification problem, and the room reverberation. A series of physical measures that correlate with various attributes of the sound perceived in rooms, such as the reverberation time T60, the clarity index C80, the definition D, are calculated. Then, their effect on the identification rate is investigated. © 2013 Acoustical Society of America.
Cite
CITATION STYLE
Korany, N. (2013). Speaker identification in reverberant environments. In Proceedings of Meetings on Acoustics (Vol. 19). https://doi.org/10.1121/1.4799472
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.