This paper proposes a novel front-end for automatic spoken language recognition, based on the spectrogram representation of the speech signal and in the properties of the Fourier spectrum to detect global periodicity in an image. Local Phase Quantization (LPQ) texture descriptor was used to capture the spectrogram content. Results obtained for 30 seconds test signal duration have shown that this method is very promising for low cost language identification. The best performance is achieved when our proposed method is fused with the i-vector representation.
CITATION STYLE
Montalvo, A., Costa, Y. M. G., & Calvo, J. R. (2015). Language identification using spectrogram texture. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9423, pp. 543–550). Springer Verlag. https://doi.org/10.1007/978-3-319-25751-8_65
Mendeley helps you to discover research relevant for your work.