The two-dimensional spectro-temporal modulation filtering concept of the auditory model [T. Chi, P. Ru, and S. A. Shamma, J. Acoust. Soc. Am. 118(2), 887–906 (2005)] is implemented on the Fourier spectrogram. The Fourier magnitude spectrogram is analyzed in terms of its joint spectro-temporal modulations, which embed the temporal dynamics and spectral structures. Instead of iterative projection methods, the overlap-and-add method is adopted to invert modified Fourier spectrograms back to sounds. The proposed framework not only provides a similar spectro-temporal analytical process for sounds as the auditory model but also produces synthesized sounds with better quality in a timely manner, which makes proposed framework feasible to human speech recognition (HSR) applications as well.
CITATION STYLE
Chi, T.-S., & Hsu, C.-C. (2011). Multiband analysis and synthesis of spectro-temporal modulations of Fourier spectrogram. The Journal of the Acoustical Society of America, 129(5), EL190–EL196. https://doi.org/10.1121/1.3565471
Mendeley helps you to discover research relevant for your work.