Bio-inspired sparse representation of speech and audio using psychoacoustic adaptive matching pursuit

3Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Current paper devoted to the sparse audio and speech signal modelling via the matching pursuit (MP) algorithm. Redundant dictionary of the time-frequency functions is constructed through the frame-based psychoacoustic optimized wavelet packet (WP) transform. Anthropomorphic adaptation of the time-frequency plan allows minimizing perceptual redundancy of the signal modelling. Psychoacoustic information at MP stage for the best atom selection from the dictionary is used. It improves algorithm performance in terms of human hearing system and computational complexity. Described signal model can be applied in many audio and speech processing tasks such as source separation, watermarking, classification and so on. Presented research focused on the signal encoding. Universal audio/speech coding algorithm that is suitable for the input signals with different sound content is proposed.

Cite

CITATION STYLE

APA

Petrovsky, A., Herasimovich, V., & Petrovsky, A. (2016). Bio-inspired sparse representation of speech and audio using psychoacoustic adaptive matching pursuit. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9811 LNCS, pp. 156–164). Springer Verlag. https://doi.org/10.1007/978-3-319-43958-7_18

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free