Bio-inspired sparse representation of speech and audio using psychoacoustic adaptive matching pursuit

Alexey Petrovsky; Vadzim Herasimovich; Alexander Petrovsky

Conference Proceedings

Bio-inspired sparse representation of speech and audio using psychoacoustic adaptive matching pursuit

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9811 LNCS 156-164

DOI: 10.1007/978-3-319-43958-7_18

3Citations

2Readers

Get full text

Abstract

Current paper devoted to the sparse audio and speech signal modelling via the matching pursuit (MP) algorithm. Redundant dictionary of the time-frequency functions is constructed through the frame-based psychoacoustic optimized wavelet packet (WP) transform. Anthropomorphic adaptation of the time-frequency plan allows minimizing perceptual redundancy of the signal modelling. Psychoacoustic information at MP stage for the best atom selection from the dictionary is used. It improves algorithm performance in terms of human hearing system and computational complexity. Described signal model can be applied in many audio and speech processing tasks such as source separation, watermarking, classification and so on. Presented research focused on the signal encoding. Universal audio/speech coding algorithm that is suitable for the input signals with different sound content is proposed.

Author supplied keywords

Cite

CITATION STYLE

APA

Petrovsky, A., Herasimovich, V., & Petrovsky, A. (2016). Bio-inspired sparse representation of speech and audio using psychoacoustic adaptive matching pursuit. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9811 LNCS, pp. 156–164). Springer Verlag. https://doi.org/10.1007/978-3-319-43958-7_18

Bio-inspired sparse representation of speech and audio using psychoacoustic adaptive matching pursuit

Abstract

Author supplied keywords

Cite

Register to see more suggestions