Gabor frames and deep scattering networks in audio processing

Abstract

This paper introduces Gabor scattering, a feature extractor based on Gabor frames and Mallat's scattering transform. Specific properties of Gabor scattering are studied using a simple signal model for audio signals. It is shown that each layer exhibits specific invariances to certain signal characteristics. Furthermore, deformation stability of the coefficient vector generated by the feature extractor is derived using a decoupling technique that exploits the contractivity of general scattering networks. Deformations are introduced as changes in spectral shape and frequency modulation. The theoretical results are illustrated by numerical examples and experiments. Evaluation on a synthetic and a "real" dataset gives numerical evidence that the invariances encoded by the Gabor scattering transform lead to higher performance than using the Gabor transform alone, especially when few training samples are available.
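
To make the layered construction more concrete, the following is a minimal two-layer sketch in Python, not the authors' implementation. It assumes a Gaussian-windowed short-time Fourier transform as the Gabor transform, the modulus as nonlinearity, and a plain time average as the output-generating step. The function names (`gabor_transform`, `gabor_scattering`) and all window and hop sizes are hypothetical choices made for illustration.

```python
import numpy as np


def gabor_transform(x, win_len, hop):
    """Gaussian-windowed STFT of a 1-D signal.

    Returns a complex array of shape (win_len // 2 + 1, n_frames).
    The window width (sigma = win_len / 6) is an illustrative assumption.
    """
    n = np.arange(win_len)
    sigma = win_len / 6.0
    window = np.exp(-0.5 * ((n - (win_len - 1) / 2.0) / sigma) ** 2)
    n_frames = 1 + (len(x) - win_len) // hop
    # Cut the signal into overlapping frames and apply the window to each.
    frames = np.stack(
        [x[i * hop: i * hop + win_len] * window for i in range(n_frames)],
        axis=1,
    )
    return np.fft.rfft(frames, axis=0)


def gabor_scattering(x, win1=64, hop1=16, win2=16, hop2=4):
    """Two-layer scattering-style feature extractor (illustrative sketch).

    Layer 1: modulus of the Gabor transform of the signal.
    Layer 2: modulus of the Gabor transform of each layer-1 frequency channel.
    Outputs: time averages of each layer (assumed stand-in for a low-pass
    output-generating filter).
    """
    layer1 = np.abs(gabor_transform(x, win1, hop1))   # (n_freq1, n_frames1)
    out1 = layer1.mean(axis=1)                        # (n_freq1,)
    layer2 = np.stack(
        [np.abs(gabor_transform(channel, win2, hop2)) for channel in layer1],
        axis=0,
    )                                                 # (n_freq1, n_freq2, n_frames2)
    out2 = layer2.mean(axis=2)                        # (n_freq1, n_freq2)
    return out1, out2
```

Under these assumptions, calling `gabor_scattering(x)` on a signal of 1024 samples yields a first-layer output with one value per frequency channel and a second-layer output that describes how each channel's envelope varies over time.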

Cite

Bammer, R., Dörfler, M., & Harar, P. (2019). Gabor frames and deep scattering networks in audio processing. Axioms, 8(4). https://doi.org/10.3390/axioms8040106
