DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect

1Citations
Citations of this article
15Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Recently, binaural audio synthesis (BAS) has emerged as a promising research field for its applications in augmented and virtual realities. Binaural audio helps users orient themselves and establish immersion by providing the brain with interaural time differences reflecting spatial information. However, existing BAS methods are limited in terms of phase estimation, which is crucial for spatial hearing. In this paper, we propose the DopplerBAS method to explicitly address the Doppler effect of the moving sound source. Specifically, we calculate the radial relative velocity of the moving speaker in spherical coordinates, which further guides the synthesis of binaural audio. This simple method introduces no additional hyper-parameters and does not modify the loss functions, and is plug-and-play: it scales well to different types of backbones. DopperBAS distinctly improves the representative WarpNet and BinauralGrad backbones in the phase error metric and reaches a new state of the art (SOTA): 0.780 (versus the current SOTA 0.807). Experiments and ablation studies demonstrate the effectiveness of our method.

Cite

CITATION STYLE

APA

Liu, J., Ye, Z., Chen, Q., Zheng, S., Wang, W., Zhang, Q., & Zhao, Z. (2023). DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 11905–11912). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.753

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free