Narrowband multi-source direction-of-arrival estimation in the spherical harmonic domain

  • Hafezi S
  • Moore A
  • Naylor P
0Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.

Abstract

A conventional approach to wideband multi-source (MS) direction-of-arrival (DOA) estimation is to perform single source (SS) DOA estimation in time-frequency (TF) bins for which a SS assumption is valid. Such methods use the W-disjoint orthogonality (WDO) assumption due to the speech sparseness. As the number of sources increases, the chance of violating the WDO assumption increases. As shown in the challenging scenarios with multiple simultaneously active sources over a short period of time masking each other, it is possible for a strongly masked source (due to inconsistency of activity or quietness) to be rarely dominant in a TF bin. SS-based DOA estimators fail in the detection or accurate localization of masked sources in such scenarios. Two analytical approaches are proposed for narrowband DOA estimation based on the MS assumption in a bin in the spherical harmonic domain. In the first approach, eigenvalue decomposition is used to decompose a MS scenario into multiple SS scenarios, and a SS-based analytical DOA estimation is performed on each. The second approach analytically estimates two DOAs per bin assuming the presence of two active sources per bin. The evaluation validates the improvement to double accuracy and robustness to sensor noise compared to the baseline methods.

Cite

CITATION STYLE

APA

Hafezi, S., Moore, A. H., & Naylor, P. A. (2021). Narrowband multi-source direction-of-arrival estimation in the spherical harmonic domain. The Journal of the Acoustical Society of America, 149(4), 2292–2303. https://doi.org/10.1121/10.0004214

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free