Applications such as videoconferencing, automatic scene analysis, or security surveillance involving acoustic sources can benefit from object localization within a complex scene. Many single-sensor techniques already exist for this purpose. They are based, e.g., on microphone arrays, video cameras, or range sensors. Since all of these sensors have their specific strengths and weaknesses, it is often advantageous to combine information from various sensor modalities to arrive at more robust position estimates.

This chapter presents a joint audio-video signal processing methodology for object localization and tracking. The approach is based on a decentralized Kalman filter structure modified such that different sensor measurement models can be incorporated. Such a situation is typical for combined audio-video sensing, since different coordinate systems are usually used for the camera system and the microphone array.

At first, the decentralized estimation algorithm is presented. Then a speaker localization example is discussed. Finally, some estimation results are shown.
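The core idea of fusing sensors with different measurement models in a Kalman framework can be illustrated with a minimal sketch. The following is not the chapter's decentralized algorithm; it is a simplified sequential multi-sensor update for a single linear Kalman filter, assuming a constant-velocity motion model, a hypothetical "camera" sensor observing the (x, y) position, and a hypothetical "microphone array" sensor observing only the x coordinate. All matrices and noise levels below are illustrative assumptions.

```python
import numpy as np

# Sketch: one Kalman filter updated sequentially with two different
# sensor measurement models (illustrative, not the chapter's method).

dt = 1.0
F = np.array([[1, 0, dt, 0],          # constant-velocity state transition
              [0, 1, 0, dt],          # state: [x, y, vx, vy]
              [0, 0, 1,  0],
              [0, 0, 0,  1]], dtype=float)
Q = 0.01 * np.eye(4)                  # process noise covariance (assumed)

H_cam = np.array([[1, 0, 0, 0],       # "camera": observes (x, y)
                  [0, 1, 0, 0]], dtype=float)
R_cam = 0.1 * np.eye(2)               # camera measurement noise (assumed)

H_mic = np.array([[1, 0, 0, 0]], dtype=float)  # "mic array": observes x only
R_mic = np.array([[0.2]])             # mic measurement noise (assumed)

def predict(x, P):
    """Time update with the shared motion model."""
    return F @ x, F @ P @ F.T + Q

def update(x, P, z, H, R):
    """Measurement update; H and R differ per sensor modality."""
    S = H @ P @ H.T + R
    K = P @ H.T @ np.linalg.inv(S)    # Kalman gain
    x = x + K @ (z - H @ x)
    P = (np.eye(len(x)) - K @ H) @ P
    return x, P

rng = np.random.default_rng(0)
x_true = np.array([0.0, 0.0, 1.0, 0.5])
x_est = np.zeros(4)
P = np.eye(4)

for _ in range(20):
    x_true = F @ x_true                              # simulate true motion
    x_est, P = predict(x_est, P)
    z_cam = H_cam @ x_true + rng.normal(0, 0.3, 2)   # noisy camera reading
    x_est, P = update(x_est, P, z_cam, H_cam, R_cam)
    z_mic = H_mic @ x_true + rng.normal(0, 0.4, 1)   # noisy mic reading
    x_est, P = update(x_est, P, z_mic, H_mic, R_mic)

err = np.linalg.norm(x_est[:2] - x_true[:2])
print(f"position error after 20 steps: {err:.3f}")
```

In a truly decentralized structure, each sensor would run its own local filter and a fusion stage would combine the local estimates; the sequential updates above collapse that into one filter for brevity.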
Strobel, N., Spors, S., & Rabenstein, R. (2001). Joint Audio-Video Signal Processing for Object Localization and Tracking (pp. 203–225). https://doi.org/10.1007/978-3-662-04619-7_10