Joint Audio-Video Object Tracking

Conference contribution

Publication Details

Author(s): Spors S, Rabenstein R, Strobel N
Publisher: Institute of Electrical and Electronic Engineers
Publication year: 2001
Volume: 18
Journal issue: 1
Pages range: 22-31
ISSN: 1053-5888


A method for joint localization of a single object using an audio sensor and a video sensor were outlined. These localizers are best described using different coordinate systems as the audio position estimates are nonlinearly related to the state. Whereas video sensor system operates in Cartesian coordinates. The local Kalman filters consists of three main components: local Kalman filter at the microphone array; a local Kalman filter at the video camera; and a global Kalman filter at the fusion center. The loss of optimality is the price to pay for replacing the linear Kalman filter with an extended Kalman filter. Quantitative experiments have shown that such a strategy can improve object localization and tracking.

FAU Authors / FAU Editors

Rabenstein, Rudolf Prof. Dr.
Lehrstuhl für Multimediakommunikation und Signalverarbeitung

How to cite

Spors, S., Rabenstein, R., & Strobel, N. (2001). Joint Audio-Video Object Tracking. (pp. 22-31). Thessaloniki, GR: Institute of Electrical and Electronic Engineers.

Spors, Sascha, Rudolf Rabenstein, and Norbert Strobel. "Joint Audio-Video Object Tracking." Proceedings of the IEEE International Conference on Image Processing (ICIP), Thessaloniki Institute of Electrical and Electronic Engineers, 2001. 22-31.


Last updated on 2018-10-08 at 09:09