TU Berlin

Quality and Usability Lab: Reviewed Conference Papers

Joint audio-video object localization using a recursive multi-state multi-sensor estimator
Citation key strobel2000a
Author Strobel, N. and Spors, Sascha and Rabenstein, R.
Title of Book International Conference on Acoustics, Speech and Signal Processing (ICASSP 2000)
Pages 3781–3784
Year 2000
ISBN 0-7803-6293-4
Abstract Object localization based on audio and video information is important for the analysis of dynamic scenes, such as video conferences or traffic situations. In this paper, we view the dynamic audio-video object localization problem as a joint recursive estimation problem. It is solved using a decentralized Kalman filter that fuses audio and video position estimates. To better account for different object maneuvers, multiple state-space equations are also incorporated. The result is a recursive multi-state multi-sensor estimator. Experiments show that it yields significantly improved joint position estimates compared to results achieved using an audio-only or a video-only system.
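
The fusion idea named in the abstract can be illustrated with a minimal sketch. This is not the paper's estimator. It assumes a scalar position, a random-walk motion model, a single state-space model (no maneuver switching), and two local filters that share the same predicted prior. All function names, variable names, and numeric values are hypothetical. Each local filter performs a standard Kalman update with its own measurement, and a fusion step combines the two local results in information (inverse-variance) form, subtracting the prior once because both local estimates contain it.

```python
# Illustrative sketch only: scalar decentralized fusion of an audio and a
# video position estimate. Models, names, and numbers are assumptions, not
# taken from the paper.

def predict(x, p, q):
    """Random-walk time update: estimate unchanged, variance grows by q."""
    return x, p + q

def local_update(x_pred, p_pred, z, r):
    """Standard scalar Kalman measurement update for one sensor."""
    k = p_pred / (p_pred + r)          # Kalman gain
    x = x_pred + k * (z - x_pred)      # corrected estimate
    p = (1.0 - k) * p_pred             # corrected variance
    return x, p

def fuse(x_pred, p_pred, local_estimates):
    """Combine local estimates that all started from the same prior.

    In information form the local contributions add up; the shared prior
    is counted once per local filter, so it is subtracted (n - 1) times.
    """
    n = len(local_estimates)
    info = sum(1.0 / p for _, p in local_estimates) - (n - 1) / p_pred
    info_state = (sum(x / p for x, p in local_estimates)
                  - (n - 1) * x_pred / p_pred)
    p_fused = 1.0 / info
    return info_state * p_fused, p_fused

def step(x, p, z_audio, r_audio, z_video, r_video, q):
    """One recursion: predict, update locally per sensor, then fuse."""
    x_pred, p_pred = predict(x, p, q)
    audio = local_update(x_pred, p_pred, z_audio, r_audio)
    video = local_update(x_pred, p_pred, z_video, r_video)
    return fuse(x_pred, p_pred, [audio, video])

if __name__ == "__main__":
    # Hypothetical values: audio is assumed noisier than video here.
    x, p = 0.0, 0.5
    x, p = step(x, p, z_audio=1.4, r_audio=4.0, z_video=1.0, r_video=1.0, q=0.5)
    print(x, p)
```

Under these assumptions the fused result matches what a single filter would obtain by processing both measurements itself, and the fused variance is smaller than either local variance, which is the qualitative benefit the abstract reports for joint estimation.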
