An audio-visual database for evaluating person tracking algorithms

Krindis M, Stamou G, Teutsch H, Spors S, Nikolaidis N, Rabenstein R, Pitas I (2005)


Publication Language: English

Publication Status: Published

Publication Type: Conference contribution

Publication year: 2005

Publisher: IEEE

Article Number: 1415385

Event location: Philadelphia, PA, US

ISBN: 9780780388741

DOI: 10.1109/ICASSP.2005.1415385

Abstract

This paper presents an audio-visual database that can be used as a reference database for testing and evaluating video, audio, or joint audio-visual person tracking algorithms, as well as speaker localization methods. Additional possible uses include the testing of face detection and pose estimation algorithms. The database includes a number of different scenes, ranging from simple ones to complex ones that can challenge existing algorithms. They include different subjects with appearances that can cause problems for video tracking algorithms (e.g. facial features such as beards and glasses), optimal and artificially created sub-optimal lighting conditions, subject movement along simple as well as random motion trajectories, different distances from the camera/microphones, and occlusion. The database incorporates ground truth data (3-D position in time) originating from a commercially available 4-camera infrared (IR) tracking system. Examples of how the database can be used to evaluate video and audio tracking algorithms are also provided. © 2005 IEEE.

How to cite

APA:

Krindis, M., Stamou, G., Teutsch, H., Spors, S., Nikolaidis, N., Rabenstein, R., & Pitas, I. (2005). An audio-visual database for evaluating person tracking algorithms. In Proceedings of the 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05. Philadelphia, PA, US: IEEE.

MLA:

Krindis, M., et al. "An audio-visual database for evaluating person tracking algorithms." Proceedings of the 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05, Philadelphia, PA, IEEE, 2005.