Deleforge A, Horaud R (2012)
Publication Language: English
Publication Type: Conference contribution
Publication year: 2012
Conference Proceedings Title: Proceedings of the seventh annual ACM/IEEE international conference on Human-Robot Interaction
ISBN: 978-1-4503-1063-5
Human-robot communication is often faced with the difficult problem of interpreting ambiguous auditory data. For example, the acoustic signals perceived by a humanoid with its on-board microphones contain a mix of sounds such as speech, music, electronic devices, all in the presence of attenuation and reverberations. In this paper we propose a novel method, based on a generative probabilistic model and on active binaural hearing, allowing a robot to robustly perform sound-source separation and localization. We show how interaural spectral cues can be used within a constrained mixture model specifically designed to capture the richness of the data gathered with two microphones mounted onto a human-like artificial head. We describe in detail a novel EM algorithm, we analyse its initialization, speed of convergence and complexity, and we assess its performance with both simulated and real data.
APA:
Deleforge, A., & Horaud, R. (2012). The cocktail party robot: Sound source separation and localisation with an active binaural head. In Proceedings of the seventh annual ACM/IEEE international conference on Human-Robot Interaction. Boston, US.
MLA:
Deleforge, Antoine, and Radu Horaud. "The cocktail party robot: Sound source separation and localisation with an active binaural head." Proceedings of the 7th ACM/IEEE International Conference on Human-Robot Interaction (HRI), Boston 2012.
BibTeX: Download