Are you looking at me, are you talking with me: Multimodal classification of the focus of attention

Batliner A, Nöth E, Hacker C (2006)


Publication Status: Published

Publication Type: Conference contribution, Conference Contribution

Publication year: 2006

Journal

Publisher: Springer-verlag

City/Town: Berlin, Heidelberg

Book Volume: 4188

Pages Range: 581-588

Conference Proceedings Title: Text, Speech and Dialogue. 9th International Conference, TSD 2006, Brno, Czech Republic, September 2006, Proceedings

Event location: Brno CZ

URI: http://www5.informatik.uni-erlangen.de/Forschung/Publikationen/2006/Hacker06-AYL.pdf

Abstract

Automatic dialogue systems get easily confused if speech is recognized which is not directed to the system. Besides noise or other people's conversation, even the user's utterance can cause difficulties when he is talking to someone else or to himself ("Off-Talk"). In this paper the automatic classification of the user's focus of attention is investigated. In the German SmartWeb project, a mobile device is used to get access to the semantic web. In this scenario, two modalities are provided speech and video signal. This makes it possible to classify whether a spoken request is addressed to the system or not: with the camera of the mobile device, the user's gaze direction is detected; in the speech signal, prosodic features are analyzed. Encouraging recognition rates of up to 93% are achieved in the speech-only condition. Further improvement is expected from the fusion of the two information sources.

Authors with CRIS profile

How to cite

APA:

Batliner, A., Nöth, E., & Hacker, C. (2006). Are you looking at me, are you talking with me: Multimodal classification of the focus of attention. In Sojka P., Kopecek I., Pala K. (Eds.), Text, Speech and Dialogue. 9th International Conference, TSD 2006, Brno, Czech Republic, September 2006, Proceedings (pp. 581-588). Brno, CZ: Berlin, Heidelberg: Springer-verlag.

MLA:

Batliner, Anton, Elmar Nöth, and Christian Hacker. "Are you looking at me, are you talking with me: Multimodal classification of the focus of attention." Proceedings of the 9th International Conference, TSD 2006, Brno Ed. Sojka P., Kopecek I., Pala K., Berlin, Heidelberg: Springer-verlag, 2006. 581-588.

BibTeX: Download