WOZ acoustic data collection for interactive TV

Journal article
(Original article)


Publication Details

Author(s): Brutti A, Cristoforetti L, Kellermann W, Marquardt L, Omologo M
Journal: Language Resources and Evaluation
Publisher: Springer Verlag (Germany)
Publication year: 2010
Volume: 44
Journal issue: 3
Pages range: 205-219
ISSN: 1574-020X


Abstract


This paper describes a multichannel acoustic data collection recorded under the European DICIT project, during Wizard of Oz (WOZ) experiments carried out at FAU and FBK-irst laboratories. The application of interest in DICIT is a distant-talking interface for control of interactive TV working in a typical living room, with many interfering devices. The objective of the experiments was to collect a database supporting efficient development and tuning of acoustic processing algorithms for signal enhancement. In DICIT, techniques for sound source localization, multichannel acoustic echo cancellation, blind source separation, speech activity detection, speaker identification and verification as well as beamforming are combined to achieve a maximum possible reduction of the user speech impairments typical of distant-talking interfaces. The collected database permitted to simulate at preliminary stage a realistic scenario and to tailor the involved algorithms to the observed user behaviors. In order to match the project requirements, the WOZ experiments were recorded in three languages: English, German and Italian. Besides the user inputs, the database also contains non-speech related acoustic events, room impulse response measurements and video data, the latter used to compute three-dimensional positions of each subject. Sessions were manually transcribed and segmented at word level, introducing also specific labels for acoustic events. © 2010 Springer Science+Business Media B.V.



FAU Authors / FAU Editors

Kellermann, Walter Prof. Dr.-Ing.
Professur für Nachrichtentechnik


External institutions
Fondazione Bruno Kessler (FBK) (früher: ITC-irst)


How to cite

APA:
Brutti, A., Cristoforetti, L., Kellermann, W., Marquardt, L., & Omologo, M. (2010). WOZ acoustic data collection for interactive TV. Language Resources and Evaluation, 44(3), 205-219. https://dx.doi.org/10.1007/s10579-010-9116-x

MLA:
Brutti, Alessio, et al. "WOZ acoustic data collection for interactive TV." Language Resources and Evaluation 44.3 (2010): 205-219.

BibTeX: 

Last updated on 2018-08-08 at 15:08