Toward an infrastructure for data-driven multimodal communication research

Steen FF, Hougaard A, Joo J, Olza I, Cánovas CP, Pleshakova A, Ray S, Uhrig P, Valenzuela J, Woźny J, Turner M (2018)

Publication Language: English

Publication Type: Journal article

Publication year: 2018


Book Volume: 4

Article Number: 20170041

Journal Issue: 1

DOI: 10.1515/lingvan-2017-0041


Research into the multimodal dimensions of human communication faces a set of distinctive methodological challenges. Collecting the datasets is resource-intensive, analysis often lacks peer validation, and the absence of shared datasets makes it difficult to develop standards. External validity is hampered by small datasets, yet large datasets are intractable. Red Hen Lab spearheads an international infrastructure for data-driven multimodal communication research, facilitating an integrated cross-disciplinary workflow. Linguists, communication scholars, statisticians, and computer scientists work together to develop research questions, annotate training sets, and develop pattern discovery and machine learning tools that handle vast collections of multimodal data, beyond the dreams of previous researchers. This infrastructure makes it possible for researchers at multiple sites to work in real time in transdisciplinary teams. We review the vision, progress, and prospects of this research consortium.

How to cite


Steen, F.F., Hougaard, A., Joo, J., Olza, I., Cánovas, C.P., Pleshakova, A., ... Turner, M. (2018). Toward an infrastructure for data-driven multimodal communication research. Linguistics Vanguard, 4(1).


Steen, Francis F., et al. "Toward an infrastructure for data-driven multimodal communication research." Linguistics Vanguard 4.1 (2018).
