Preferred levels for background ducking to produce esthetically pleasing audio for TV with clear speech

Torcoli M, Freke-Morin A, Paulus J, Simon C, Shirley B (2019)


Publication Type: Journal article

Publication year: 2019

Journal

Book Volume: 67

Pages Range: 1003-1011

Journal Issue: 12

DOI: 10.17743/jaes.2019.0052

Abstract

In audio production, background ducking facilitates speech intelligibility while allowing the background to fulfill its purpose, e.g., to create ambience, set the mood, or convey semantic cues. Technical details for recommended ducking practices are not currently documented in the literature. Hence, we first analyzed common practices found in TV documentaries. Second, a listening test investigated the preferences of 22 normal-hearing participants on the Loudness Difference (LD) between commentary and background during ducking. Highly personal preferences were observed, highlighting the importance of object-based personalization. Statistically significant difference was found between non-expert and expert listeners. On average, non-experts preferred LDs that were 4 LU higher than the ones preferred by experts. A statistically significant difference was also found between Commentary over Music (CoM) and Commentary over Ambience (CoA). Based on the test results, we recommend at least 10 LU difference for CoM and at least 15 LU for CoA. Moreover, a computational method based on the Binaural Distortion-Weighted Glimpse Proportion (BiDWGP) was found to match the median preferred LD for each item with good accuracy (mean absolute error = 1.97 LU ± 2.50).

Additional Organisation(s)

Involved external institutions

How to cite

APA:

Torcoli, M., Freke-Morin, A., Paulus, J., Simon, C., & Shirley, B. (2019). Preferred levels for background ducking to produce esthetically pleasing audio for TV with clear speech. Journal of the Audio Engineering Society, 67(12), 1003-1011. https://doi.org/10.17743/jaes.2019.0052

MLA:

Torcoli, Matted, et al. "Preferred levels for background ducking to produce esthetically pleasing audio for TV with clear speech." Journal of the Audio Engineering Society 67.12 (2019): 1003-1011.

BibTeX: Download