MPEG Standards for Compressed Representation of Immersive Audio

Quackenbush SR, Herre J (2021)


Publication Language: English

Publication Type: Journal article

Publication year: 2021

Journal

Book Volume: 109

Pages Range: 1578-1589

Journal Issue: 9

DOI: 10.1109/JPROC.2021.3075390

Abstract

The term "immersive audio" is frequently used to describe an audio experience that provides the listener the sensation of being fully immersed or "present" in a sound scene. This can be achieved via different presentation modes, such as surround sound (several loudspeakers horizontally arranged around the listener), 3D audio (with loudspeakers at, above, and below listener ear level), and binaural audio to headphones. This article provides an overview of two recent standards that support the bitrate-efficient carriage of high-quality immersive sound. The first is MPEG-H 3D audio, which is a versatile standard that supports multiple immersive sound signal formats (channels, objects, and higher order ambisonics) and is now being adopted in broadcast and streaming applications. The second is MPEG-I immersive audio, an extension of 3D audio, currently under development, which is targeted for virtual and augmented reality applications. This will support rendering of fully user-interactive immersive sound for three degrees of user movement [three degrees of freedom (3DoF)], i.e., yaw, pitch, and roll head movement, and for six degrees of user movement [six degrees of freedom (6DoF)], i.e., 3DoF plus translational x, y, and z user position movements.

Authors with CRIS profile

Involved external institutions

How to cite

APA:

Quackenbush, S.R., & Herre, J. (2021). MPEG Standards for Compressed Representation of Immersive Audio. Proceedings of the IEEE, 109(9), 1578-1589. https://dx.doi.org/10.1109/JPROC.2021.3075390

MLA:

Quackenbush, Schuyler R., and Jürgen Herre. "MPEG Standards for Compressed Representation of Immersive Audio." Proceedings of the IEEE 109.9 (2021): 1578-1589.

BibTeX: Download