Improved Psychoacoustic Model for Efficient Perceptual Audio Codecs

Disch S, van de Par S, Niedermeier A, Burdiel E, Ceberio Berasategui A, Edler B (2018)


Publication Type: Conference contribution

Publication year: 2018

Conference Proceedings Title: 145th Audio Engineering Society Convention

Event location: New York (NY)

Journal Issue: Paper No. 10029

URI: http://www.aes.org/e-lib/browse.cfm?elib=19755

Abstract

Since early perceptual audio coders such as mp3, the underlying psychoacoustic model that controls the encoding process has not undergone many dramatic changes. Meanwhile, modern audio coders have been equipped with semi-parametric or parametric coding tools such as audio bandwidth extension. Thereby, the initial psychoacoustic model used in a perceptual coder, just considering added quantization noise, became partly unsuitable. We propose the use of an improved psychoacoustic excitation model based on an existing model proposed by Dau et al. in 1997. This modulation-based model is essentially independent from the input waveform by calculating an internal auditory representation. Using the example of MPEG-H 3D Audio and its semi-parametric Intelligent Gap Filling (IGF) tool, we demonstrate that we can successfully control the IGF parameter selection process to achieve overall improved perceptual quality.

Authors with CRIS profile

Involved external institutions

How to cite

APA:

Disch, S., van de Par, S., Niedermeier, A., Burdiel, E., Ceberio Berasategui, A., & Edler, B. (2018). Improved Psychoacoustic Model for Efficient Perceptual Audio Codecs. In 145th Audio Engineering Society Convention. New York (NY).

MLA:

Disch, Sascha, et al. "Improved Psychoacoustic Model for Efficient Perceptual Audio Codecs." Proceedings of the 145th Audio Engineering Society Convention, New York (NY) 2018.

BibTeX: Download