Blind bandwidth extension of speech based on LPCNet

Schmidt K, Edler B (2021)


Publication Type: Conference contribution

Publication year: 2021

Publisher: European Signal Processing Conference, EUSIPCO

Book Volume: 2021-January

Pages Range: 426-430

Conference Proceedings Title: European Signal Processing Conference

Event location: Amsterdam NL

ISBN: 9789082797053

DOI: 10.23919/Eusipco47968.2020.9287465

Abstract

A blind bandwidth extension is presented which improves the perceived quality of 4 kHz speech by artificially extending the speech's frequency range to 8 kHz. Based on the source-filter model of the human speech production, the speech signal is decomposed into spectral envelope and excitation signal and each of them is extrapolated separately. With this decomposition, good perceptual quality can be achieved while keeping the computational complexity low. The focus of this work is in the generation of an excitation signal with and autoregressive model that calculates a distribution for each audio sample conditioned on previous samples. This is achieved with a deep neural network following the architecture of LPCNet [1]. A listening test shows that it significantly improves the perceived quality of bandlimited speech. The system has an algorithmic delay of 30 ms and can be applied in state-of-the-art speech and audio codecs.

Authors with CRIS profile

How to cite

APA:

Schmidt, K., & Edler, B. (2021). Blind bandwidth extension of speech based on LPCNet. In European Signal Processing Conference (pp. 426-430). Amsterdam, NL: European Signal Processing Conference, EUSIPCO.

MLA:

Schmidt, Konstantin, and Bernd Edler. "Blind bandwidth extension of speech based on LPCNet." Proceedings of the 28th European Signal Processing Conference, EUSIPCO 2020, Amsterdam European Signal Processing Conference, EUSIPCO, 2021. 426-430.

BibTeX: Download