Schmidt K, Edler B (2021)
Publication Type: Conference contribution
Publication year: 2021
Publisher: European Signal Processing Conference, EUSIPCO
Book Volume: 2021-January
Pages Range: 426-430
Conference Proceedings Title: European Signal Processing Conference
ISBN: 9789082797053
DOI: 10.23919/Eusipco47968.2020.9287465
A blind bandwidth extension is presented which improves the perceived quality of 4 kHz speech by artificially extending the speech's frequency range to 8 kHz. Based on the source-filter model of the human speech production, the speech signal is decomposed into spectral envelope and excitation signal and each of them is extrapolated separately. With this decomposition, good perceptual quality can be achieved while keeping the computational complexity low. The focus of this work is in the generation of an excitation signal with and autoregressive model that calculates a distribution for each audio sample conditioned on previous samples. This is achieved with a deep neural network following the architecture of LPCNet [1]. A listening test shows that it significantly improves the perceived quality of bandlimited speech. The system has an algorithmic delay of 30 ms and can be applied in state-of-the-art speech and audio codecs.
APA:
Schmidt, K., & Edler, B. (2021). Blind bandwidth extension of speech based on LPCNet. In European Signal Processing Conference (pp. 426-430). Amsterdam, NL: European Signal Processing Conference, EUSIPCO.
MLA:
Schmidt, Konstantin, and Bernd Edler. "Blind bandwidth extension of speech based on LPCNet." Proceedings of the 28th European Signal Processing Conference, EUSIPCO 2020, Amsterdam European Signal Processing Conference, EUSIPCO, 2021. 426-430.
BibTeX: Download