Haubner T, Brendel A, Kellermann W (2023)
Publication Language: English
Publication Type: Journal article, Original article
Publication year: 2023
Book Volume: 32
Page Range: 1-12
The attenuation of acoustic loudspeaker echoes remains one of the open challenges in achieving pleasant full-duplex hands-free speech communication. In many modern signal enhancement interfaces, this problem is addressed by a linear acoustic echo canceler, which subtracts a loudspeaker echo estimate from the recorded microphone signal. To obtain precise echo estimates, the parameters of the echo canceler, i.e., the filter coefficients, need to be estimated quickly and precisely from the observed loudspeaker and microphone signals. This requires a sophisticated adaptation control that copes with high-power double-talk and rapidly tracks time-varying acoustic environments, which are often encountered with portable devices. In this paper, we address this problem by end-to-end deep learning. In particular, we propose to infer the step-size for a least-mean-squares frequency-domain adaptive filter update with a deep neural network (DNN). Two different step-size inference approaches are investigated: broadband approaches, which use a single DNN to jointly infer step-sizes for all frequency bands, and narrowband methods, which use an individual DNN per frequency band. The discussion of the benefits and disadvantages of both approaches leads to a novel hybrid approach, which shows improved echo cancellation while requiring only small DNN architectures. Furthermore, we investigate the effect of different loss functions, signal feature vectors, and DNN output layer architectures on the echo cancellation performance, from which we obtain valuable insights into the general design and functionality of DNN-based adaptation control algorithms.
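For orientation, the kind of update the abstract refers to can be sketched as a frequency-domain adaptive filter whose per-bin step-size comes from a separate function. This is a minimal illustration and not the authors' implementation: the function `infer_step_size` is a hypothetical placeholder that uses a fixed power-normalized rule where the paper uses a DNN, and the names `fdaf_echo_canceler`, `mu0`, and `lam` as well as the overlap-save block structure are assumptions made for this example.

```python
import numpy as np

def infer_step_size(X, E, psd, mu0=0.3, eps=1e-8):
    """Hypothetical stand-in for the step-size DNN (assumption, not from the paper).
    Returns one step-size per frequency bin via simple power normalization.
    X and E are passed only to indicate features a learned model might use."""
    return mu0 / (psd + eps)

def fdaf_echo_canceler(x, y, L=64, mu0=0.3, lam=0.9):
    """Overlap-save frequency-domain LMS echo canceler.
    x: loudspeaker signal, y: microphone signal, L: filter length and block shift."""
    M = 2 * L                              # FFT length
    W = np.zeros(L + 1, dtype=complex)     # filter estimate in the rfft domain
    psd = None                             # recursive loudspeaker power estimate
    x_buf = np.zeros(M)                    # last 2L loudspeaker samples
    e = np.zeros(len(x))                   # echo-reduced output signal
    for b in range(len(x) // L):
        blk = slice(b * L, (b + 1) * L)
        x_buf = np.concatenate([x_buf[L:], x[blk]])
        X = np.fft.rfft(x_buf)
        # Echo estimate: only the last L samples are a valid linear convolution.
        y_hat = np.fft.irfft(W * X, n=M)[L:]
        e[blk] = y[blk] - y_hat
        E = np.fft.rfft(np.concatenate([np.zeros(L), e[blk]]))
        power = np.abs(X) ** 2
        psd = power if psd is None else lam * psd + (1 - lam) * power
        mu = infer_step_size(X, E, psd, mu0)   # the paper infers this with a DNN
        # Gradient constraint: keep only the first L time-domain coefficients.
        grad = np.fft.irfft(mu * np.conj(X) * E, n=M)
        grad[L:] = 0.0
        W = W + np.fft.rfft(grad)
    return e, W
```

Because the placeholder treats every bin independently, it resembles the narrowband setting described in the abstract; a broadband variant would compute all step-sizes jointly from features of all frequency bands.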
Haubner, T., Brendel, A., & Kellermann, W. (2023). End-to-End Deep Learning-Based Adaptation Control for Linear Acoustic Echo Cancellation. IEEE/ACM Transactions on Audio, Speech and Language Processing, 32, 1-12. https://dx.doi.org/10.1109/TASLP.2023.3325923
Haubner, Thomas, Andreas Brendel, and Walter Kellermann. "End-to-End Deep Learning-Based Adaptation Control for Linear Acoustic Echo Cancellation." IEEE/ACM Transactions on Audio, Speech and Language Processing 32 (2023): 1-12.