Combined LCMV-TRINICON beamforming for separating multiple speech sources in noisy and reverberant environments

Markovich-Golan S, Gannot S, Kellermann W (2017)

Publication Language: English

Publication Type: Journal article

Publication year: 2017

Journal

IEEE/ACM Transactions on Audio, Speech and Language Processing Institute of Electrical and Electronics Engineers (IEEE)

Book Volume: 25

Pages Range: 320-332

Journal Issue: 2

DOI: 10.1109/TASLP.2016.2633806

Abstract

he problem of source separation using an array of microphones in reverberant and noisy conditions is addressed. We consider applying the well-known linearly constrained minimum variance (LCMV) beamformer (BF) for extracting individual speakers. Constraints are defined using relative transfer functions (RTFs) for the sources, which are ratios of acoustic transfer functions (ATFs) between any microphone and a reference microphone. The latter are usually estimated by methods that rely on single-talk time segments where only a single source is active and on reliable knowledge of the source activity. Two novel algorithms for estimation of RTFs using the “Triple N” ICA for convolutive mixtures (TRINICON) framework are proposed, not resorting to the usually unavailable source activity pattern. The first algorithm estimates the RTFs of the sources by applying multiple two-channel geometrically constrained (GC) TRINICON units, where approximate direction of arrival information for the sources is utilized for ensuring convergence to the desired solution. The GC-TRINICON is applied to all microphone pairs using a common reference microphone. In the second algorithm, we propose to estimate RTFs iteratively using GC-TRINICON, where instead of using a fixed reference microphone as before, we suggest to use the output signals of LCMV-BFs from the previous iteration as spatially processed references with improved signal-to-interference-and-noise ratio. For both algorithms, a simple detection of noise-only time segments is required for estimating the covariance matrix of noise and interference. We conduct an experimental study in which the performance of the proposed methods is confirmed and compared to corresponding supervised methods.

Authors with CRIS profile

Walter Kellermann Professur für Signalverarbeitung

How to cite

APA:

Markovich-Golan, S., Gannot, S., & Kellermann, W. (2017). Combined LCMV-TRINICON beamforming for separating multiple speech sources in noisy and reverberant environments. IEEE/ACM Transactions on Audio, Speech and Language Processing, 25(2), 320-332. https://dx.doi.org/10.1109/TASLP.2016.2633806

MLA:

Markovich-Golan, Shmulik, S. Gannot, and Walter Kellermann. "Combined LCMV-TRINICON beamforming for separating multiple speech sources in noisy and reverberant environments." IEEE/ACM Transactions on Audio, Speech and Language Processing 25.2 (2017): 320-332.

BibTeX: Download