Unsupervised Domain Adaptation via Principal Subspace Projection for Acoustic Scene Classification

Mezza AI, Habets E, Müller M, Sarti A (2022)


Publication Type: Journal article

Publication year: 2022

Journal

DOI: 10.1007/s11265-021-01720-9

Abstract

Existing acoustic scene classification (ASC) systems often fail to generalize across different recording devices. In this work, we present an unsupervised domain adaptation method for ASC based on data standardization and feature projection. First, log-amplitude spectro-temporal features are standardized in a band-wise fashion over samples and time. Then, both source- and target-domain samples are projected onto the span of the principal eigenvectors of the covariance matrix of source-domain training data. The proposed method, being devised as a preprocessing procedure, is independent of the choice of the classification algorithm and can be readily applied to any ASC model at a minimal cost. Using the TUT Urban Acoustic Scenes 2018 Mobile Development dataset, we show that the proposed method can provide an absolute increment of over 10% compared to state-of-the-art unsupervised adaptation methods. Furthermore, the proposed method consistently outperforms a recent ASC model that ranked first in Task 1-A of the 2021 DCASE Challenge when evaluated on various unseen devices from the TAU Urban Acoustic Scenes 2020 Mobile Development dataset. In addition, our method appears robust even when provided with a small amount of target-domain data, proving effective using as few as 90 seconds of test audio recordings. Finally, we show that the proposed adaptation method can also be employed as a feature extraction stage for shallower neural networks, thus significantly reducing model complexity.

Authors with CRIS profile

Involved external institutions

How to cite

APA:

Mezza, A.I., Habets, E., Müller, M., & Sarti, A. (2022). Unsupervised Domain Adaptation via Principal Subspace Projection for Acoustic Scene Classification. Journal of Signal Processing Systems. https://doi.org/10.1007/s11265-021-01720-9

MLA:

Mezza, Alessandro Ilic, et al. "Unsupervised Domain Adaptation via Principal Subspace Projection for Acoustic Scene Classification." Journal of Signal Processing Systems (2022).

BibTeX: Download