Same Semantics of the Signal - What Do We Cluster with what Representation

Barnhill A, Traub O, Nöth E, Maier A, Bergler C (2025)

Publication Language: English

Publication Type: Conference contribution, Original article

Publication year: 2025

Conference Proceedings Title: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing

Event location: Hyderabad, India

Abstract

Semantic clustering of bioacoustic signals is crucial for a deeper understanding of intra-class differences. This is particularly important for understanding killer whale signals, as their vocalizations are learned behaviors and determination of matrilineal-specific dialects is reliant upon subtle differences within instances which may be characterized into a single larger category. Aspects of data collection may have an effect on how these calls are grouped, and it is therefore necessary to understand what the focus of the feature generation algorithm is. This study addresses the impact of factors such as recording conditions and environment, together with its respective relative noise levels, by first analyzing two different deep learningand data-driven feature representations, either derived by an undercomplete autoencoder or a supervised call type classifier. These are then compared with representations generated by two state-of-the-art transformer-based tools, namely HuBERT and Wav2Vec2.

Authors with CRIS profile

Alexander Barnhill Lehrstuhl für Informatik 5 (Mustererkennung) Elmar Nöth Professur für Informatik (Mustererkennung) Andreas Maier Lehrstuhl für Informatik 5 (Mustererkennung)

Involved external institutions

Ostbayerische Technische Hochschule (OTH) Amberg-Weiden

Germany (DE)

How to cite

APA:

Barnhill, A., Traub, O., Nöth, E., Maier, A., & Bergler, C. (2025). Same Semantics of the Signal - What Do We Cluster with what Representation. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing. Hyderabad, India, IN.

MLA:

Barnhill, Alexander, et al. "Same Semantics of the Signal - What Do We Cluster with what Representation." Proceedings of the 2025 International Conference on Acoustics, Speech, and Signal Processing, Hyderabad, India 2025.

BibTeX: Download