ORCA-PARTY: An Automatic Killer Whale Sound Type Separation Toolkit Using Deep Learning

Bergler C, Schmitt M, Maier A, Cheng RX, Barth V, Nöth E (2022)


Publication Language: English

Publication Type: Conference contribution, Original article

Publication year: 2022

Publisher: IEEE

Conference Proceedings Title: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Event location: Singapore, SG

DOI: 10.1109/icassp43922.2022.9746623

Abstract

Data-driven and machine-based analysis of massive bioacoustic data collections, in particular acoustic regions containing a substantial number of vocalization events, is essential and extremely valuable to identify recurring vocal paradigms. However, these acoustic sections are usually characterized by a strong incidence of overlapping vocalization events, a major problem severely affecting subsequent human-/machine-based analysis and interpretation. Robust machine-driven signal separation of species-specific call types is extremely challenging due to missing ground truth data, missing speaker/source-relevant information, and limited knowledge about inter- and intra-call type variations, as well as diverse recording conditions. The current study is the first to introduce a fully-automated deep signal separation approach for overlapping orca vocalizations, addressing all of the previously mentioned challenges, together with one of the largest bioacoustic data archives recorded on killer whales (Orcinus orca). Incorporating ORCA-PARTY as an additional data enhancement step for downstream call type classification proved to be extremely valuable. Besides the proof of cross-domain applicability and consistently promising results on non-overlapping signals, significant improvements were achieved when processing acoustic orca segments comprising a multitude of vocal activities. Apart from promising visual inspections, a final numerical evaluation on an unseen dataset showed that about 30 % more known sound patterns could be identified.

How to cite

APA:

Bergler, C., Schmitt, M., Maier, A., Cheng, R.X., Barth, V., & Nöth, E. (2022). ORCA-PARTY: An Automatic Killer Whale Sound Type Separation Toolkit Using Deep Learning. In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Singapore, SG: IEEE.

MLA:

Bergler, Christian, et al. "ORCA-PARTY: An Automatic Killer Whale Sound Type Separation Toolkit Using Deep Learning." Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Singapore, IEEE, 2022.