Segmentation, Classification, and Visualization of Orca Calls Using Deep Learning

Schröter H, Nöth E, Maier A, Cheng R, Barth V, Bergler C (2019)


Publication Language: English

Publication Type: Conference contribution, Original article

Publication year: 2019

Publisher: IEEE

Pages Range: 8231-8235

Conference Proceedings Title: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Event location: Brighton GB

ISBN: 978-1-4799-8131-1

URI: https://ieeexplore.ieee.org/abstract/document/8683785

DOI: 10.1109/ICASSP.2019.8683785

Open Access Link: https://ieeexplore.ieee.org/abstract/document/8683785

Abstract

Audiovisual media are increasingly used to study the communication and behavior of animal groups, e.g. by placing microphones in the animals habitat resulting in huge datasets with only a small amount of animal interactions. The Orcalab has recorded orca whales since 1973 using stationary underwater hydrophones and made it publicly available on the Orchive. There exist over 15 000 manually extracted orca/noise annotations and about 20 000 h unseen audio data. To analyze the behavior and communication of killer whales we need to interpret the different call types. In this work, we present a two-stage classification approach using the labeled call/noise files and a few labeled call-type files. Results indicate a reliable accuracy of 95.0 % for call segmentation and 87 % for classification of 12 call classes. We further visualize the learned orca call representations in the convolutional neural network (CNN) activations to explain the potential of CNN based recognition for bioaccousitc signals.

Authors with CRIS profile

Related research project(s)

How to cite

APA:

Schröter, H., Nöth, E., Maier, A., Cheng, R., Barth, V., & Bergler, C. (2019). Segmentation, Classification, and Visualization of Orca Calls Using Deep Learning. In ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 8231-8235). Brighton, GB: IEEE.

MLA:

Schröter, Hendrik, et al. "Segmentation, Classification, and Visualization of Orca Calls Using Deep Learning." Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton IEEE, 2019. 8231-8235.

BibTeX: Download