Schröter H, Nöth E, Maier A, Cheng R, Barth V, Bergler C (2019)
Publication Language: English
Publication Type: Conference contribution, Original article
Publication year: 2019
Publisher: IEEE
Pages Range: 8231-8235
Conference Proceedings Title: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
ISBN: 978-1-4799-8131-1
URI: https://ieeexplore.ieee.org/abstract/document/8683785
DOI: 10.1109/ICASSP.2019.8683785
Open Access Link: https://ieeexplore.ieee.org/abstract/document/8683785
Audiovisual media are increasingly used to study the communication and behavior of animal groups, e.g. by placing microphones in the animals habitat resulting in huge datasets with only a small amount of animal interactions. The Orcalab has recorded orca whales since 1973 using stationary underwater hydrophones and made it publicly available on the Orchive. There exist over 15 000 manually extracted orca/noise annotations and about 20 000 h unseen audio data. To analyze the behavior and communication of killer whales we need to interpret the different call types. In this work, we present a two-stage classification approach using the labeled call/noise files and a few labeled call-type files. Results indicate a reliable accuracy of 95.0 % for call segmentation and 87 % for classification of 12 call classes. We further visualize the learned orca call representations in the convolutional neural network (CNN) activations to explain the potential of CNN based recognition for bioaccousitc signals.
APA:
Schröter, H., Nöth, E., Maier, A., Cheng, R., Barth, V., & Bergler, C. (2019). Segmentation, Classification, and Visualization of Orca Calls Using Deep Learning. In ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 8231-8235). Brighton, GB: IEEE.
MLA:
Schröter, Hendrik, et al. "Segmentation, Classification, and Visualization of Orca Calls Using Deep Learning." Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton IEEE, 2019. 8231-8235.
BibTeX: Download