Skip to Main content Skip to Navigation
Conference papers

Deep Learning and Domain Transfer for Orca Vocalization Detection

Abstract : In this paper, we study the difficulties of domain transfer when training deep learning models, on a specific task that is orca vocalization detection. Deep learning appears to be an answer to many sound recognition tasks in human speech analysis as well as in bioacoustics. This method allows to learn from large amounts of data, and find the best scoring way to discriminate between classes (e.g. orca vocalization and other sounds). However, to learn the perfect data representation and discrimination boundaries, all possible data configurations need to be processed. This causes problems when those configurations are ever changing (e.g. in our experiment, a change in the recording system happened to considerably disturb our previously well performing model). We thus explore approaches to compensate on the difficulties faced with domain transfer, with two convolutionnal neural networks (CNN) architectures, one that works in the time-frequency domain, and one that works directly on the time domain.
Complete list of metadatas
Contributor : Paul Best <>
Submitted on : Thursday, June 11, 2020 - 4:09:21 PM
Last modification on : Saturday, June 13, 2020 - 3:37:35 AM


Files produced by the author(s)


  • HAL Id : hal-02865300, version 1



Paul Best, Maxence Ferrari, Marion Poupard, Sébastien Paris, Ricard Marxer, et al.. Deep Learning and Domain Transfer for Orca Vocalization Detection. International joint conference on neural networks, Jul 2020, glasgow, United Kingdom. ⟨hal-02865300⟩



Record views


Files downloads