Deep Learning and Domain Transfer for Orca Vocalization Detection - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Deep Learning and Domain Transfer for Orca Vocalization Detection

Résumé

In this paper, we study the difficulties of domain transfer when training deep learning models, on a specific task that is orca vocalization detection. Deep learning appears to be an answer to many sound recognition tasks in human speech analysis as well as in bioacoustics. This method allows to learn from large amounts of data, and find the best scoring way to discriminate between classes (e.g. orca vocalization and other sounds). However, to learn the perfect data representation and discrimination boundaries, all possible data configurations need to be processed. This causes problems when those configurations are ever changing (e.g. in our experiment, a change in the recording system happened to considerably disturb our previously well performing model). We thus explore approaches to compensate on the difficulties faced with domain transfer, with two convolutionnal neural networks (CNN) architectures, one that works in the time-frequency domain, and one that works directly on the time domain.
Fichier principal
Vignette du fichier
IJCNN_ORCALAB.pdf (1.85 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-02865300 , version 1 (11-06-2020)

Identifiants

  • HAL Id : hal-02865300 , version 1

Citer

Paul Best, Maxence Ferrari, Marion Poupard, Sébastien Paris, Ricard Marxer, et al.. Deep Learning and Domain Transfer for Orca Vocalization Detection. International joint conference on neural networks, Jul 2020, glasgow, United Kingdom. ⟨hal-02865300⟩
241 Consultations
356 Téléchargements

Partager

Gmail Facebook X LinkedIn More