Deep Learning and Domain Transfer for Orca Vocalization Detection

In this paper, we study the difficulties of domain transfer when training deep learning models, on a specific task that is orca vocalization detection. Deep learning appears to be an answer to many sound recognition tasks in human speech analysis as well as in bioacoustics. This method allows to learn from large amounts of data, and find the best scoring way to discriminate between classes (e.g. orca vocalization and other sounds). However, to learn the perfect data representation and discrimination boundaries, all possible data configurations need to be processed. This causes problems when those configurations are ever changing (e.g. in our experiment, a change in the recording system happened to considerably disturb our previously well performing model). We thus explore approaches to compensate on the difficulties faced with domain transfer, with two convolutionnal neural networks (CNN) architectures, one that works in the time-frequency domain, and one that works directly on the time domain.

Mots clés

End-to-end sound recognition Orca Vocalizations Deep Convolutionnal Neural Networks Spectral sound recognition

Domaines

Machine Learning [stat.ML] Traitement du signal et de l'image [eess.SP]

Fichier principal

IJCNN_ORCALAB.pdf (1.85 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Paul Best : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02865300

Soumis le : jeudi 11 juin 2020-16:09:21

Dernière modification le : vendredi 22 mars 2024-18:24:04

Dates et versions

hal-02865300 , version 1 (11-06-2020)

Identifiants

HAL Id : hal-02865300 , version 1

Citer

Paul Best, Maxence Ferrari, Marion Poupard, Sébastien Paris, Ricard Marxer, et al.. Deep Learning and Domain Transfer for Orca Vocalization Detection. International joint conference on neural networks, Jul 2020, glasgow, United Kingdom. ⟨hal-02865300⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-TLN CNRS UNIV-AMU LIS-LAB ANR U-PICARDIE LAMFA INCIAM

241 Consultations

356 Téléchargements