Conference paper, Year: 2017

Densely Connected CNNs for Bird Audio Detection

Abstract

Automatic detection of bird sounds in audio recordings, if accurate enough, is expected to greatly help the research community working in bio- and ecoacoustics, which is interested in monitoring biodiversity based on audio field recordings. To estimate how accurate state-of-the-art machine learning approaches are, the Bird Audio Detection challenge, involving large audio datasets, was recently organized. In this paper, experiments using several types of convolutional neural networks (standard CNNs, residual nets and densely connected nets) are reported in the framework of this challenge. DenseNets were the preferred solution since they were the best performing and most compact models, leading to an 88.22% area under the receiver operating characteristic curve (AUC) score on the test set of the challenge. Performance gains were obtained thanks to data augmentation through time and frequency shifting, model parameter averaging during training, and ensemble methods using the geometric mean. In contrast, attempts to enlarge the training dataset with test-set samples, using automatic predictions as pseudo-ground-truth labels, consistently degraded performance.
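
The three techniques credited with performance gains in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not give implementation details, so everything here is an assumption made for illustration: the function names are hypothetical, spectrograms are assumed to have shape (n_freq, n_time), shifts are assumed to wrap around circularly, and parameter averaging is assumed to be a plain arithmetic mean over saved snapshots.

```python
import numpy as np


def random_shift(spec, max_time_shift, max_freq_shift, rng):
    """Shift a (n_freq, n_time) spectrogram by random offsets on both axes.

    Assumption of this sketch: content pushed past one edge wraps around
    to the other (circular shift), so no values are lost.
    """
    dt = int(rng.integers(-max_time_shift, max_time_shift + 1))
    df = int(rng.integers(-max_freq_shift, max_freq_shift + 1))
    return np.roll(spec, shift=(df, dt), axis=(0, 1))


def geometric_mean_ensemble(probs, eps=1e-12):
    """Combine per-model probabilities of shape (n_models, n_clips).

    The geometric mean is computed in the log domain; eps guards
    against log(0) when a model outputs exactly zero.
    """
    probs = np.clip(np.asarray(probs, dtype=float), eps, 1.0)
    return np.exp(np.log(probs).mean(axis=0))


def average_parameters(snapshots):
    """Average parameter dicts (name -> ndarray) saved during training.

    Assumption of this sketch: an unweighted mean over all snapshots.
    """
    names = snapshots[0].keys()
    return {n: np.mean([s[n] for s in snapshots], axis=0) for n in names}


rng = np.random.default_rng(0)
spec = rng.random((80, 200))  # stand-in for a log-mel spectrogram
aug = random_shift(spec, max_time_shift=20, max_freq_shift=4, rng=rng)

# Two hypothetical models scoring two clips.
ensemble = geometric_mean_ensemble([[0.9, 0.2], [0.4, 0.2]])

# Two hypothetical weight snapshots of a one-parameter model.
averaged = average_parameters([{"w": np.array([1.0, 3.0])},
                               {"w": np.array([3.0, 5.0])}])
```

Compared with an arithmetic mean, the geometric mean pulls the combined score towards the lowest individual prediction, so a clip only receives a high score when the models agree.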
Main file
pellegrini_19111.pdf (283.93 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01913975, version 1 (06-11-2018)

Identifiers

  • HAL Id: hal-01913975, version 1
  • OATAO: 19111

Cite

Thomas Pellegrini. Densely Connected CNNs for Bird Audio Detection. 25th European Signal and Image Processing Conference (EUSIPCO 2017), Aug 2017, Kos island, Greece. pp. 1734-1738. ⟨hal-01913975⟩