Detection and Classification of Acoustic Scenes and Events

Abstract : For intelligent systems to make best use of the audio modality, it is important that they can recognise not just speech and music, which have been researched as specific tasks, but also general sounds in everyday environments. To stimulate research in this field we conducted a public research challenge: the IEEE Audio and Acoustic Signal Processing Technical Committee challenge on Detection and Classification of Acoustic Scenes and Events (DCASE). In this paper we report on the state of the art in automatically classifying audio scenes, and automatically detecting and classifying audio events. We survey prior work as well as the state of the art represented by the submissions to the challenge from various research groups. We also provide detail on the organisation of the challenge, so that our experience as challenge hosts may be useful to those organising challenges in similar domains. We created new audio datasets and baseline systems for the challenge: these, as well as some submitted systems, are publicly available under open licences, to serve as benchmark for further research in general-purpose machine listening.
Type de document :
Pré-publication, Document de travail
2015
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-01123760
Contributeur : Mathieu Lagrange <>
Soumis le : jeudi 5 mars 2015 - 14:01:43
Dernière modification le : jeudi 10 janvier 2019 - 14:56:03
Document(s) archivé(s) le : samedi 6 juin 2015 - 10:51:09

Fichier

dcasej.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01123760, version 1

Collections

Citation

Dan Stowell, Dimitrios Giannoulis, Emmanouil Benetos, Mathieu Lagrange, Mark D. Plumbley. Detection and Classification of Acoustic Scenes and Events. 2015. 〈hal-01123760〉

Partager

Métriques

Consultations de la notice

307

Téléchargements de fichiers

1624