Conference paper, Year: 2016

Acoustic scene classification: An evaluation of an extremely compact feature representation

Abstract

This paper investigates several approaches to the acoustic scene classification (ASC) task. We start from a low-level feature representation of segmented audio frames and investigate different time granularities for feature aggregation. We study the use of a support vector machine (SVM), a well-known classifier, together with two popular neural network (NN) architectures, namely the multilayer perceptron (MLP) and the convolutional neural network (CNN). We evaluate the performance of these approaches on the benchmark datasets provided by the 2013 and 2016 Detection and Classification of Acoustic Scenes and Events (DCASE) challenges. We observe that a simple approach combining averaged Mel-log-spectrograms with an SVM can outperform the NN-based approaches and achieves performance comparable to the best systems in the DCASE 2013 challenge.
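The feature aggregation mentioned in the abstract can be illustrated with a minimal sketch. It assumes the Mel-log-spectrogram frames have already been computed and simply averages them band by band, either over the whole clip or over fixed-size chunks. The function name `aggregate_frames`, the parameter `frames_per_chunk`, and the toy values are hypothetical and chosen for illustration; this is not the authors' implementation.

```python
from typing import List, Sequence

Frame = Sequence[float]  # one log-Mel frame: one value per Mel band


def aggregate_frames(frames: Sequence[Frame], frames_per_chunk: int = 0) -> List[List[float]]:
    """Average consecutive frames band by band.

    frames_per_chunk <= 0 averages the whole clip into a single vector.
    A positive value yields one averaged vector per chunk of that many
    frames (the last chunk may be shorter).
    """
    if not frames:
        raise ValueError("need at least one frame")
    size = len(frames) if frames_per_chunk <= 0 else frames_per_chunk
    pooled = []
    for start in range(0, len(frames), size):
        chunk = frames[start:start + size]
        n_bands = len(chunk[0])
        # Mean of each Mel band over the frames in this chunk.
        pooled.append([sum(f[b] for f in chunk) / len(chunk) for b in range(n_bands)])
    return pooled


# Toy input: 4 frames x 2 Mel bands (made-up values, not real audio).
toy = [[0.0, 2.0], [2.0, 4.0], [4.0, 6.0], [6.0, 8.0]]
clip_level = aggregate_frames(toy)       # coarsest granularity: one vector per clip
chunk_level = aggregate_frames(toy, 2)   # finer granularity: one vector per 2 frames
print(clip_level)
print(chunk_level)
```

With whole-clip averaging, each recording is reduced to a single vector with one value per Mel band, which could then be passed to a classifier such as an SVM.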
Main file
SenaMafra-DCASE2016workshop.pdf (179.79 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01400986, version 1 (22-11-2016)

Identifiers

  • HAL Id: hal-01400986, version 1

Cite

Gustavo Sena Mafra, Ngoc Q K Duong, Alexey Ozerov, Patrick Pérez. Acoustic scene classification: An evaluation of an extremely compact feature representation. Detection and Classification of Acoustic Scenes and Events 2016, Sep 2016, Budapest, Hungary. ⟨hal-01400986⟩
