Acoustic scene classification: An evaluation of an extremely compact feature representation

Gustavo Sena Mafra; Ngoc Q K Duong; Alexey Ozerov; Patrick Pérez

Communication Dans Un Congrès Année : 2016

Acoustic scene classification: An evaluation of an extremely compact feature representation

(1) , (2) , (2) , (2)

1
2

Gustavo Sena Mafra

Fonction : Auteur
PersonId : 994019

Universidade Federal de Santa Catarina = Federal University of Santa Catarina [Florianópolis]

Ngoc Q K Duong

Fonction : Auteur

Technicolor R & I [Cesson Sévigné]

Alexey Ozerov

Fonction : Auteur

Technicolor R & I [Cesson Sévigné]

Patrick Pérez

Fonction : Auteur
PersonId : 1022281

Technicolor R & I [Cesson Sévigné]

Résumé

This paper investigates several approaches to address the acoustic scene classification (ASC) task. We start from low-level feature representation for segmented audio frames and investigate different time granularity for feature aggregation. We study the use of support vector machine (SVM), as a well-known classifier, together with two popular neural network (NN) architectures, namely mul-tilayer perceptron (MLP) and convolutional neural network (CNN). We evaluate the performance of these approaches on benchmark datasets provided from the 2013 and 2016 Detection and Classification of Acoustic Scenes and Events (DCASE) challenges. We observe that a simple approach exploiting averaged Mel-log-spectrograms and SVM can obtain even better results than NN-based approaches and comparable performance with the best systems in the DCASE 2013 challenge.

Mots clés

Acoustic scene classification Audio features Multilayer Perceptron Convolutional Neural Network Support Vector Machine

Domaines

Traitement du signal et de l'image [eess.SP]

Fichier principal

SenaMafra-DCASE2016workshop.pdf (179.79 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Alexey Ozerov : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01400986

Soumis le : mardi 22 novembre 2016-17:08:50

Dernière modification le : lundi 4 mai 2020-17:00:04

Archivage à long terme le : mardi 21 mars 2017-04:30:50

Dates et versions

hal-01400986 , version 1 (22-11-2016)

Identifiants

HAL Id : hal-01400986 , version 1

Citer

Gustavo Sena Mafra, Ngoc Q K Duong, Alexey Ozerov, Patrick Pérez. Acoustic scene classification: An evaluation of an extremely compact feature representation. Detection and Classification of Acoustic Scenes and Events 2016, Sep 2016, Budapest, Hungary. ⟨hal-01400986⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

783 Consultations

603 Téléchargements

Acoustic scene classification: An evaluation of an extremely compact feature representation

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Partager