Conference paper, Year: 2012

Content-based Retrieval of Environmental Sounds by Multiresolution Analysis

Abstract

Query-by-example retrieval of environmental sound recordings is a research area with applications to sound design, music composition, and automatic suggestion of metadata for labeling sound databases. Retrieval problems are usually composed of successive feature extraction (FE) and similarity measurement (SM) steps, in which a set of extracted features encoding important properties of the sound recordings is used to compute the distance between elements in the database. Previous research has pointed out that features that are successful in the domains of speech and music, such as MFCCs, might fail at describing environmental sounds, which have intrinsic variability and noisy characteristics. We present a set of novel multiresolution features obtained by modeling the distribution of wavelet subband coefficients with generalized Gaussian densities (GGDs). We define the similarity measure in terms of the Kullback-Leibler divergence between GGDs. Experimental results on a database of 1020 environmental sound recordings show that our approach always outperforms a method based on traditional MFCC features and Euclidean distance, improving retrieval rates from 51% to 62%.
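As an illustration of the similarity measure described in the abstract, the following is a minimal sketch of the closed-form Kullback-Leibler divergence between two zero-mean GGDs. The abstract does not state the parametrization or how per-subband divergences are combined, so both are assumptions here: the density is taken to be p(x) = β / (2αΓ(1/β)) · exp(−(|x|/α)^β) with scale α and shape β, and the hypothetical helper `subband_distance` simply sums the divergences over subbands. The function names are illustrative and are not taken from the paper.

```python
import math


def ggd_kld(alpha1, beta1, alpha2, beta2):
    """KL divergence D(p1 || p2) between two zero-mean GGDs.

    Assumed density (not specified in the abstract):
        p(x) = beta / (2 * alpha * Gamma(1 / beta)) * exp(-(|x| / alpha) ** beta)
    """
    if min(alpha1, beta1, alpha2, beta2) <= 0:
        raise ValueError("scale and shape parameters must be positive")
    # Log-ratio of the two normalisation constants, computed with lgamma
    # to stay numerically stable for small shape parameters.
    log_term = (
        math.log(beta1 / beta2)
        + math.log(alpha2 / alpha1)
        + math.lgamma(1.0 / beta2)
        - math.lgamma(1.0 / beta1)
    )
    # Expectation under p1 of (|x| / alpha2) ** beta2.
    moment_term = (alpha1 / alpha2) ** beta2 * math.exp(
        math.lgamma((beta2 + 1.0) / beta1) - math.lgamma(1.0 / beta1)
    )
    return log_term + moment_term - 1.0 / beta1


def subband_distance(query, candidate):
    """Hypothetical aggregate: sum of per-subband KLDs.

    Each argument is a list of (alpha, beta) pairs, one pair per wavelet
    subband, in the same subband order. Summing is an assumption.
    """
    if len(query) != len(candidate):
        raise ValueError("both sounds need the same number of subbands")
    return sum(
        ggd_kld(a1, b1, a2, b2)
        for (a1, b1), (a2, b2) in zip(query, candidate)
    )
```

With β = 2 and α = √2·σ the GGD reduces to a Gaussian, so the function can be checked against the familiar Gaussian KL divergence. Note that the divergence is not symmetric, so the order of query and candidate matters.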
Main file
index.pdf (415.42 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01161434, version 1 (08-06-2015)

Identifiers

  • HAL Id: hal-01161434, version 1

Cite

Ianis Lallemand, Diemo Schwarz, Thierry Artières. Content-based Retrieval of Environmental Sounds by Multiresolution Analysis. SMC2012, Jul 2012, Copenhagen, Denmark. pp.1-1. ⟨hal-01161434⟩
