The bag-of-frames approach: a not so sufficient model for urban soundscapes

Abstract : The "bag-of-frames" approach (BOF), which encodes audio signals as the long-term statistical distribution of short-term spectral features, is commonly regarded as an effective and sufficient way to represent environmental sound recordings (soundscapes) since its introduction in an influential 2007 article. The present paper describes a concep-tual replication of this seminal article using several new soundscape datasets, with results strongly questioning the adequacy of the BOF approach for the task. We show that the good accuracy originally re-ported with BOF likely result from a particularly thankful dataset with low within-class variability, and that for more realistic datasets, BOF in fact does not perform significantly better than a mere one-point av-erage of the signal's features. Soundscape modeling, therefore, may not be the closed case it was once thought to be. Progress, we ar-gue, could lie in reconsidering the problem of considering individual acoustical events within each soundscape.
Type de document :
Article dans une revue
Journal of the Acoustical Society of America, Acoustical Society of America, 2015, 138 (5), pp.487-492
Liste complète des métadonnées

Littérature citée [18 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01082501
Contributeur : Mathieu Lagrange <>
Soumis le : vendredi 10 juin 2016 - 13:21:26
Dernière modification le : jeudi 7 février 2019 - 17:19:27

Fichiers

lagrangeBof2014.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01082501, version 2
  • ARXIV : 1412.4052

Citation

Mathieu Lagrange, Grégoire Lafay, Boris Defreville, Jean-Julien Aucouturier. The bag-of-frames approach: a not so sufficient model for urban soundscapes. Journal of the Acoustical Society of America, Acoustical Society of America, 2015, 138 (5), pp.487-492. 〈hal-01082501v2〉

Partager

Métriques

Consultations de la notice

542

Téléchargements de fichiers

123