Skip to Main content Skip to Navigation
New interface
Conference papers

Audio Identification based on spectral modeling of bark-bands energy and synchronization through onset detection

Mathieu Ramona 1 Geoffroy Peeters 1 
1 Analyse et synthèse sonores [Paris]
STMS - Sciences et Technologies de la Musique et du Son
Abstract : In this paper, we present for the first time the fingerprint IRCAM system for audio identification in streams. The baseline system relies on a double-nested Short Time Fourier Transform. The first STFT computes the energies of a filter-bank, that are then modelled over 2 s, using a second STFT. We then present recent improvements of our system: first the inclusion of perceptual scales for amplitude and frequency (Bark bands), then the synchronization of stream and database frames using an onset detection system. The performance of these improvements is tested on a large set of real audio streams. We compare our results with the results of re-implementations of the two state-of-the-art systems of Philips and Shazam.
Mots-clés : NA Informatique musicale
Complete list of metadata

Cited literature [14 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01161269
Contributor : ircam ircam Connect in order to contact the contributor
Submitted on : Monday, June 8, 2015 - 2:34:06 PM
Last modification on : Tuesday, March 15, 2022 - 3:19:41 AM
Long-term archiving on: : Tuesday, April 25, 2017 - 4:49:04 AM

File

index.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01161269, version 1

Citation

Mathieu Ramona, Geoffroy Peeters. Audio Identification based on spectral modeling of bark-bands energy and synchronization through onset detection. ICASSP, May 2011, Prague, Czech Republic. pp.1-1. ⟨hal-01161269⟩

Share

Metrics

Record views

122

Files downloads

278