Skip to Main content Skip to Navigation
Conference papers

Combining Multi-Probe Histogram and Order-Statistics Based LSH for Scalable Audio Content Retrieval

Yi Yu Michel Crucianu 1 Vincent Oria Ernesto Damiani 
1 CEDRIC - VERTIGO - CEDRIC. Données complexes, apprentissage et représentations
CEDRIC - Centre d'études et de recherche en informatique et communications
Abstract : To improve the reliability and the scalability of content-based retrieval of variant audio tracks from large music databases, we suggest a new multi-stage LSH scheme consisting in (i) the extraction of compact but accurate representations from audio tracks by exploiting the LSH idea to summarize audio tracks, and (ii) an adequate organization of the resulting representations in LSH tables, retaining almost the same accuracy as an exact kNN retrieval. In the first stage we use major bins of successive chroma features and calculate a multi-probe histogram (MPH) that is concise but retains the information about local temporal correlations. In the second stage, based on the order statistics (OS) of MPH, we propose a new LSH scheme, OS-LSH, to organize and probe the histograms. The representation and organization of the audio tracks are storage efficient and support robust and scalable retrieval. Extensive experiments over a large dataset with 30,000 real audio tracks confirm the effectiveness and efficiency of the proposed scheme.
Document type :
Conference papers
Complete list of metadata
Contributor : Laboratoire CEDRIC Connect in order to contact the contributor
Submitted on : Friday, March 6, 2015 - 11:26:30 AM
Last modification on : Wednesday, September 28, 2022 - 6:00:14 AM


  • HAL Id : hal-01125759, version 1



Yi Yu, Michel Crucianu, Vincent Oria, Ernesto Damiani. Combining Multi-Probe Histogram and Order-Statistics Based LSH for Scalable Audio Content Retrieval. ACM Multimedia, Nov 2010, X, France. pp.381-390. ⟨hal-01125759⟩



Record views