SATIN: A Persistent Musical Database for Music Information Retrieval

Abstract : This paper introduces SATIN, the Set of Audio Tags and Identifiers Normalized. SATIN is a database of 400k audio-related metadata and identifiers that aims at facilitating reproducibility and comparisons among the MIR algorithms. The idea is to take advantage of partnerships between scientists and private companies that host millions of tracks. Scientists can send their feature extraction algorithm to companies along SATIN's identifiers and retrieve the corresponding features. This procedure allows the MIR community to have access to more tracks for classification purposes. Afterward, scientists can provide to the MIR community the classification result for each track, which can then be compared with other algorithms results. SATIN thus resolves the major problems of accessing more tracks, managing copyrights locks, saving computation time, and guaranteeing consistency over research databases. We also introduce SOFT1, the first Set Of FeaTures extracted by a company thanks to SATIN.
Complete list of metadatas
Contributor : Yann Bayle <>
Submitted on : Friday, July 28, 2017 - 1:26:45 PM
Last modification on : Tuesday, May 29, 2018 - 9:14:04 AM


  • HAL Id : hal-01570099, version 1



Yann Bayle, Pierre Hanna, Matthias Robine. SATIN: A Persistent Musical Database for Music Information Retrieval. 15th International Workshop on Content-Based Multimedia Indexing (CBMI), Jun 2017, Florence, Italy. pp.1-5. ⟨hal-01570099⟩



Record views