SATIN: A persistent musical database for music information retrieval and a supporting deep learning experiment on song instrumental classification

This paper introduces SATIN, the Set of Audio Tags and Identifiers Normalized. SATIN is a database of 400k audio-related metadata and identifiers that aims at facilitating reproducibility and comparisons among the Music Information Retrieval (MIR) algorithms. The idea is to take advantage of partnerships between scientists and private companies that host millions of tracks. Scientists can send their feature extraction algorithm to companies along SATIN identifiers and retrieve the corresponding features. This procedure allows the MIR community to have access to more tracks for classification purposes. Afterwards, scientists can provide to the MIR community the classification result for each track, which can then be compared with other algorithms results. SATIN thus resolves the major problems of accessing more tracks, managing copyrights locks, saving computation time, and guaranteeing consistency over research databases. We introduce SOFT1, the first Set Of FeaTures extracted by a company thanks to SATIN. We propose a supporting experiment classifying instrumentals and songs to detail a possible use of SATIN. We compare a deep learning approach —that has emerged in recent years in MIR— with a knowledge-based approach.

Mots clés

Playlist generation Reproducibility Signal analysis Signal processing algorithms Music autotagging Acoustic signal processing Classification of instrumentals and songs Content-based audio retrieval Database Machine learning algorithms Music information retrieval Music recommendation

Domaines

Intelligence artificielle [cs.AI] Base de données [cs.DB] Recherche d'information [cs.IR] Apprentissage [cs.LG] Multimédia [cs.MM] Son [cs.SD] Traitement du signal et de l'image [eess.SP]

Yann Bayle : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01762796

Soumis le : mardi 10 avril 2018-14:15:26

Dernière modification le : lundi 5 juin 2023-16:52:12

Dates et versions

hal-01762796 , version 1 (10-04-2018)

Identifiants

HAL Id : hal-01762796 , version 1
DOI : 10.1007/s11042-018-5797-8

Citer

Yann Bayle, Matthias Robine, Pierre Hanna. SATIN: A persistent musical database for music information retrieval and a supporting deep learning experiment on song instrumental classification. Multimedia Tools and Applications, 2018, ⟨10.1007/s11042-018-5797-8⟩. ⟨hal-01762796⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS

70 Consultations

0 Téléchargements