A weakly-supervised discriminative model for audio-to-score alignment

In this paper, we consider a new discriminative approach to the problem of audio-to-score alignment. We consider the two distinct informations provided by the music scores: (i) an exact ordered list of musical events and (ii) an approximate prior information about relative duration of events. We extend the basic dynamic time warping algorithm to a convex problem that learns optimal classifiers for all events while jointly aligning files, using this weak supervision only. We show that the relative duration between events can be easily used as a penalization of our cost function and allows us to drastically improve performances of our approach. We demonstrate the validity of our approach on a large and realistic dataset.

Mots clés

weakly supervised learning score-following audio-to-score

Domaines

Traitement du signal et de l'image [eess.SP]

Fichier principal

icassp2016.pdf (283.14 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Philippe Cuvillier : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01251018

Soumis le : mardi 5 janvier 2016-15:12:06

Dernière modification le : vendredi 19 avril 2024-16:18:55

Archivage à long terme le : jeudi 7 avril 2016-15:26:31

Dates et versions

hal-01251018 , version 1 (05-01-2016)

Identifiants

HAL Id : hal-01251018 , version 1

Citer

Rémi Lajugie, Piotr Bojanowski, Philippe Cuvillier, Sylvain Arlot, Francis Bach. A weakly-supervised discriminative model for audio-to-score alignment. 41st International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Mar 2016, Shanghai, China. ⟨hal-01251018⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS UPMC CNRS INRIA IRCAM LM-ORSAY STMS INRIA2 PSL UNIV-PARIS-SACLAY SORBONNE-UNIVERSITE SU-SCIENCES GS-MATHEMATIQUES

646 Consultations

658 Téléchargements