Book section

Interactive Sound Texture Synthesis Through Semi-Automatic User Annotations

Diemo Schwarz ¹, Baptiste Caramiaux ¹
¹ Équipe Interactions musicales temps-réel (Real-Time Musical Interactions team)
STMS – Sciences et Technologies de la Musique et du Son
Abstract: We present a way to make environmental recordings controllable again through continuous annotations of the high-level semantic parameter one wishes to control, e.g. wind strength or crowd excitation level. A partial annotation can be propagated to cover the entire recording via cross-modal analysis between gesture and sound using canonical time warping (CTW). The annotations then serve as a descriptor for lookup in corpus-based concatenative synthesis, inverting the sound/annotation relationship. The workflow has been evaluated in a preliminary subject test: canonical correlation analysis (CCA) shows high consistency between subjects' annotations, and a small set of audio descriptors correlates well with them. An experiment on the propagation of annotations shows the superior performance of CTW over CCA with as little as 20 s of annotated material.
Contributor: Ircam
Submitted on: Monday, June 8, 2015
Last modification on: Friday, January 8, 2021


Files produced by the author(s)


  • HAL Id: hal-01161076, version 1


Diemo Schwarz, Baptiste Caramiaux. Interactive Sound Texture Synthesis Through Semi-Automatic User Annotations. In: Aramaki, M., Derrien, O., Kronland-Martinet, R., Ystad, S. (eds.), Sound, Music, and Motion. Lecture Notes in Computer Science, vol. 8905, pp. 372–392. Springer International Publishing, 2014. ⟨hal-01161076⟩

