A set of audio features for the morphological description of vocal imitations

Enrico Marchetto; Geoffroy Peeters

Conference paper, Year: 2015

Enrico Marchetto

Role: Corresponding author
PersonId: 974529

Analyse et synthèse sonores [Paris]

Geoffroy Peeters

Role: Corresponding author
PersonId: 6738
IdHAL: geoffroy-peeters
ORCID: 0000-0001-5255-3019
IdRef: 187470472

Analyse et synthèse sonores [Paris]

Abstract

In our current project, the vocal signal is used to drive sound synthesis. To study the mapping between voice and synthesis parameters, we first study the inverse problem. A set of reference synthesizer sounds was created, and each sound was imitated by a large number of people. Each reference synthesizer sound belongs to one of the following six morphological categories: "up", "down", "up/down", "impulse", "repetition", "stable". The goal of this paper is to study the automatic estimation of these morphological categories from the vocal imitations. We propose three approaches. A baseline system is introduced first: it uses standard audio descriptors as inputs to a continuous Hidden Markov Model (HMM) and achieves an accuracy of 55.1%. To improve on this, we propose a set of slope descriptors which, converted into symbols, are used as input to a discrete HMM. This system reaches 70.8% accuracy. Recognition performance is further increased by developing specific compact audio descriptors that directly highlight the morphological aspects of the sounds instead of relying on an HMM. This system reaches the highest accuracy: 83.6%.
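The idea of slope descriptors "converted into symbols" can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the window length, the threshold, and the three-symbol alphabet ("U" for rising, "D" for falling, "S" for stable) are all assumptions made for illustration. The sketch fits a least-squares slope to each non-overlapping window of a contour (for example pitch or loudness over time) and quantizes it into a symbol, giving a sequence of the kind a discrete HMM could take as input.

```python
from typing import List, Sequence


def window_slope(values: Sequence[float]) -> float:
    """Least-squares slope of the values against their sample index."""
    n = len(values)
    if n < 2:
        return 0.0
    mean_x = (n - 1) / 2.0
    mean_y = sum(values) / n
    num = sum((i - mean_x) * (v - mean_y) for i, v in enumerate(values))
    den = sum((i - mean_x) ** 2 for i in range(n))
    return num / den


def slope_symbols(
    contour: Sequence[float],
    win: int = 5,            # hypothetical window length, in frames
    threshold: float = 0.1,  # hypothetical slope threshold
) -> List[str]:
    """Quantize a contour into 'U' (up), 'D' (down), 'S' (stable) symbols.

    Windows do not overlap; trailing samples that do not fill a whole
    window are ignored.
    """
    symbols = []
    for start in range(0, len(contour) - win + 1, win):
        slope = window_slope(contour[start:start + win])
        if slope > threshold:
            symbols.append("U")
        elif slope < -threshold:
            symbols.append("D")
        else:
            symbols.append("S")
    return symbols


# A contour that rises and then falls, loosely resembling the "up/down" category.
up_down = list(range(10)) + list(range(9, -1, -1))
print(slope_symbols(up_down))
```

In a complete system, one such symbol sequence per imitation would be scored by one discrete HMM per morphological category; that training and scoring step is omitted here.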

Domains

Sound [cs.SD]; Signal and Image Processing [eess.SP]

Main file

DAFx2015.pdf (417.91 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-01253651

Submitted on: Monday, 11 January 2016, 10:52:52

Last modified on: Friday, 24 March 2023, 14:53:01

Long-term archiving on: Tuesday, 12 April 2016, 11:10:35

Dates and versions

hal-01253651, version 1 (11-01-2016)

Identifiers

HAL Id: hal-01253651, version 1

Cite

Enrico Marchetto, Geoffroy Peeters. A set of audio features for the morphological description of vocal imitations. Proc. of the 18th Intl. Conf. on Digital Audio Effects, Nov 2015, Trondheim, Norway. ⟨hal-01253651⟩

Collections

UPMC CNRS IRCAM STMS SORBONNE-UNIVERSITE SU-SCIENCES

277 views

166 downloads
