Skip to Main content Skip to Navigation
Conference papers

Comparing identification of vocal imitations and computational sketches of everyday sounds

Guillaume Lemaître 1 Olivier Houix 1 Frédéric Voisin 1 Nicolas Misdariis 1 Patrick Susini 1
1 Equipe Perception et design sonores
STMS - Sciences et Technologies de la Musique et du Son
Abstract : Sounds are notably difficult to describe. It is thus not surprising that human speakers often use many imitative vocalizations to communicate about sounds. In practice,vocal imitations of non-speech everyday sounds (e.g. the sound of a car passing by) arevery effective: listeners identify sounds better with vocal imitations than with verbal descriptions, despite the fact that vocal imitations are often inaccurate, constrained by the human vocal apparatus. The present study investigated the semantic representations evoked by vocal imitations by experimentally quantifying how well listeners could match sounds to category labels. Itcompared two different types of sounds: human vocal imitations, and computational auditory sketches (created by algorithmic computations), both based on easily identifiable sounds (sounds of human actions and manufactured products). The results show that performance with the best vocal imitations was similar to the best auditory sketches for most categories of sounds.More detailed analyses showed that the acoustic distance between vocal imitations and referent sounds is not sufficient to account for such performance. They suggested that instead of reproducing the acoustic properties of the referent sound as accurately as vocally possible, vocal imitations focus on a few important features dependent on each particular sound category.
Document type :
Conference papers
Complete list of metadata
Contributor : Ircam Ircam Connect in order to contact the contributor
Submitted on : Sunday, January 29, 2017 - 9:26:19 PM
Last modification on : Tuesday, July 13, 2021 - 2:17:12 PM
Long-term archiving on: : Sunday, April 30, 2017 - 12:24:35 PM


Files produced by the author(s)



Guillaume Lemaître, Olivier Houix, Frédéric Voisin, Nicolas Misdariis, Patrick Susini. Comparing identification of vocal imitations and computational sketches of everyday sounds. Meeting of the Acoustical Society of America, Nov 2016, Honolulu, United States. pp.3390 - 3390, ⟨10.1121/1.4970854⟩. ⟨hal-01448968⟩



Record views


Files downloads