Pauses and respiratory markers of the structure of book reading

Gérard Bailly (1), Cécilia Gouvernayre (1)
(1) GIPSA-DPC - Département Parole et Cognition
Abstract : The automatic reading of books by text-to-speech synthesizers requires not only the adequate encoding of the many levels of information and discourse structure in the acoustic signal but also proper breathing patterns, so as to pace information and organize discourse at an ecological rhythm. We analyze here the locations and durations of nearly 4,000 pauses produced by a voice donor who read several audiobooks that are freely available on the web. Since the voice was recorded with a close microphone, we also characterize the acoustic markers of inhalation and show that the delay between the end of phonation and air intake can be considered an additional marker of thematic continuity between the two adjacent speech chunks, one that complements well-documented prosodic cues such as preboundary tone and lengthening, or pause duration.

Contributor : Gérard Bailly
Submitted on : Monday, October 15, 2012 - 9:06:21 AM
Last modification on : Monday, April 9, 2018 - 12:22:33 PM
Document(s) archived on : Wednesday, January 16, 2013 - 3:37:40 AM


Files produced by the author(s)


  • HAL Id : hal-00741667, version 1


Gérard Bailly, Cécilia Gouvernayre. Pauses and respiratory markers of the structure of book reading. 13th Annual Conference of the International Speech Communication Association (InterSpeech 2012), Sep 2012, Portland, United States. pp.Thu.O9d.05. ⟨hal-00741667⟩


