Pauses and respiratory markers of the structure of book reading

Gérard Bailly 1, Cécilia Gouvernayre 1
1 GIPSA-MAGIC - MAGIC
GIPSA-DPC - Département Parole et Cognition
Abstract : The automatic reading of books by text-to-speech synthesizers requires not only an adequate encoding of the many levels of information and discourse structure in the acoustic signal, but also proper breathing patterns, so as to pace information and organize discourse at an ecological rhythm. We analyze here the locations and durations of nearly 4,000 pauses produced by a voice donor who has read several audiobooks that are freely available on the web. Since the voice was recorded with a close microphone, we also characterize the acoustic markers of inhalation. We show that the delay between the end of phonation and air intake can be considered an additional marker of thematic continuity between two adjacent speech chunks, complementing well-documented prosodic cues such as the preboundary tone, preboundary lengthening, and pause duration.

https://hal.archives-ouvertes.fr/hal-00741667
Contributor : Gérard Bailly
Submitted on : Monday, October 15, 2012 - 9:06:21 AM
Last modification on : Monday, April 9, 2018 - 12:22:33 PM
Document(s) archived on : Wednesday, January 16, 2013 - 3:37:40 AM

File

gb_IS12_final.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00741667, version 1

Citation

Gérard Bailly, Cécilia Gouvernayre. Pauses and respiratory markers of the structure of book reading. 13th Annual Conference of the International Speech Communication Association (InterSpeech 2012), Sep 2012, Portland, United States. pp.Thu.O9d.05. ⟨hal-00741667⟩
