Time structure and detection of the multivoiced segments in mixed speech

Abstract : When two speech signals are mixed in a single channel the voiced parts of any of them remain mostly unaltered during the voicing interruptions of the other, i.e. pauses and voiceless consonants. The mixture is made of 3 types of multivoiced segments noted 0V (unvoiced), 1V (one voicing) and 2V (two voicings). A statistical study of read-aloud texts reveals that total time spent in the 1V state is twice as long as the time spent in any of the other states. The HSC multipitch algorithm, based on a specific mechanism that eliminates the f0 halving and doubling errors, is used to locate the 3 segments types in the signal. This feature is illustrated by the task of spotting a short utterance repeatedly mixed with a long text.
Complete list of metadatas

Cited literature [7 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01836479
Contributor : Limsi Publications <>
Submitted on : Tuesday, November 6, 2018 - 10:30:16 AM
Last modification on : Tuesday, October 1, 2019 - 11:46:11 AM
Long-term archiving on : Thursday, February 7, 2019 - 1:18:04 PM

File

2011 Lienard Barras Signol ICP...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01836479, version 1

Collections

Citation

Jean-Sylvain Liénard, Claude Barras, François Signol. Time structure and detection of the multivoiced segments in mixed speech. International Congress of Phonetic Sciences, Jan 2011, Hong Kong, China. ⟨hal-01836479⟩

Share

Metrics

Record views

12

Files downloads

7