Impact of overlapping speech detection on speaker diarization for broadcast news and debates

Abstract : The overlapping speech detection systems developped by Orange and LIMSI for the ETAPE evaluation campaign on French broadcast news and debates are described. Using either cepstral features or a multi-pitch analysis, a F1-measure for overlapping speech detection up to 59.2% is reported on the TV data of the ETAPE evaluation set, where 6.7% of the speech was measured as overlapping, ranging from 1.2% in the news to 10.4% in the debates. Overlapping speech segments were excluded during the speaker diarization stage, and these segments were further labelled with the two nearest speaker labels, taking into account the temporal distance. We describe the effects of this strategy for various overlapping speech systems and we show that it improves the diarization error rate in all situations and up to 26.1% relative in our best configuration.
Complete list of metadatas

Cited literature [23 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01836475
Contributor : Limsi Publications <>
Submitted on : Tuesday, November 6, 2018 - 10:37:33 AM
Last modification on : Tuesday, October 1, 2019 - 11:46:11 AM
Long-term archiving on : Thursday, February 7, 2019 - 1:30:33 PM

Files

Identifiers

  • HAL Id : hal-01836475, version 1

Collections

Citation

Delphine Charlet, Claude Barras, Jean-Sylvain Liénard. Impact of overlapping speech detection on speaker diarization for broadcast news and debates. IEEE International Conference on Acoustics, Speech, and Signal Processing, Jan 2013, Vancouver, Canada. ⟨hal-01836475⟩

Share

Metrics

Record views

24

Files downloads

178