Impact of overlapping speech detection on speaker diarization for broadcast news and debates
Résumé
The overlapping speech detection systems developped by Orange and LIMSI for the ETAPE evaluation campaign on French broadcast news and debates are described.
Using either cepstral features or a multi-pitch analysis, a F1-measure for overlapping speech detection up to 59.2% is reported on the TV data of the ETAPE evaluation set, where 6.7% of the speech was measured as overlapping, ranging from 1.2% in the news to 10.4% in the debates.
Overlapping speech segments were excluded during the speaker diarization stage, and these segments were further labelled with the two nearest speaker labels, taking into account the temporal distance. We describe the effects of this strategy for various overlapping speech systems and we show that it improves the diarization error rate in all situations and up to 26.1% relative in our best configuration.
Fichier principal
charletbarraslienard_icassp2013.pdf (278.06 Ko)
Télécharger le fichier
Version soumise.pdf (289 Ko)
Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...