Service interruption on Monday 11 July from 12:30 to 13:00: all the sites of the CCSD (HAL, EpiSciences, SciencesConf, AureHAL) will be inaccessible (network hardware connection).
Skip to Main content Skip to Navigation
Conference papers

Speaker diarization: about whom the speaker is talking?

Abstract : The automatic speaker diarization consists in splitting the signal into homogeneous segments and clustering them by speakers. However the speaker segments are specified with anonymous labels. This pa- per proposed a solution to identify those speakers by extracting their full names pronounced in the show. With a semantic classification tree automatically built on a training corpus, the full names detected in transcription of a segment are associated to this segment or to one of its neighbors. Then, a merging method allows to associate a full name to a speaker cluster instead of a anonymous label provided by the diarization. The experiments are carried out over French broadcast news records from the ESTER 2005 evaluation campaign. About 70% show duration is correctly processed for both development and eval- uation corpora. On the evaluation corpus, 18.15% show duration is wrongly named and no decision is taken for 11.91% show duration.
Document type :
Conference papers
Complete list of metadata

Cited literature [13 references]  Display  Hide  Download
Contributor : sylvain meignier Connect in order to contact the contributor
Submitted on : Thursday, February 9, 2017 - 2:14:03 PM
Last modification on : Wednesday, January 6, 2021 - 10:30:02 AM
Long-term archiving on: : Wednesday, May 10, 2017 - 1:52:53 PM


Files produced by the author(s)


  • HAL Id : hal-01434121, version 1



Julie Mauclair, Sylvain Meignier, yannick Estève. Speaker diarization: about whom the speaker is talking?. IEEE Speaker Odyssey 2006, 2006, San Juan Puerto Rico. ⟨hal-01434121⟩



Record views


Files downloads