Skip to Main content Skip to Navigation
Conference papers

Indexation en locuteur : utilisation d'informations lexicales

Abstract : The automatic speaker indexing consists in splitting the signal into homogeneous segments and cluster- ing them by speakers. However the speaker segments are speci ed with anonymous labels. This paper pro- pose to identify those speakers by extracting their full names pronounced in the show. With a semantic clas- si cation tree, the full names detected in the segment transcription are associated to this segment or to one of its neighbors. Then, a merging method associates a full name to a speaker cluster instead of the anony- mous label. The experiments are carried out over French broadcast news from the ESTER 2005 evalua- tion campaign. About 70% show duration is correctly processed for evaluation corpus.
Complete list of metadatas

Cited literature [6 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01434240
Contributor : Sylvain Meignier <>
Submitted on : Wednesday, March 22, 2017 - 3:28:28 PM
Last modification on : Thursday, April 6, 2017 - 10:12:32 AM
Document(s) archivé(s) le : Friday, June 23, 2017 - 1:52:39 PM

File

final-108.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01434240, version 1

Collections

Citation

Julie Mauclair, Sylvain Meignier, Yannick Estève. Indexation en locuteur : utilisation d'informations lexicales. Les Journées d'Étude sur la Parole (JEP) 2006, 2006, Dinard, France. pp.5. ⟨hal-01434240⟩

Share

Metrics

Record views

128

Files downloads

46