The NIST 2004 spring rich transcription evaluation : two-axis merging strategy in the context of multiple distance microphone based meeting speaker segmentation

Abstract : This paper presents the ELISA speaker segmentation approach applied on multiple audio channel meeting recordings in the framework of NIST RT’04s meeting (spring) evaluation campaign. As done for BN data speaker segmentation, the ELISA “meeting” system involves two speaker segmentation systems developed individually by the CLIPS and LIA laboratories. The main originality consists in a “two-axis” merging strategy, proposed to deal with both multiple expert segmentation outputs and multiple microphone segmentation outputs. While expert merging strategy did not really lead to an improvement of the performance, the individual microphone segmentation merging strategy allowed to provide a global segmentation output from several audio channels (microphones) with acceptable performance. The best system obtained 22.6% of diarization error rate during the NIST RT’04s meeting evaluation.
Document type :
Conference papers
Complete list of metadatas

Cited literature [4 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01434304
Contributor : Sylvain Meignier <>
Submitted on : Wednesday, March 22, 2017 - 2:43:37 PM
Last modification on : Monday, July 8, 2019 - 3:10:54 PM
Long-term archiving on : Friday, June 23, 2017 - 1:19:34 PM

File

10.1.1.147.1115.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01434304, version 1

Citation

Corinne Fredouille, Daniel Moraru, Sylvain Meignier, Laurent Besacier, Jean-François Bonastre. The NIST 2004 spring rich transcription evaluation : two-axis merging strategy in the context of multiple distance microphone based meeting speaker segmentation. RT2004 Spring Meeting Recognition Workshop, May 2004, Montréal, Canada. ⟨hal-01434304⟩

Share

Metrics

Record views

608

Files downloads

75