Skip to Main content Skip to Navigation
Conference papers

NIST RT'05S Evaluation: Pre-processing Techniques and Speaker Diarization on Multiple Microphone Meetings

Abstract : This paper presents different pre-processing techniques, coupled with three speaker diarization systems in the framework of the NIST 2005 Spring Rich Transcription campaign (RT'05S). The pre-processing techniques aim at providing a signal quality index in order to build unique " virtual " signal obtained from all the microphone recordings available for a meeting. The unique " virtual " signal relies on a weighted sum of the different microphones while the signal quality index is given according to a signal to noise ratio. Two methods are used in this paper to compute the instantaneous signal to noise ratio: speech activity detection based approach and a noise spectrum estimate. The speaker diarization task is performed using systems developed by different labs: the LIA, LIUM and CLIPS. Among the different system submissions made by these three labs, the best system obtained 24.5 % speaker diarization error for the conference subdomain and 18.4 % for lecture subdomain.
Complete list of metadatas

Cited literature [15 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01434285
Contributor : Sylvain Meignier <>
Submitted on : Wednesday, March 22, 2017 - 3:20:38 PM
Last modification on : Thursday, February 27, 2020 - 10:44:03 AM
Document(s) archivé(s) le : Friday, June 23, 2017 - 1:30:20 PM

File

RT05.pdf
Files produced by the author(s)

Identifiers

Citation

Dan Istrate, Corinne Fredouille, Sylvain Meignier, Laurent Besacier, Jean Bonastre. NIST RT'05S Evaluation: Pre-processing Techniques and Speaker Diarization on Multiple Microphone Meetings. RT'05S Workshop, 2005, Edinburgh, United Kingdom. pp.428 - 439, ⟨10.1007/11677482_36⟩. ⟨hal-01434285⟩

Share

Metrics

Record views

446

Files downloads

267