Benefits of prior acoustic segmentation for automatic speaker segmentation

Abstract : The paper investigates the interest of segmentation in acoustic macro classes (like gender or bandwidth) as front-end processing for the segmentation/diarization task. The impact of this prior acoustic segmentation is evaluated in terms of speaker diarization performance in the particular context of NIST RT'03 evaluation (done on the HUB4 broadcast news corpora). It is rarely discussed in the literature, but our work shows that the application of prior acoustic segmentation, in a similar way to the automatic speech recognition task, may be very useful to the speaker segmentation task. Experiments were conducted using two different kinds of speaker segmentation systems developed individually by the LIA and CLIPS laboratories in the framework of the ELISA consortium. For both systems, improvement was observed when combined with prior acoustic segmentation. However, a larger impact, in terms of performance, is observed on the LIA system based on an ascending/HMM approach compared to the CLIPS system based on speaker turn detection.
Document type :
Conference papers
Complete list of metadatas

Cited literature [3 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01434305
Contributor : Sylvain Meignier <>
Submitted on : Wednesday, March 22, 2017 - 2:47:47 PM
Last modification on : Monday, July 8, 2019 - 3:10:54 PM
Long-term archiving on : Friday, June 23, 2017 - 1:23:33 PM

File

530-ICASSP2004Revised_Fredouil...
Files produced by the author(s)

Identifiers

Citation

Sylvain Meignier, Daniel Moraru, Corinne Fredouille, Jean-François Bonastre, Laurent Besacier. Benefits of prior acoustic segmentation for automatic speaker segmentation. International Conference on Acoustics Speech and Signal Processing (ICASSP 2004), May 2004, Montreal, Canada. pp.397-400, ⟨10.1109/ICASSP.2004.1326006⟩. ⟨hal-01434305⟩

Share

Metrics

Record views

254

Files downloads

102