
Segmenting TV Series into Scenes using Speaker Diarization

Abstract : In this paper, we propose a novel approach to scene segmentation of TV series. Using the output of our existing speaker diarization system, any temporal segment of the video can be described as a binary feature vector. A straightforward segmentation algorithm then groups similar contiguous speaker segments into scenes. An additional visual-only, color-based segmentation is used to refine this first segmentation. Experiments performed on a subset of the Ally McBeal TV series show promising results, obtained with a rule-free and generic method. For comparison purposes, the test corpus annotations and description are made available to the community.
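
The grouping idea in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the `(start, end, speaker)` turn format, the fixed-length windows, the Jaccard similarity, the `window` and `threshold` parameters, and all function names. The sketch describes each window by a binary vector of active speakers and merges contiguous windows whose vectors are similar enough. The color-based refinement step is not covered.

```python
from typing import List, Tuple

Turn = Tuple[float, float, str]  # hypothetical format: (start, end, speaker label)


def speaker_vector(turns: List[Turn], speakers: List[str],
                   start: float, end: float) -> List[int]:
    """Binary vector: 1 if the speaker talks at some point within [start, end)."""
    active = {spk for (s, e, spk) in turns if s < end and e > start}
    return [1 if spk in active else 0 for spk in speakers]


def jaccard(a: List[int], b: List[int]) -> float:
    """Jaccard similarity between two binary vectors (assumed measure)."""
    inter = sum(x & y for x, y in zip(a, b))
    union = sum(x | y for x, y in zip(a, b))
    return inter / union if union else 1.0


def segment_scenes(turns: List[Turn], window: float = 30.0,
                   threshold: float = 0.5) -> List[Tuple[float, float]]:
    """Greedily merge contiguous windows that share enough speakers."""
    if not turns:
        return []
    speakers = sorted({spk for _, _, spk in turns})
    duration = max(e for _, e, _ in turns)

    # Cut the timeline into fixed-length windows (an assumption of this sketch).
    bounds = []
    t = 0.0
    while t < duration:
        bounds.append((t, min(t + window, duration)))
        t += window

    scenes = []
    cur_start, cur_end = bounds[0]
    cur_vec = speaker_vector(turns, speakers, cur_start, cur_end)
    for s, e in bounds[1:]:
        vec = speaker_vector(turns, speakers, s, e)
        if jaccard(cur_vec, vec) >= threshold:
            # Similar enough: extend the current scene and pool its speakers.
            cur_end = e
            cur_vec = [x | y for x, y in zip(cur_vec, vec)]
        else:
            # Speaker set changed: close the scene and start a new one.
            scenes.append((cur_start, cur_end))
            cur_start, cur_end, cur_vec = s, e, vec
    scenes.append((cur_start, cur_end))
    return scenes
```

For example, with speakers A and B alternating in 10-second turns from 0 to 40 s, then C and D alternating from 40 to 80 s, calling `segment_scenes(turns, window=20.0, threshold=0.5)` yields two scenes, one covering 0 to 40 s and one covering 40 to 80 s.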
Document type : Conference papers
Contributor : Hervé Bredin
Submitted on : Monday, January 21, 2019 - 12:36:46 PM
Last modification on : Tuesday, June 14, 2022 - 10:39:41 AM


  • HAL Id : hal-01987819, version 1


Philippe Ercolessi, Hervé Bredin, Christine Sénac, Philippe Joly. Segmenting TV Series into Scenes using Speaker Diarization. WIAMIS 2011, 12th International Workshop on Image Analysis for Multimedia Interactive Services, 2011, Delft, Netherlands. ⟨hal-01987819⟩


