Segmenting TV Series into Scenes using Speaker Diarization

Philippe Ercolessi; Hervé Bredin; Christine Sénac; Philippe Joly

Communication Dans Un Congrès Année : 2011

Segmenting TV Series into Scenes using Speaker Diarization

(1) , (2) , (1) , (1)

1
2

Philippe Ercolessi

Fonction : Auteur

Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio

Hervé Bredin

Fonction : Auteur
PersonId : 15856
IdHAL : hbredin
ORCID : 0000-0002-3739-925X
IdRef : 121165779

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Christine Sénac

Fonction : Auteur
PersonId : 743648
IdHAL : christine-senac
IdRef : 200749013

Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio

Philippe Joly

Fonction : Auteur
PersonId : 1222042
IdHAL : philippe-joly
IdRef : 168856638

Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio

Résumé

In this paper, we propose a novel approach to perform scene segmentation of TV series. Using the output of our existing speaker diarization system, any temporal segment of the video can be described as a binary feature vector. A straightforward segmentation algorithm then allows to group similar contiguous speaker segments into scenes. An additional visual-only color-based segmentation is then used to refine the first segmentation. Experiments are performed on a subset of the Ally McBeal TV series and show promising results, obtained with a rule-free and generic method. For comparison purposes, test corpus annotations and description are made available to the community.

Domaines

Traitement du signal et de l'image [eess.SP]

Fichier principal

SEGMENTING TV SERIES INTO SCENES USING SPEAKER DIARIZATION.pdf (438.08 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Hervé Bredin : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01987819

Soumis le : mercredi 25 janvier 2023-14:59:30

Dernière modification le : mercredi 7 février 2024-03:34:55

Archivage à long terme le : mercredi 26 avril 2023-18:46:16

Dates et versions

hal-01987819 , version 1 (25-01-2023)

Identifiants

HAL Id : hal-01987819 , version 1

Citer

Philippe Ercolessi, Hervé Bredin, Christine Sénac, Philippe Joly. Segmenting TV Series into Scenes using Speaker Diarization. 12th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS 2011), Delft University of Technology, Apr 2011, Delft, Netherlands. pp.1-4. ⟨hal-01987819⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-TLSE2 CNRS LIMSI UT1-CAPITOLE SORBONNE-UNIVERSITE IRIT IRIT-SAMOVA LISN IRIT-SI GS-SPORT-HUMAN-MOVEMENT IRIT-CNRS TOULOUSE-INP UNIV-UT3 UT3-TOULOUSEINP

51 Consultations

15 Téléchargements

Segmenting TV Series into Scenes using Speaker Diarization

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager