VIDEO SCENE SEGMENTATION OF TV SERIES USING MULTI-MODAL NEURAL FEATURES

Aman Berhe; Camille Guinaudeau; Claude Barras

doi:10.6092/issn.2421-454X/8967

Journal article in Series. International Journal of TV Serial Narratives. Year: 2019

Aman Berhe

Role: Author
PersonId: 1208646
IdHAL: aman-berhe
ORCID: 0000-0003-3798-4675

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Camille Guinaudeau

Role: Author
PersonId: 1106315

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Claude Barras

Role: Author
PersonId: 17217
IdHAL: claude-barras
IdRef: 165065583

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Abstract

Scene segmentation of a video, a book, or a TV series organizes it into logical story units (LSUs) and is an essential step for representing, extracting, and understanding its narrative structure. We propose an automatic scene segmentation method for TV series that groups adjacent shots using a combination of multimodal neural features: visual and textual features, further augmented with temporal information, which may improve the clustering of adjacent shots. The reported experiments compare different feature combinations, video frame sub-sampling, and various shot clustering algorithms. The proposed method achieved good results under several evaluation metrics when tested on several seasons of two popular TV series.
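The idea of grouping adjacent shots from fused multimodal features can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`fuse_features`, `segment_scenes`, `scene_boundaries`), the `time_weight` parameter, the greedy adjacent-merging strategy, and the toy vectors standing in for neural visual and textual features are all assumptions made for illustration. The abstract does not specify which clustering algorithms or feature extractors were used.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so both modalities weigh equally."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def fuse_features(visual, textual, time_weight=1.0):
    """Concatenate per-shot visual and textual vectors and append a
    scaled temporal position in [0, 1] (hypothetical fusion scheme)."""
    n = len(visual)
    fused = []
    for i, (v, t) in enumerate(zip(visual, textual)):
        position = i / (n - 1) if n > 1 else 0.0
        fused.append(l2_normalize(v) + l2_normalize(t) + [time_weight * position])
    return fused


def mean_vector(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def segment_scenes(features, n_scenes):
    """Greedy adjacent merging: start with one segment per shot and
    repeatedly merge the two neighbouring segments whose centroids are
    closest, until n_scenes segments remain."""
    segments = [[i] for i in range(len(features))]
    while len(segments) > n_scenes:
        centroids = [mean_vector([features[i] for i in seg]) for seg in segments]
        dists = [euclidean(centroids[j], centroids[j + 1])
                 for j in range(len(segments) - 1)]
        j = dists.index(min(dists))
        segments[j:j + 2] = [segments[j] + segments[j + 1]]
    return segments


def scene_boundaries(segments):
    """Index of the first shot of every scene except the first one."""
    return [seg[0] for seg in segments[1:]]


# Toy data (assumed): six shots, two visually and textually distinct groups.
visual = [[1, 0], [0.9, 0.1], [1, 0.05], [0, 1], [0.1, 0.9], [0.05, 1]]
textual = [[1, 0], [1, 0.1], [0.9, 0], [0, 1], [0.1, 1], [0, 0.9]]

scenes = segment_scenes(fuse_features(visual, textual, time_weight=0.5), 2)
print(scenes)                    # [[0, 1, 2], [3, 4, 5]]
print(scene_boundaries(scenes))  # [3]
```

Because only neighbouring segments are ever merged, every resulting scene is a contiguous run of shots. The appended temporal position plays a secondary role here; in an unconstrained clustering algorithm it would be what discourages grouping shots that are far apart in time.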

Keywords

Scene segmentation; TV series; Neural features; Multimodal fusion; Unsupervised

Subject areas

Computer Science [cs]

Main file

VIDEO SCENE SEGMENTATION OF TV SERIES USING MULTI-MODAL NEURAL FEATURES.pdf (502.44 KB)

Origin: Publisher files allowed on an open archive

https://hal.science/hal-03298901

Submitted on: Sunday, July 25, 2021, 08:23:59

Last modified on: Saturday, October 7, 2023, 21:36:20

Long-term archiving on: Tuesday, October 26, 2021, 18:02:14

Dates and versions

hal-03298901, version 1 (25-07-2021)

Identifiers

HAL Id: hal-03298901, version 1
DOI: 10.6092/issn.2421-454X/8967

Cite

Aman Berhe, Camille Guinaudeau, Claude Barras. VIDEO SCENE SEGMENTATION OF TV SERIES USING MULTI-MODAL NEURAL FEATURES. Series. International Journal of TV Serial Narratives, 2019, 5 (1), ⟨10.6092/issn.2421-454X/8967⟩. ⟨hal-03298901⟩

Collections

CNRS; LIMSI; UNIV-PARIS-SACLAY; SORBONNE-UNIVERSITE; LISN; GS-ENGINEERING; GS-COMPUTER-SCIENCE; LISN-TLP

87 views

156 downloads
