A benchmarking campaign for the multimodal detection of violent scenes in movies

Claire-Hélène Demarty 1 Cédric Penet 1, 2 Guillaume Gravier 2 Mohammad Soleymani 3
2 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : We present an international benchmark on the detection of violent scenes in movies, implemented as a part of the multimedia benchmarking initiative MediaEval 2011. The task consists in detecting portions of movies where physical violence is present from the automatic analysis of the video, sound and subtitle tracks. A dataset of 15 Hollywood movies was carefully annotated and divided into a development set and a test set containing 3 movies. Annotation strategies and resolution of borderline cases are discussed at length in the paper. Results from 29 runs submitted by the 6 participating sites are analyzed. The first year's results are promising, but considering the use case, there is still a large room for improvement. The detailed analysis of the 2011 benchmark brings valuable insight for the implementation of future evaluation on violent scenes detection in movies.
Document type :
Conference papers
Complete list of metadatas

Cited literature [15 references]  Display  Hide  Download

Contributor : Guillaume Gravier <>
Submitted on : Wednesday, December 19, 2012 - 1:40:37 PM
Last modification on : Friday, November 16, 2018 - 1:23:16 AM
Long-term archiving on : Wednesday, March 20, 2013 - 11:36:12 AM


Files produced by the author(s)


  • HAL Id : hal-00767036, version 1


Claire-Hélène Demarty, Cédric Penet, Guillaume Gravier, Mohammad Soleymani. A benchmarking campaign for the multimodal detection of violent scenes in movies. European Conference on Computer Vision, Workshop on Information Fusion in Computer Vision for Concept Recognition, 2012, Italy. ⟨hal-00767036⟩



Record views


Files downloads