Fine-granularity semantic video annotation: An approach based on automatic shot level concept detection and object recognition
Journal article. International Journal of Pervasive Computing and Communications. Year: 2013

Abstract

Fine-grained video content indexing, retrieval, and adaptation require accurate metadata describing the video structure and semantics down to the finest granularity, i.e., the object level. The authors address these requirements by proposing the Semantic Video Content Annotation Tool (SVCAT) for structural and high-level semantic video annotation. SVCAT is a semi-automatic, MPEG-7-compliant annotation tool that produces metadata according to a new object-based video content model introduced in this work. Videos are temporally segmented into shots, and shot-level concepts are detected automatically using ImageNet as background knowledge. These concepts serve as a guide for locating and selecting objects of interest, which are then tracked automatically to generate object-level metadata. Integrating shot-based concept detection with object localization and tracking greatly eases the annotator's task. The paper aims to discuss these issues.
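
The hierarchy described in the abstract (video, shots, shot-level concepts, tracked objects) can be pictured with a small data-structure sketch. This is only an illustration under stated assumptions: the class names, fields, and the `objects_at` helper are hypothetical and are not taken from SVCAT or from the MPEG-7 schema.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class BoundingBox:
    """Position of an object in a single frame (hypothetical layout)."""
    frame: int
    x: int
    y: int
    width: int
    height: int


@dataclass
class TrackedObject:
    """An object of interest with one bounding box per tracked frame."""
    label: str
    boxes: List[BoundingBox] = field(default_factory=list)


@dataclass
class Shot:
    """A temporal segment with shot-level concepts and object-level metadata."""
    start_frame: int
    end_frame: int  # inclusive
    concepts: List[str] = field(default_factory=list)
    objects: List[TrackedObject] = field(default_factory=list)


@dataclass
class Video:
    uri: str
    shots: List[Shot] = field(default_factory=list)

    def objects_at(self, frame: int) -> List[str]:
        """Return labels of objects that have a bounding box at `frame`."""
        return [
            obj.label
            for shot in self.shots
            if shot.start_frame <= frame <= shot.end_frame
            for obj in shot.objects
            if any(box.frame == frame for box in obj.boxes)
        ]


# Illustrative data only: one shot with one concept and one tracked object.
dog = TrackedObject("dog", [BoundingBox(10, 5, 5, 40, 30),
                            BoundingBox(11, 6, 5, 40, 30)])
video = Video("example.mp4", [Shot(0, 99, concepts=["dog"], objects=[dog])])
```

In this sketch, shot-level concepts are plain strings attached to each shot, and object-level metadata is the list of per-frame bounding boxes, so a query such as `video.objects_at(10)` walks from the shot down to the object level.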
No file deposited

Dates and versions

hal-01339265, version 1 (29-06-2016)

Cite

Vanessa El Khoury, Martin Jergler, Getnet Abebe Bayou, David Coquil, Harald Kosch. Fine-granularity semantic video annotation: An approach based on automatic shot level concept detection and object recognition. International Journal of Pervasive Computing and Communications, 2013, 9 (3), pp. 243-269. ⟨10.1108/IJPCC-07-2013-0019⟩. ⟨hal-01339265⟩