From Text Detection in Videos to Person Identification

Johann Poignant 1 Laurent Besacier 1 Georges Quénot 2, * Franck Thollard 2
* Auteur correspondant
2 MRIM - Modélisation et Recherche d’Information Multimédia [Grenoble]
LIG - Laboratoire d'Informatique de Grenoble, Inria - Institut National de Recherche en Informatique et en Automatique
Abstract : We present in this article a video OCR system that detects and recognizes overlaid texts in video as well as its application to person identification in video documents. We proceed in several steps. First, text detection and temporal tracking are performed. After adaptation of images to a standard OCR system, a final post-processing combines multiple transcriptions of the same text box. The semi-supervised adaptation of this system to a particular video type (video broadcast from a French TV) is proposed and evaluated. The system is efficient as it runs 3 times faster than real time (including the OCR step) on a desktop Linux box. Both text detection and recognition are evaluated individually and through a person recognition task where it is shown that the combination of OCR and audio (speaker) information can greatly improve the performances of a state of the art audio based person identification system.
Type de document :
Communication dans un congrès
Lisa O'Conner. ICME 2012 - International Conference on Multimedia and Expo, Jul 2012, Melbourne, VIC, Australia. Conference Publishing Services (CPS), pp.854-859, 2012, 〈10.1109/ICME.2012.119〉
Liste complète des métadonnées

Littérature citée [19 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-00767383
Contributeur : Georges Quénot <>
Soumis le : mercredi 19 décembre 2012 - 17:27:16
Dernière modification le : jeudi 11 octobre 2018 - 08:48:04
Document(s) archivé(s) le : mercredi 20 mars 2013 - 11:34:16

Fichier

Poignant-Besacier-Quenot-Tholl...
Fichiers produits par l'(les) auteur(s)

Identifiants

Citation

Johann Poignant, Laurent Besacier, Georges Quénot, Franck Thollard. From Text Detection in Videos to Person Identification. Lisa O'Conner. ICME 2012 - International Conference on Multimedia and Expo, Jul 2012, Melbourne, VIC, Australia. Conference Publishing Services (CPS), pp.854-859, 2012, 〈10.1109/ICME.2012.119〉. 〈hal-00767383〉

Partager

Métriques

Consultations de la notice

282

Téléchargements de fichiers

295