Skip to Main content Skip to Navigation
Conference papers

A conditional random field approach for face identification in broadcast news using overlaid text

Abstract : We investigate the problem of face identification in broadcast programs where people names are obtained from text overlays automatically processed with Optical Character Recognition (OCR) and further linked to the faces throughout the video. To solve the face-name association and propagation, we propose a novel approach that combines the positive effects of two Conditional Random Field (CRF) models: a CRF for person diarization (joint temporal segmentation and association of voices and faces) that benefit from the combination of multiple cues including as main contributions the use of identification sources (OCR appearances) and recurrent local face visual background (LFB) playing the role of a namedness feature; a second CRF for the joint identification of the person clusters that improves identification performance thanks to the use of further diarization statistics. Experiments conducted on a recent and substantial public dataset of 7 different shows demonstrate the interest and complementarity of the different modeling steps and information sources, leading to state of the art results.
Document type :
Conference papers
Complete list of metadata

Cited literature [23 references]  Display  Hide  Download
Contributor : sylvain meignier Connect in order to contact the contributor
Submitted on : Saturday, April 1, 2017 - 12:36:34 AM
Last modification on : Tuesday, December 8, 2020 - 9:44:14 AM
Long-term archiving on: : Sunday, July 2, 2017 - 12:17:29 PM


Files produced by the author(s)




Paul Gay, Elie Khoury, Sylvain Meignier, Jean-Marc Odobez, Paul Deléglise. A conditional random field approach for face identification in broadcast news using overlaid text. IEEE International Conference on Image Processing 2014, 2014, Paris, France. pp.318 - 322, ⟨10.1109/ICIP.2014.7025063⟩. ⟨hal-01433221⟩



Record views


Files downloads