Multimodal Understanding for Person Recognition in Video Broadcasts, INTERSPEECH, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01194244
Unsupervised face identification in TV content using audio-visual sources, 2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI), 2013. ,
DOI : 10.1109/CBMI.2013.6576591
URL : https://hal.archives-ouvertes.fr/hal-00812334
The First Official REPERE Evaluation, SLAM-INTERSPEECH, 2013. ,
Person Instance Graphs for Named Speaker Identification in TV Broadcast, 2014. ,
Integer Linear Programming for Speaker Diarization and Cross-Modal Identification in TV Broadcast, INTERSPEECH, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00953095
QCompere at REPERE 2013, SLAM-INTERSPEECH, 2013. ,
Person instance graphs for mono-, cross- and multi-modal person recognition in multimedia data: application to speaker identification in TV broadcast, IJMIR, 2014. ,
DOI : 10.1109/79.888862
URL : https://hal.archives-ouvertes.fr/hal-01690350
A comparative study using manual and automatic transcriptions for diarization, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005., 2005. ,
DOI : 10.1109/ASRU.2005.1566507
URL : https://www.lrde.epita.fr/~reda/cours/speech/speakerDiarization/1566507.pdf
Speaker diarization from speech transcripts, INTERSPEECH, 2004. ,
Speaker, Environment And Channel Change Detection And Clustering Via The Bayesian Information Criterion, In DARPA Broadcast News Trans. and Under, 1998. ,
Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005. ,
DOI : 10.1109/CVPR.2005.177
URL : https://hal.archives-ouvertes.fr/inria-00548512
Models Cascade for Tree-Structured Named Entity Detection, IJCNLP, 2011. ,
Extracting true speaker identities from transcriptions, INTERSPEECH, 2007. ,
PERCOLI: a person identification system for the 2013 REPERE challenge, SLAM-INTERSPEECH, 2013. ,
Comparison of two methods for unsupervised person identification in TV shows, 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI), 2014. ,
DOI : 10.1109/CBMI.2014.6849828
URL : https://hal.archives-ouvertes.fr/hal-01433260
The REPERE Corpus : a Multimodal Corpus for Person Recognition, LREC, 2012. ,
Face Recognition from Caption-Based Supervision, International Journal of Computer Vision, vol.57, issue.2, p.2012 ,
DOI : 10.1145/1027527.1027689
URL : https://hal.archives-ouvertes.fr/inria-00585834
Named Faces: putting names to faces, IEEE Intelligent Systems, vol.14, issue.5, 1999. ,
DOI : 10.1109/5254.796089
Automatic named identification of speakers using diarization and ASR systems, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, 2009. ,
DOI : 10.1109/ICASSP.2009.4960644
URL : https://hal.archives-ouvertes.fr/hal-00412431
A presentation of the REPERE challenge, 2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI), 2012. ,
DOI : 10.1109/CBMI.2012.6269851
Speaker Diarization: About whom the Speaker is Talking ?, 2006 IEEE Odyssey, The Speaker and Language Recognition Workshop, 2006. ,
DOI : 10.1109/ODYSSEY.2006.248114
URL : https://hal.archives-ouvertes.fr/hal-01434121
Unsupervised Speaker Identification in TV Broadcast Based on Written Names, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, issue.1, p.2015 ,
DOI : 10.1109/TASLP.2014.2367822
URL : https://hal.archives-ouvertes.fr/hal-01060827
From Text Detection in Videos to Person Identification, 2012 IEEE International Conference on Multimedia and Expo, 2012. ,
DOI : 10.1109/ICME.2012.119
URL : https://hal.archives-ouvertes.fr/hal-00767383
Towards a better integration of written names for unsupervised speakers identification in videos, SLAM-INTERSPEECH, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00953089
Unsupervised speaker identification using overlaid texts in TV broadcast, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00767427
Naming multi-modal clusters to identify persons in TV broadcast, Multimedia Tools and Applications, vol.6, issue.3, p.2015 ,
DOI : 10.1145/1101149.1101155
URL : https://hal.archives-ouvertes.fr/hal-01230628
Scene understanding for identifying persons in TV shows: Beyond face authentication, 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI), 2014. ,
DOI : 10.1109/CBMI.2014.6849829
URL : https://hal.archives-ouvertes.fr/hal-01194242
Name-It: naming and detecting faces in news videos, IEEE Multimedia, vol.6, issue.1, 1999. ,
DOI : 10.1109/93.752960
URL : http://www.ri.cmu.edu/pub_files/pub2/satoh_s_1999_1/satoh_s_1999_1.pdf
Facial Landmarks Detector Learned by the Structured Output SVM, VISAPP, 2012. ,
DOI : 10.1007/978-3-642-38241-3_26
Naming every individual in news video monologues, Proceedings of the 12th annual ACM international conference on Multimedia , MULTIMEDIA '04, 2004. ,
DOI : 10.1145/1027527.1027666
URL : http://www.cs.cmu.edu/~juny/Prof/papers/acmmm04a-jyang.pdf
Multiple instance learning for labeling faces in broadcasting news video, Proceedings of the 13th annual ACM international conference on Multimedia , MULTIMEDIA '05, 2005. ,
DOI : 10.1145/1101149.1101155