TRECVID 2010: an overview of the goals, tasks, data, evaluation mechanisms, and metrics, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00953843
High-Level Feature Detection from Video in TRECVid: A 5-Year Retrospective of Achievements, Multimedia Content Analysis, pp.151-174, 2009. ,
DOI : 10.1007/978-0-387-76569-3_6
The First Official REPERE Evaluation, SLAM-INTERSPEECH, 2013. ,
A presentation of the REPERE challenge, 2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI), 2012. ,
DOI : 10.1109/CBMI.2012.6269851
The REPERE Corpus: a Multimodal Corpus for Person Recognition, LREC, 2012. ,
A Multimodal Approach to Speaker Diarization on TV Talk-Shows, IEEE Transactions on Multimedia, vol.15, issue.3, pp.509-520, 2013. ,
DOI : 10.1109/TMM.2012.2233724
Speaker diarization from speech transcripts, the 5th Annual Conference of the International Speech Communication Association, INTERSPEECH, 2004. ,
Speaker diarization: about whom the speaker is talking?, IEEE Odyssey 2006 - The Speaker and Language Recognition Workshop, 2006. ,
From Text Detection in Videos to Person Identification, 2012 IEEE International Conference on Multimedia and Expo, 2012. ,
DOI : 10.1109/ICME.2012.119
URL : https://hal.archives-ouvertes.fr/hal-00767383
Unsupervised naming of speakers in broadcast TV: using written names, pronounced names or both, INTERSPEECH, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00953088
Nommage non-supervisé des personnes dans les émissions de télévision: une revue du potentiel de chaque modalité, CORIA, 2013. ,
Nommage non supervisé des personnes dans les émissions de télévision. Utilisation des noms écrits, des noms prononcés ou des deux ?, Documents numériques, 2014. ,
DOI : 10.3166/dn.17.1.37-60
Automatic named identification of speakers using diarization and ASR systems, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4557-4560, 2009. ,
DOI : 10.1109/ICASSP.2009.4960644
URL : https://hal.archives-ouvertes.fr/hal-00412431
Person instance graphs for mono-, cross- and multi-modal person recognition in multimedia data: application to speaker identification in TV broadcast, IJMIR, 2014. ,
DOI : 10.1109/79.888862
URL : https://hal.archives-ouvertes.fr/hal-01690350
A comparative study using manual and automatic transcriptions for diarization, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005. ,
DOI : 10.1109/ASRU.2005.1566507
URL : https://www.lrde.epita.fr/~reda/cours/speech/speakerDiarization/1566507.pdf
Who Really Spoke When? Finding Speaker Turns and Identities in Broadcast News Audio, 2006 IEEE International Conference on Acoustics, Speech and Signal Processing Proceedings, 2006. ,
DOI : 10.1109/ICASSP.2006.1660195
URL : http://mi.eng.cam.ac.uk/reports/svr-ftp/tranter_icassp06.pdf
Extracting true speaker identities from transcriptions, INTERSPEECH, 2007. ,
Identification of speakers by name using belief functions, IPMU, 2010. ,
Name-It: naming and detecting faces in news videos, IEEE Multimedia, vol.6, issue.1, 1999. ,
DOI : 10.1109/93.752960
URL : http://www.ri.cmu.edu/pub_files/pub2/satoh_s_1999_1/satoh_s_1999_1.pdf
Named Faces: putting names to faces, IEEE Intelligent Systems, vol.14, issue.5, 1999. ,
DOI : 10.1109/5254.796089
Naming every individual in news video monologues, Proceedings of the 12th annual ACM international conference on Multimedia, MULTIMEDIA '04, 2004. ,
DOI : 10.1145/1027527.1027666
URL : http://www.cs.cmu.edu/~juny/Prof/papers/acmmm04a-jyang.pdf
Multiple instance learning for labeling faces in broadcasting news video, Proceedings of the 13th annual ACM international conference on Multimedia, MULTIMEDIA '05, 2005. ,
DOI : 10.1145/1101149.1101155
Video OCR: indexing digital news libraries by recognition of superimposed captions, ACM Multimedia Systems, 1999. ,
DOI : 10.1007/s005300050140
Naming persons in news video with label propagation, 2010 IEEE International Conference on Multimedia and Expo, 2010. ,
DOI : 10.1109/ICME.2010.5583271
Naming People in News Videos with Label Propagation, IEEE Multimedia, vol.18, issue.3, 2011. ,
DOI : 10.1109/MMUL.2011.22
Audiovisual diarization of people in video content, Multimedia Tools and Applications, vol.13, issue.4, 2012. ,
DOI : 10.1007/978-3-540-68585-2_49
Unsupervised speaker identification using overlaid texts in TV broadcast, INTERSPEECH, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00767427
Fusion of Speech, Faces and Text for Person Identification in TV Broadcast, ECCV-IFCVCR, 2012. ,
DOI : 10.1007/978-3-642-33885-4_39
URL : https://hal.archives-ouvertes.fr/hal-00722884
Towards a better integration of written names for unsupervised speakers identification in videos, SLAM-INTERSPEECH, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00953089
Unsupervised Speaker Identification in TV Broadcast Based on Written Names, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2015. ,
DOI : 10.1109/TASLP.2014.2367822
URL : https://hal.archives-ouvertes.fr/hal-01060827
Naming multimodal clusters to identify persons in TV broadcast, 2015. ,
DOI : 10.1007/s11042-015-2723-1
URL : https://hal.archives-ouvertes.fr/hal-01230628
LIMSI at MediaEval 2015: Person Discovery in Broadcast TV task, MediaEval, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01690333
Integer Linear Programming for Speaker Diarization and Cross-Modal Identification in TV Broadcast, INTERSPEECH, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00953095
Person Instance Graphs for Named Speaker Identification in TV Broadcast, Odyssey, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01690272
PERCOLI: a person identification system for the 2013 REPERE challenge, SLAM-INTERSPEECH, 2013. ,
Multimodal Understanding for Person Recognition in Video Broadcasts, INTERSPEECH, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01194244
Scene understanding for identifying persons in TV shows: Beyond face authentication, 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI), 2014. ,
DOI : 10.1109/CBMI.2014.6849829
URL : https://hal.archives-ouvertes.fr/hal-01194242
Unsupervised face identification in TV content using audio-visual sources, 2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI), 2013. ,
DOI : 10.1109/CBMI.2013.6576591
URL : https://hal.archives-ouvertes.fr/hal-00812334
Comparison of two methods for unsupervised person identification in TV shows, 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI), 2014. ,
DOI : 10.1109/CBMI.2014.6849828
URL : https://hal.archives-ouvertes.fr/hal-01433260
Speaker, Environment And Channel Change Detection And Clustering Via The Bayesian Information Criterion, DARPA Broadcast News Transcription and Understanding Workshop, 1998. ,
Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005. ,
DOI : 10.1109/CVPR.2005.177
URL : https://hal.archives-ouvertes.fr/inria-00548512
Face Recognition from Caption-Based Supervision, International Journal of Computer Vision, vol.57, issue.2, 2012. ,
DOI : 10.1145/1027527.1027689
URL : https://hal.archives-ouvertes.fr/inria-00585834
Facial Landmarks Detector Learned by the Structured Output SVM, VISAPP, 2012. ,
DOI : 10.1007/978-3-642-38241-3_26
Speech Recognition for Machine Translation in Quaero, IWSLT, 2011. ,
Models Cascade for Tree-Structured Named Entity Detection, IJCNLP, 2011. ,
EUMSSI team at the MediaEval Person Discovery challenge, MediaEval, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01433209
SSIG and IRISA at Multimodal Person Discovery, MediaEval, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01196171
PERCOLATTE: a multimodal person discovery system in TV broadcast for the MediaEval 2015 evaluation campaign, MediaEval, 2015. ,
LIG at MediaEval 2015 multimodal person discovery in broadcast TV task, MediaEval, 2015. ,
GTM-UVigo systems for person discovery task at MediaEval 2015, MediaEval, 2015. ,
Combining audio features and visual i-vector at MediaEval 2015 multimodal person discovery in broadcast TV, MediaEval, 2015. ,
UPC system for the 2015 MediaEval multimodal person discovery in broadcast TV task, MediaEval, 2015. ,
An open-source state-of-the-art toolbox for broadcast news diarization, INTERSPEECH, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-01433449
Multimodal understanding for person recognition in video broadcasts, INTERSPEECH, pp.607-611, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01194244
Robust speaker turn role labeling of TV Broadcast News shows, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.5684-5687, 2011. ,
DOI : 10.1109/ICASSP.2011.5947650
Random forests, Machine Learning, vol.45, issue.1, pp.5-32, 2001. ,
DOI : 10.1023/A:1010933404324
The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents, LREC, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01350096
Benchmarking multimedia technologies with the CAMOMILE platform: the case of Multimodal Person Discovery at MediaEval 2015, LREC, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01690277
Estimating average precision with incomplete and imperfect judgments, Proceedings of the 15th ACM international conference on Information and knowledge management, CIKM '06, 2006. ,
DOI : 10.1145/1183614.1183633
URL : http://goanna.cs.rmit.edu.au/~aht/tiger/p102-yilmaz.pdf