C. Barras, X. Zhu, S. Meignier, and J. Gauvain, Multi-stage speaker diarization of broadcast news, ASLP, 2006.
URL : https://hal.archives-ouvertes.fr/hal-01434241

L. Canseco, L. Lamel, and J. Gauvain, A comparative study using manual and automatic transcriptions for diarization, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005., 2005.
DOI : 10.1109/ASRU.2005.1566507

M. Charhad, D. Moraru, S. Ayache, and G. Quénot, Speaker identity indexing in audio-visual documents, 2005.
URL : https://hal.archives-ouvertes.fr/hal-00953917

J. Gauvain, L. Lamel, and G. Adda, The LIMSI Broadcast News transcription system, Speech Communication, vol.37, issue.1-2, 2002.
DOI : 10.1016/S0167-6393(01)00061-9

URL : https://hal.archives-ouvertes.fr/hal-01434493

S. E. Tranter, Who Really Spoke When? Finding Speaker Turns and Identities in Broadcast News Audio, 2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings, 2006.
DOI : 10.1109/ICASSP.2006.1660195

C. Ma, P. Nguyen, and M. Milind, Finding Speaker Identities with a Conditional Maximum Entropy Model, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, 2007.
DOI : 10.1109/ICASSP.2007.366899

J. Mauclair, S. Meignier, and Y. Estève, Speaker Diarization: About whom the Speaker is Talking ?, 2006 IEEE Odyssey, The Speaker and Language Recognition Workshop, 2006.
DOI : 10.1109/ODYSSEY.2006.248114

URL : https://hal.archives-ouvertes.fr/hal-01434121

Y. Estève, S. Meignier, P. Deléglise, and J. Mauclair, Extracting true speaker identities from transcriptions. INTERSPEECH, 2007.

V. Jousse, S. Petitrenaud, S. Meignier, Y. Estève, and C. Jacquin, Automatic named identification of speakers using diarization and ASR systems, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, 2009.
DOI : 10.1109/ICASSP.2009.4960644

URL : https://hal.archives-ouvertes.fr/hal-00412431

S. Petitrenaud, V. Jousse, S. Meignier, and Y. Estève, Identification of Speakers by Name Using Belief Functions, IPMU, vol.66, issue.5, 2010.
DOI : 10.1016/0004-3702(94)90026-4

E. El-khoury, A. Laurent, S. Meignier, and S. Petitrenaud, Combining transcription-based and acoustic-based speaker identifications for broadcast news, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012.
DOI : 10.1109/ICASSP.2012.6288889

F. Bechet, B. Favre, and G. Damnati, Detecting person presence in TV shows with linguistic and structural features, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p.2012
DOI : 10.1109/ICASSP.2012.6289062

URL : https://hal.archives-ouvertes.fr/hal-01194256

S. Satoh, Y. Nakamura, and T. Kanade, Name-It: naming and detecting faces in news videos, IEEE Multimedia, vol.6, issue.1, 1999.
DOI : 10.1109/93.752960

. R. Houghton, Named Faces: putting names to faces, IEEE Intelligent Systems, vol.14, issue.5, 1999.
DOI : 10.1109/5254.796089

J. Yang and A. G. Hauptmann, Naming every individual in news video monologues, Proceedings of the 12th annual ACM international conference on Multimedia , MULTIMEDIA '04, 2004.
DOI : 10.1145/1027527.1027666

T. Sato, T. Kanade, T. K. Hughes, M. A. Smith, and S. Satoh, Video OCR: indexing digital news libraries by recognition of superimposed captions, Multimedia Systems, vol.7, issue.5, 1999.
DOI : 10.1007/s005300050140

J. Poignant, L. Besacier, G. Quénot, and F. Thollard, From Text Detection in Videos to Person Identification, 2012 IEEE International Conference on Multimedia and Expo, p.2012
DOI : 10.1109/ICME.2012.119

URL : https://hal.archives-ouvertes.fr/hal-00767383

H. Bredin, J. Poignant, M. Tapaswi, G. Fortier, V. B. Le et al., Fusion of Speech, Faces and Text for Person Identification in TV Broadcast, p.2012
DOI : 10.1007/978-3-642-33885-4_39

URL : https://hal.archives-ouvertes.fr/hal-00722884

J. Poignant, H. Bredin, V. B. Le, L. Besacier, C. Barras et al., Unsupervised speaker identification using overlaid texts in tv broadcast, p.2012
URL : https://hal.archives-ouvertes.fr/hal-00767427

J. Poignant, H. Bredin, L. Besacier, G. Quénot, and C. Barras, Towards a better integration of written names for unsupervised speakers identification in videos, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00953089

H. Bredin and J. Poignant, Integer linear programming for speaker diarization and cross-modal identification in tv broadcast, p.2013
URL : https://hal.archives-ouvertes.fr/hal-00953095

N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, Front-End Factor Analysis for Speaker Verification, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.4, 2011.
DOI : 10.1109/TASL.2010.2064307

J. Poignant, L. Besacier, and G. Quénot, Nommage non-supervisé des personnes dans lesémissionsles´lesémissions de télévision : une revue du potentiel de chaque modalité, 2013.

J. Poignant, L. Besacier, V. B. Le, S. Rosset, and G. Quénot, Unsupervised naming of speakers in broadcast TV: using written names, pronounced names or both ?, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00953088

J. Kahn, O. Galibert, L. Quintard, M. Carré, A. Giraudel et al., A presentation of the REPERE challenge, 2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI), p.2012
DOI : 10.1109/CBMI.2012.6269851

A. Giraudel, M. Carré, V. Mapelli, J. Kahn, O. Galibert et al., The REPERE corpus: a multi-modal corpus for person recognition, p.2012

S. S. Chen, P. S. Gopalakrishnan, and . Speaker, environment and channel change detection and clustering via the bayesian information criterion, DARPA Broadcast News Transcription and Understanding Workshop, 1998.

C. Barras, X. Zhu, S. Meignier, and J. Gauvain, Multi-stage speaker diarization of broadcast news, ASLP, 2006.
URL : https://hal.archives-ouvertes.fr/hal-01434241

M. Rouvier and S. Meignier, A Global Optimization Framework For Speaker Diarization, 2012.
URL : https://hal.archives-ouvertes.fr/hal-01433467

J. Gauvain, L. Lamel, and G. Adda, Partitioning and transcription of broadcast news data, ICSLP, 1998.

M. Dinarelli and S. Rosset, Models Cascade for Tree-Structured Named Entity Detection, IJCNLP, 2011.

A. Allauzen and H. Bonneau-maynard, Training and evaluation of pos taggers on the French multitag corpus, LREC, 2008.

H. W. Kuhn, The hungarian method for the assignment problem, Naval Research Logistics Quarterly, 1955.

W. M. Campbell, D. E. Sturim, and D. A. Reynolds, Support Vector Machines Asing GMM Supervectors for Speaker Verification, Signal Processing Letters, 2006.

V. B. Le, C. Barras, and M. Ferràs, On the use of gsv-svm for speaker diarization and tracking, 2010.