L. Canseco, L. Lamel, and J. Gauvain, A comparative study using manual and automatic transcriptions for diarization, IEEE Workshop on Automatic Speech Recognition and Understanding, pp.415-419, 2005.
DOI : 10.1109/ASRU.2005.1566507

URL : https://www.lrde.epita.fr/~reda/cours/speech/speakerDiarization/1566507.pdf

S. E. Tranter, Who Really Spoke When? Finding Speaker Turns and Identities in Broadcast News Audio, 2006 IEEE International Conference on Acoustics, Speech and Signal Processing Proceedings, pp.1013-1016, 2006.
DOI : 10.1109/ICASSP.2006.1660195

URL : http://mi.eng.cam.ac.uk/reports/svr-ftp/tranter_icassp06.pdf

J. Mauclair, S. Meignier, and Y. Estève, Speaker Diarization: About whom the Speaker is Talking?, 2006 IEEE Odyssey, The Speaker and Language Recognition Workshop, 2006.
DOI : 10.1109/ODYSSEY.2006.248114

URL : https://hal.archives-ouvertes.fr/hal-01434121

Y. Estève, S. Meignier, P. Deléglise, and J. Mauclair, Extracting true speaker identities from transcriptions, Proceedings of the International Speech Communication Association, pp.2601-2604, 2007.

V. Jousse, S. Petitrenaud, S. Meignier, Y. Estève, and C. Jacquin, Automatic named identification of speakers using diarization and ASR systems, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, 2009.
DOI : 10.1109/ICASSP.2009.4960644

URL : https://hal.archives-ouvertes.fr/hal-00412431

H. Bredin and J. Poignant, Integer Linear Programming for Speaker Diarization and Cross-Modal Identification in TV Broadcast, Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00953095

E. El-khoury, A. Laurent, S. Meignier, and S. Petitrenaud, Combining transcription-based and acoustic-based speaker identifications for broadcast news, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4377-4380, 2012.
DOI : 10.1109/ICASSP.2012.6288889

L. Lamel, S. Courcinous, J. Despres, J. Gauvain, Y. Josse et al., Speech Recognition for Machine Translation in Quaero, International Workshop on Spoken Language Translation, 2011.

J. Gauvain, L. Lamel, and G. Adda, Partitioning and Transcription of Broadcast News Data, Proceedings of International Conference on Spoken Language Processing, pp.1335-1338, 1998.

T. Lavergne, O. Cappé, and F. Yvon, Practical Very Large Scale CRFs, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp.504-513, 2010.

M. Dinarelli and S. Rosset, Models Cascade for Tree-Structured Named Entity Detection, Proceedings of 5th International Joint Conference on Natural Language Processing Asian Federation of Natural Language Processing, pp.1269-1278, 2011.

S. S. Chen and P. S. Gopalakrishnan, Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion, DARPA Broadcast News Transcription and Understanding Workshop, 1998.

N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, Front-End Factor Analysis for Speaker Verification, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.4, pp.788-798, 2011.
DOI : 10.1109/TASL.2010.2064307

URL : http://www.sls.lcs.mit.edu/sls/publications/2011/Dehak_IEEE_May2011.pdf

S. J. D. Prince, Computer Vision: Models, Learning, and Inference, 2012.

M. Senoussaoui, P. Kenny, N. Brümmer, E. de Villiers, and P. Dumouchel, Mixture of PLDA Models in I-Vector Space for Gender-Independent Speaker Recognition, Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011.

J. R. Finkel and C. D. Manning, Enforcing Transitivity in Coreference Resolution, Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2008.

Gurobi Optimization, Inc., Gurobi Optimizer Reference Manual

H. Bredin, A. Roy, V. Le, and C. Barras, Person instance graphs for mono-, cross- and multi-modal person recognition in multimedia data: application to speaker identification in TV broadcast, International Journal of Multimedia Information Retrieval, vol.17, issue.6, 2014.
DOI : 10.1109/79.888862

URL : https://hal.archives-ouvertes.fr/hal-01690350

A. Giraudel, M. Carré, V. Mapelli, J. Kahn, O. Galibert et al., The REPERE Corpus: a Multimodal Corpus for Person Recognition, International Conference on Language Resources and Evaluation, 2012.

J. Bergstra and Y. Bengio, Random Search for Hyper-Parameter Optimization, Journal of Machine Learning Research, vol.13, pp.281-305, 2012.

H. Hermansky, Perceptual linear predictive (PLP) analysis of speech, The Journal of the Acoustical Society of America, vol.87, issue.4, pp.1738-1752, 1990.
DOI : 10.1121/1.399423

J. Pelecanos and S. Sridharan, Feature Warping for Robust Speaker Verification, Proceedings of Odyssey 2001 - The Speaker Recognition Workshop, pp.213-218, 2001.

G. Gravier, G. Adda, N. Paulson, M. Carré, A. Giraudel et al., The ETAPE Corpus for the Evaluation of Speech-based TV Content processing in the French language, International Conference on Language Resources, Evaluation and Corpora, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00712591

P. Bousquet, D. Matrouf, and J. Bonastre, Intersession Compensation and Scoring Methods in the i-vectors Space for Speaker Recognition, Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011.
URL : https://hal.archives-ouvertes.fr/hal-01313266

G. Dupuy, S. Meignier, P. Deléglise, and Y. Estève, Recent Improvements towards ILP-based Clustering for Broadcast News Speaker Diarization, Proceedings of Odyssey 2014 - The Speaker and Language Recognition Workshop, 2014.

Z. N. Karam and W. M. Campbell, Graph Embedding for Speaker Recognition, Graph Embedding for Pattern Analysis, pp.229-260, 2013.