M. Bostock, Data-Driven Documents, 2014.

J. Poignant, H. Bredin, V. Le, L. Besacier, C. Barras et al., Unsupervised speaker identification using overlaid texts in tv broadcast, Proceedings of the 13th Annual Conference of the International Speech Communication Association (Interspeech), 2012.
URL : https://hal.archives-ouvertes.fr/hal-00767427

M. Budnik, J. Poignant, L. Besacier, and G. Quénot, Automatic propagation of manual annotations for multimodal person identification in TV shows, 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI), 2014.
DOI : 10.1109/CBMI.2014.6849849
URL : https://hal.archives-ouvertes.fr/hal-01002927

C. Barras, X. Zhu, S. Meignier, and J. Gauvain, Multistage speaker diarization of broadcast news, Audio, Speech, and Language Processing, pp.1505-1512, 2006.
DOI : 10.1109/TASL.2006.878261
URL : https://hal.archives-ouvertes.fr/hal-01434241

A. Giraudel, M. Carré, V. Mapelli, J. Kahn, O. Galibert et al., The repere corpus: a multimodal corpus for person recognition, LREC, pp.1102-1107, 2012.