S. Arora, E. Nyberg, and C. P. Rosé, Estimating annotation cost for active learning in a multi-annotator environment, Proceedings of the NAACL HLT 2009 Workshop on Active Learning for Natural Language Processing, pp.18-26, 2009.

C. Barras, E. Geoffrois, Z. Wu, and M. Liberman, Transcriber: development and use of a tool for assisting speech corpora production, Speech Communication, vol.33, issue.1, pp.5-22, 2001.
URL : https://hal.archives-ouvertes.fr/hal-01690349

T. Bazillon, Y. Estève, and D. Luzzati, Transcription manuelle vs assistée de la parole préparé et spontanée, 2008.

J. Bonastre, P. Delacourt, C. Fredouille, T. Merlin, and C. Wellekens, A speaker tracking system based on speaker turn detection for nist evaluation, Acoustics, Speech, and Signal Processing, 2000. ICASSP'00. Proceedings. 2000 IEEE International Conference on, vol.2, pp.1177-1180, 2000.
DOI : 10.1109/icassp.2000.859175

P. Broux, D. Doukhan, S. Petitrenaud, S. Meignier, C. et al., An active learning method for speaker identity annotation in audio recordings, 1st International Workshop on Multimodal Media Data Analytics (MMDA), 2016.
URL : https://hal.archives-ouvertes.fr/hal-01451532

, European Conference on Artificial Intelligence (ECAI)

M. Budnik, J. Poignant, L. Besacier, and G. Quénot, Automatic propagation of manual annotations for multimodal person identification in tv shows, Content-Based Multimedia Indexing (CBMI), pp.1-4, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01002927

M. Charhad, D. Moraru, S. Ayache, and G. Quénot, Speaker identity indexing in audio-visual documents, Content-Based Multimedia Indexing (CBMI2005), 2005.
URL : https://hal.archives-ouvertes.fr/hal-00953917

R. Dufour, V. Jousse, Y. Estève, F. Béchet, and G. Linarès, Spontaneous speech characterization and detection in large audio database, 2009.
URL : https://hal.archives-ouvertes.fr/hal-01433943

O. Galibert, Methodologies for the evaluation of speaker diarization and automatic speech recognition in the presence of overlapping speech, INTERSPEECH, pp.1131-1134, 2013.

J. Kahn, O. Galibert, L. Quintard, M. Carré, A. Giraudel et al., A presentation of the repere challenge, Content-Based Multimedia Indexing (CBMI), pp.1-6, 2012.

I. A. Mccowan, D. Moore, J. Dines, D. Gatica-perez, M. Flynn et al., On the use of information retrieval measures for speech recognition evaluation, 2004.

S. Meignier and T. Merlin, Lium spkdiarization: an open source toolkit for diarization, CMU SPUD Workshop, 2010.
URL : https://hal.archives-ouvertes.fr/hal-01433518

N. , The rich transcription spring 2003 (RT-03S) evaluation plan, 2003.

R. Ordelman, F. De-jong, and M. Larson, Enhanced multimedia content access and exploitation using semantic speech retrieval, Semantic Computing, 2009. ICSC'09. IEEE International Conference on, pp.521-528, 2009.

M. Snover, B. Dorr, R. Schwartz, L. Micciulla, and J. Makhoul, A study of translation edit rate with targeted human annotation, Proceedings of association for machine translation in the Americas, 0200.

F. Vallet, J. Uro, J. Andriamakaoly, H. Nabi, M. Derval et al., Speech trax: A bottom to the top approach for speaker tracking and indexing in an archiving context, Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC). European Language Resources Association (ELRA), 2016.

P. Wittenburg, H. Brugman, A. Russel, A. Klassmann, and H. Sloetjes, Elan: a professional framework for multimodality research, Proceedings of LREC, p.5, 2006.

M. E. Wood and E. Lewis, Windmill-the use of a parsing algorithm to produce predictions for disabled persons, PROCEEDINGS-INSTITUTE OF ACOUSTICS, vol.18, pp.315-322, 1996.