Handbook on the implementation of Recommendation CM/Rec(2013)1 of the Committee of Ministers of the Council of Europe on gender equality and media, 2015.

M. Reiser and B. Gresy, L'image des femmes dans les médias, 2008.

La représentation des femmes à la télévision et à la radio - rapport sur l'exercice 2016, Conseil supérieur de l'audiovisuel (CSA), 2017.

S. Macharia, Who Makes the News?: Global Media Monitoring Project 2015, World Association for Christian Communication, 2015.

E. Pépiot, Voice, speech and gender: male-female acoustic differences and cross-language variation in English and French speakers, Corela. Cognition, 2015.

L. F. Lamel and J.-L. Gauvain, A phone-based approach to non-linguistic speech feature identification, Computer Speech & Language, vol.9, issue.1, pp.87-103, 1995.

T. Bocklet, A. Maier, J. G. Bauer, F. Burkhardt, and E. Nöth, Age and gender recognition for telephone applications based on GMM supervectors and support vector machines, Acoustics, Speech and Signal Processing, pp.1605-1608, 2008.

R. Xia, J. Deng, B. Schuller, and Y. Liu, Modeling gender information for emotion recognition using denoising autoencoder, Acoustics, Speech and Signal Processing (ICASSP), pp.990-994, 2014.

L. El Shafey, E. Khoury, and S. Marcel, Audio-visual gender recognition in uncontrolled environment using variability modeling techniques, International Joint Conference on Biometrics (IJCB), IEEE, pp.1-8, 2014.

N. Dehak, P. J. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, Front-end factor analysis for speaker verification, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.4, pp.788-798, 2011.

J. Royo-Letelier, R. Hennequin, and M. Moussallam, Detection and characterization of singing voice using deep neural networks, 2015.

F. Salmon and F. Vallet, An effortless way to create large-scale datasets for famous speakers, LREC, pp.348-352, 2014.

F. Vallet, J. Uro, J. Andriamakaoly, H. Nabi, M. Derval et al., 2016.

A. Nagrani, J. S. Chung, and A. Zisserman, VoxCeleb: A large-scale speaker identification dataset, Proc. Interspeech, pp.2616-2620, 2017.

S. Meignier and T. Merlin, LIUM SpkDiarization: an open source toolkit for diarization, CMU SPUD Workshop, 2010.
URL : https://hal.archives-ouvertes.fr/hal-01433518

J. Poignant, L. Besacier, G. Quénot, and F. Thollard, From text detection in videos to person identification, Multimedia and Expo (ICME), IEEE, pp.854-859, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00767383

D. Doukhan and J. Carrive, Investigating the Use of Semi-Supervised Convolutional Neural Network Models for Speech/Music Classification and Segmentation, The Ninth International Conferences on Advances in Multimedia, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01514228

A. Larcher, K. A. Lee, and S. Meignier, An extensible speaker identification SIDEKIT in Python, Acoustics, Speech and Signal Processing, IEEE, pp.5095-5099, 2016.
DOI : 10.1109/icassp.2016.7472648
URL : https://hal.archives-ouvertes.fr/hal-01433157

J. Pelecanos and S. Sridharan, Feature warping for robust speaker verification, 2001.

S. Meignier and A. Larcher, S4D: SIDEKIT for speaker diarization, 2015.

F. Chollet, Keras, 2015.

Y. Qian, M. Bi, T. Tan, and K. Yu, Very deep convolutional neural networks for noise robust speech recognition, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.12, pp.2263-2276, 2016.
DOI : 10.1109/taslp.2016.2602884

A. Giraudel, M. Carré, V. Mapelli, J. Kahn, O. Galibert et al., The REPERE corpus: a multimodal corpus for person recognition, LREC, pp.1102-1107, 2012.

D. Doukhan, G. Poels, and J. Carrive, Describing gender equality in French audiovisual streams with a deep learning approach (submitted), Journal of European Television History and Culture, 2018.

G. Le Lan, D. Charlet, A. Larcher, and S. Meignier, A triplet ranking-based neural network for speaker diarization and linking, Proc. Interspeech 2017, pp.3572-3576, 2017.