AN OPEN-SOURCE SPEAKER GENDER DETECTION FRAMEWORK FOR MONITORING GENDER EQUALITY - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

AN OPEN-SOURCE SPEAKER GENDER DETECTION FRAMEWORK FOR MONITORING GENDER EQUALITY

Résumé

This paper presents an approach based on acoustic analysis to describe gender equality in French audiovisual streams, through the estimation of male and female speaking time. Gender detection systems based on Gaussian Mixture Models , i-vectors and Convolutional Neural Networks (CNN) were trained using an internal database of 2,284 French speakers and evaluated using REPERE challenge corpus. The CNN system obtained the best performance with a frame-level gender detection F-measure of 96.52 and a hourly gender speaking time percentage error bellow 0.6%. It was considered reliable enough to realize large-scale gender equality descriptions. The proposed gender detection system has been packaged as an open-source framework.
Fichier principal
Vignette du fichier
ddoukhan_icassp_2018.pdf (137.16 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01927560 , version 1 (19-11-2018)

Identifiants

  • HAL Id : hal-01927560 , version 1

Citer

David Doukhan, Jean Carrive, Félicien Vallet, Anthony Larcher, Sylvain Meignier. AN OPEN-SOURCE SPEAKER GENDER DETECTION FRAMEWORK FOR MONITORING GENDER EQUALITY. IEEE International Conference on Acoustic Speech and Signal Processing, Apr 2018, Calgary, Canada. ⟨hal-01927560⟩
279 Consultations
2650 Téléchargements

Partager

Gmail Facebook X LinkedIn More