Skip to Main content Skip to Navigation
Journal articles

Voice-Based Gender Identification in multimedia applications

Abstract : In the context of content-based multimedia indexing gender identification based on speech signal is an important task. In this paper a set of acoustic and pitch features along with different classifiers are compared for the problem of gender identification. We show that the fusion of features and classifiers performs better than any individual classifier. Based on such conclusions we built a system for gender identification in multimedia applications. The system uses a set of Neural Networks with acoustic and Pitch related features. 90% of classification accuracy is obtained for 1 second segments and with independence to the language and the channel of the speech. Practical considerations, such as the continuity of speech and the use of mixture of experts instead of one single expert are shown to improve the classification accuracy to 93%. When used on a subset of the Switchboard database, the classification accuracy attains 98.5% for 5 seconds segments.
Document type :
Journal articles
Complete list of metadata
Contributor : Équipe gestionnaire des publications SI LIRIS Connect in order to contact the contributor
Submitted on : Wednesday, September 13, 2017 - 4:46:50 PM
Last modification on : Tuesday, June 1, 2021 - 2:08:05 PM

Links full text



Hadi Harb, Liming Chen. Voice-Based Gender Identification in multimedia applications. Journal of Intelligent Information Systems, Springer Verlag, 2005, 2-3, 24, pp.179-198. ⟨10.1007/s10844-005-0322-8⟩. ⟨hal-01587130⟩



Record views