Voice-Based Gender Identification in multimedia applications

Abstract : In content-based multimedia indexing, gender identification from the speech signal is an important task. This paper compares a set of acoustic and pitch features, together with different classifiers, on the gender identification problem. We show that fusing features and classifiers performs better than any individual classifier. Based on these conclusions, we built a gender identification system for multimedia applications. The system uses a set of neural networks with acoustic and pitch-related features. It achieves 90% classification accuracy on 1-second segments, independently of the language and the channel of the speech. Practical considerations, such as exploiting the continuity of speech and using a mixture of experts instead of a single expert, are shown to improve the classification accuracy to 93%. On a subset of the Switchboard database, the classification accuracy reaches 98.5% for 5-second segments.
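
The abstract does not say how the experts are combined or how speech continuity is exploited. As a rough illustration only, the sketch below assumes each expert outputs a probability that a 1-second segment is male speech, fuses the experts by simple averaging, and then smooths the fused scores over neighbouring segments with a sliding window. The function names, the window size, and the scores are all hypothetical and are not taken from the paper.

```python
from statistics import mean

def fuse_experts(expert_probs):
    """Average the P(male) estimates of several experts for one segment.

    Simple averaging is an assumption; the paper may use another rule.
    """
    return mean(expert_probs)

def smooth_decisions(segment_probs, window=3):
    """Label each segment from the mean fused score of its neighbours.

    This uses the continuity of speech: adjacent segments are assumed
    to come from the same speaker, so isolated errors get outvoted.
    """
    half = window // 2
    labels = []
    for i in range(len(segment_probs)):
        lo = max(0, i - half)
        hi = min(len(segment_probs), i + half + 1)
        avg = mean(segment_probs[lo:hi])
        labels.append("male" if avg >= 0.5 else "female")
    return labels

# Hypothetical scores: three experts, five consecutive 1-second segments.
per_segment = [
    [0.9, 0.8, 0.7],
    [0.6, 0.7, 0.9],
    [0.4, 0.3, 0.6],  # ambiguous segment, would be "female" on its own
    [0.8, 0.9, 0.7],
    [0.7, 0.8, 0.9],
]
fused = [fuse_experts(p) for p in per_segment]
labels = smooth_decisions(fused, window=3)
print(labels)
```

In this toy run the third segment falls below 0.5 after fusion, but the smoothing step pulls it back in line with its neighbours, which is the kind of gain the abstract attributes to exploiting continuity.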
Document type :
Journal articles

https://hal.archives-ouvertes.fr/hal-01587130
Contributor : Équipe Gestionnaire Des Publications Si Liris
Submitted on : Wednesday, September 13, 2017 - 4:46:50 PM
Last modification on : Thursday, November 21, 2019 - 2:23:44 AM

Citation

Hadi Harb, Liming Chen. Voice-Based Gender Identification in multimedia applications. Journal of Intelligent Information Systems, Springer Verlag, 2005, 24 (2-3), pp.179-198. ⟨10.1007/s10844-005-0322-8⟩. ⟨hal-01587130⟩
