Gender Identification Using A General Audio Classifier

Abstract : In the context of content-based multimedia indexing gender identification using speech signal is an important task. Existing techniques are dependent on the quality of the speech signal making them unsuitable for the video indexing problems. In this paper we introduce a novel gender identification approach based on a general audio classifier. The audio classifier models the audio signal by the first order spectrum's statistics in 1s windows and uses a set of neural networks as classifiers. The presented technique shows robustness to adverse audio compression and it is language independent. We show how practical considerations about the speech in audio-visual data, such as the continuity of speech, can further improve the classification results which attain 92%.
Document type :
Conference papers
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01587090
Contributor : Équipe Gestionnaire Des Publications Si Liris <>
Submitted on : Wednesday, September 13, 2017 - 4:11:40 PM
Last modification on : Thursday, November 21, 2019 - 1:46:09 AM

Identifiers

Citation

Hadi Harb, Liming Chen. Gender Identification Using A General Audio Classifier. International Conference on Multimedia & Expo, ICME 2003, Jul 2003, Baltimore, United States. ⟨10.1109/ICME.2003.1221721⟩. ⟨hal-01587090⟩

Share

Metrics

Record views

115