A General Audio Semantic Classifier based on human perception motivated mode

Hadi Harb; Liming Chen

doi:10.1007/s11042-007-0108-9

Article Dans Une Revue Multimedia Tools and Applications Année : 2007

A General Audio Semantic Classifier based on human perception motivated mode

(1) , (1)

Hadi Harb

Fonction : Auteur

Laboratoire d'InfoRmatique en Image et Systèmes d'information

Liming Chen

Fonction : Auteur
PersonId : 7562
IdHAL : liming-chen
IdRef : 067400175

Laboratoire d'InfoRmatique en Image et Systèmes d'information

Résumé

The audio channel conveys rich clues for content-based multimedia indexing. Interesting audio analysis includes, besides widely known speech recognition and speaker identification problems, speech/music segmentation, speaker gender detection, special effect recognition such as gun shots or car pursuit, and so on. All these problems can be considered as an audio classification problem which needs to generate a label from low audio signal analysis. While most audio analysis techniques in the literature are problem specific, we propose in this paper a general framework for audio classification. The proposed technique uses a perceptually motivated model of the human perception of audio classes in the sense that it makes a judicious use of certain psychophysical results and relies on a neural network for classification. In order to assess the effectiveness of the proposed approach, large experiments on several audio classification problems have been carried out, including speech/music discrimination in Radio/TV programs, gender recognition on a subset of the switchboard database, highlights detection in sports videos, and musical genre recognition. The classification accuracies of the proposed technique are comparable to those obtained by problem specific techniques while offering the basis of a general approach for audio classification.

Domaines

Informatique [cs]

Équipe gestionnaire des publications SI LIRIS : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01589771

Soumis le : mardi 19 septembre 2017-09:20:56

Dernière modification le : mercredi 5 juillet 2023-15:28:04

Dates et versions

hal-01589771 , version 1 (19-09-2017)

Identifiants

HAL Id : hal-01589771 , version 1
DOI : 10.1007/s11042-007-0108-9

Citer

Hadi Harb, Liming Chen. A General Audio Semantic Classifier based on human perception motivated mode. Multimedia Tools and Applications, 2007, 3, 34, pp.375-395. ⟨10.1007/s11042-007-0108-9⟩. ⟨hal-01589771⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON LIRIS LABEXIMU INSA-GROUPE UDL EC_LYON_STRICT

60 Consultations

0 Téléchargements

A General Audio Semantic Classifier based on human perception motivated mode

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager