On the Information Geometry of Audio Streams with Applications to Similarity Computing - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue IEEE Transactions on Audio, Speech and Language Processing Année : 2011

On the Information Geometry of Audio Streams with Applications to Similarity Computing

Résumé

This paper proposes methods for information processing of audio streams using methods of information geometry. We lay the theoretical groundwork for a framework allowing the treatment of signal information as information entities, suitable for similarity and symbolic computing on audio signals. The theoretical basis of this paper is based on the information geometry of statistical structures representing audio spectrum features, and specifically through the bijection between the generic families of Bregman divergences and that of exponential distributions. The proposed framework, called Music Information Geometry allows online segmentation of audio streams to metric balls where each ball represents a quasi-stationary continuous chunk of audio, and discusses methods to qualify and quantify information between entities for similarity computing. We define an information geometry that approximates a similarity metric space, redefine general notions in music information retrieval such as similarity between entities, and address methods for dealing with non-stationarity of audio signals. We demonstrate the framework on two sample applications for online audio structure discovery and audio matching.
Fichier principal
Vignette du fichier
ACONT_IEEE_MIG2010_2col.pdf (14.72 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00579590 , version 1 (24-03-2011)

Identifiants

Citer

Arshia Cont, Shlomo Dubnov, Gérard Assayag. On the Information Geometry of Audio Streams with Applications to Similarity Computing. IEEE Transactions on Audio, Speech and Language Processing, 2011, 19 (4), pp.837-846. ⟨10.1109/TASL.2010.2066266⟩. ⟨hal-00579590⟩
338 Consultations
269 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More