Parametric audio coding with exponentially damped sinusoids

Abstract: Sinusoidal modeling is one of the most popular techniques for low-bitrate audio coding. Usually, the sinusoidal parameters (amplitude, angular frequency and phase of each sinusoidal component) are kept constant within a time segment. An alternative model, the so-called Exponentially Damped Sinusoidal (EDS) model, includes an additional damping parameter for each sinusoidal component to better represent the signal characteristics. However, it has never been shown that the EDS model can be efficient for perceptual audio coding. To that end, we propose in this paper an efficient analysis/synthesis framework with dynamic time segmentation on transients and psychoacoustic modeling, and an asymptotically optimal entropy-constrained quantization method for the four sinusoidal parameters (i.e., including damping). We then apply this coding technique to real audio excerpts for a given entropy target corresponding to a low bitrate (20 kbit/s), and compare this method with a classical sinusoidal coding scheme using a constant-amplitude sinusoidal model and the perceptually weighted Matching Pursuit algorithm. Subjective listening tests show that the EDS model is more efficient on audio samples with fast transient content, and similar to the classical model on more stationary audio samples.
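
To illustrate the difference between the two signal models named in the abstract, here is a minimal synthesis sketch. It is not the authors' implementation: the function name `synthesize_eds`, the parameter tuple layout, the units (radians per sample, damping per sample) and the sign convention (positive damping means decay) are all assumptions made for this example. It only shows how one segment could be rebuilt from the four parameters per component.

```python
import math


def synthesize_eds(components, num_samples):
    """Sum exponentially damped sinusoids over one time segment.

    components: iterable of (amplitude, damping, omega, phase) tuples.
    Assumed conventions (hypothetical, not taken from the paper):
    omega is in radians per sample, damping is per sample, and a
    positive damping value makes the component decay.
    """
    signal = [0.0] * num_samples
    for amplitude, damping, omega, phase in components:
        for n in range(num_samples):
            envelope = amplitude * math.exp(-damping * n)
            signal[n] += envelope * math.cos(omega * n + phase)
    return signal


# With damping = 0 the model reduces to the classical
# constant-amplitude sinusoidal model.
constant = synthesize_eds([(1.0, 0.0, 0.1 * math.pi, 0.0)], 64)

# A single damped component: same frequency, decaying envelope.
damped = synthesize_eds([(1.0, 0.05, 0.1 * math.pi, 0.0)], 64)
```

Setting the damping to zero recovers the constant-amplitude case, which is why the EDS model can be read as a four-parameter extension of the classical three-parameter model.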
Document type:
Journal article
IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2013, 21 (7), pp.1489-1501. 〈10.1109/TASL.2013.2255284〉

Cited literature: 28 references

https://hal.archives-ouvertes.fr/hal-00881698
Contributor: Olivier Derrien
Submitted on: Friday, 8 November 2013 - 17:44:45
Last modified on: Tuesday, 17 April 2018 - 16:32:09
Document(s) archived on: Sunday, 9 February 2014 - 09:50:12

File

Derrien_SLAP13.pdf
Files produced by the author(s)

Citation

Olivier Derrien, Roland Badeau, Gaël Richard. Parametric audio coding with exponentially damped sinusoids. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2013, 21 (7), pp.1489-1501. 〈10.1109/TASL.2013.2255284〉. 〈hal-00881698〉

Metrics

Record views: 311
File downloads: 205