Multichannel Object-Based Audio Coding with Controllable Quality - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2013

Multichannel Object-Based Audio Coding with Controllable Quality

Résumé

In this paper a new multichannel object-based audio coding scheme with scalable signal quality is proposed. The novel scheme is based on controlled downmixing and demixing. By means of a dedicated control mechanism, a number of distinct audio objects are mixed into a lower number of channels. The latter is chosen such that the desired quality level is met after demixing. The quality is assessed with two new psychoacoustically motivated metrics. Following the informed source separation approach, the downmix is decomposed via optimum spatial filtering guided by short-time power spectral densities of the audio objects. In an experiment it is shown that the raw data rate of an exemplary 10-track recording can be reduced by at least 30 % using linear pulse-code modulation while maintaining perceptual transparency.

Domaines

Son [cs.SD]
Fichier principal
Vignette du fichier
Gorlow_ICASSP2013.pdf (295.4 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00806382 , version 1 (12-06-2013)

Identifiants

  • HAL Id : hal-00806382 , version 1

Citer

Stanislaw Gorlow, Emanuël A. P. Habets, Sylvain Marchand. Multichannel Object-Based Audio Coding with Controllable Quality. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), May 2013, Vancouver, Canada. pp.561-565. ⟨hal-00806382⟩
175 Consultations
505 Téléchargements

Partager

Gmail Facebook X LinkedIn More