Multichannel Object-Based Audio Coding with Controllable Quality

Stanislaw Gorlow 1, * Emanuël Habets 2 Sylvain Marchand 3
* Corresponding author
3 Lab-STICC_UBO_CID_IHSEV
Lab-STICC - Laboratoire des sciences et techniques de l'information, de la communication et de la connaissance
Abstract : In this paper a new multichannel object-based audio coding scheme with scalable signal quality is proposed. The novel scheme is based on controlled downmixing and demixing. By means of a dedicated control mechanism, a number of distinct audio objects are mixed into a lower number of channels. The latter is chosen such that the desired quality level is met after demixing. The quality is assessed with two new psychoacoustically motivated metrics. Following the informed source separation approach, the downmix is decomposed via optimum spatial filtering guided by short-time power spectral densities of the audio objects. In an experiment it is shown that the raw data rate of an exemplary 10-track recording can be reduced by at least 30 % using linear pulse-code modulation while maintaining perceptual transparency.
Document type :
Conference papers
Liste complète des métadonnées

Cited literature [13 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00806382
Contributor : Stanislaw Gorlow <>
Submitted on : Wednesday, June 12, 2013 - 2:11:57 PM
Last modification on : Monday, February 25, 2019 - 3:14:11 PM
Document(s) archivé(s) le : Friday, September 13, 2013 - 2:45:13 AM

File

Gorlow_ICASSP2013.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00806382, version 1

Citation

Stanislaw Gorlow, Emanuël Habets, Sylvain Marchand. Multichannel Object-Based Audio Coding with Controllable Quality. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), May 2013, Vancouver, Canada. pp.561-565. ⟨hal-00806382⟩

Share

Metrics

Record views

221

Files downloads

397