A high-capacity watermarking technique for audio signals based on MDCT-domain quantization

Jonathan Pinel; Laurent Girin; Cléo Baras; Mathieu Parvaix

Conference paper. Year: 2010

Jonathan Pinel

Role: Author
PersonId: 881863

GIPSA - Communication Information and Complex Systems

Laurent Girin

Role: Author
PersonId: 3682
IdHAL: laurent-girin
ORCID: 0000-0002-9214-8760
IdRef: 088998037

GIPSA - Machines parlantes, Gestes oro-faciaux, Interaction Face-à-face, Communication augmentée

Cléo Baras

Role: Author
PersonId: 737438
IdHAL: cleo-baras

GIPSA - Communication Information and Complex Systems

Mathieu Parvaix

Role: Author
PersonId: 871521

GIPSA - Machines parlantes, Gestes oro-faciaux, Interaction Face-à-face, Communication augmentée

Abstract

Watermarking consists of hiding (embedding) binary information within a signal in an imperceptible way; in the present context of audio signals, this means that the mark is inaudible. Watermarking was first used for the protection of digital content as part of Digital Rights Management (DRM). In this context of secure applications, significant effort was devoted to ensuring the robustness of watermarks against pirate attacks aimed at neutralizing them, rather than to increasing the quantity of embedded information; the bitrate was usually in the range of tens of bits per second (bps) for audio signals. Nowadays, audio watermarking can be used for other kinds of applications, in particular metadata transmission. However, bitrates usually remain quite low, although such applications require higher bitrates in exchange for lower robustness. In this study we propose a high-capacity watermarking technique for audio signals. This technique is suitable for many uncompressed audio signals, and more particularly for 16-bit Pulse Code Modulation (PCM) signals as widely used in the audio-CD and WAV formats. The proposed technique is based on applying the Quantization Index Modulation (QIM) technique to the MDCT (Modified Discrete Cosine Transform) coefficients of the signal. The underlying principle is that, if those coefficients can be significantly modified by quantization in audio compression schemes such as MPEG MP3/AAC without quality impairment, they can also be modified to embed watermark codes. Following audio compression principles, a psychoacoustic model (PAM) is used at the watermark embedder to take into account the behavior of the human auditory system and satisfy the inaudibility constraint. The PAM is used to estimate an optimal watermarking capacity for each sub-band of each MDCT frame.
The resulting capacity values are transmitted as (watermarked) side information to the decoder, so that the decoder can retrieve the useful watermarked information in the corresponding sub-band. To this end, specific fixed capacities are allotted in the higher sub-band of the spectrum. With this technique, maximal bitrates of about 250 kbps per audio channel can be reached (depending on the audio content), at the expense of robustness: the system can be used for "non-secure" applications where the signal does not suffer any attack other than quantization for uncompressed format conversion. For instance, we use this technique in a watermark-informed source separation system presented at the same congress.
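
As an illustration of the QIM principle named in the abstract, the following is a minimal sketch of scalar QIM applied to a single coefficient. It is not the authors' implementation: the abstract does not specify the quantizer design, the step size, or how the PAM sets the capacity, so the function names `qim_embed` and `qim_decode`, the parameter `step`, and the nested-lattice construction are all assumptions made for this example. A message of `capacity_bits` bits selects one of `2**capacity_bits` interleaved quantizers, and the decoder recovers the message from the index of the nearest fine-grid point.

```python
def qim_embed(x, message, capacity_bits, step):
    """Quantize coefficient x onto the sub-lattice that encodes `message`.

    Hypothetical helper: `step` is the spacing of the fine grid and
    `capacity_bits` the number of bits carried by this coefficient
    (in the paper's setting this would be derived from the PAM).
    """
    levels = 1 << capacity_bits           # number of distinct messages
    if not 0 <= message < levels:
        raise ValueError("message does not fit in capacity_bits")
    coarse = step * levels                # spacing of each sub-lattice
    offset = message * step               # shift that identifies the message
    return round((x - offset) / coarse) * coarse + offset


def qim_decode(y, capacity_bits, step):
    """Recover the message from a (possibly slightly perturbed) coefficient."""
    levels = 1 << capacity_bits
    return round(y / step) % levels       # index on the fine grid, modulo levels


# Embed the 3-bit message 5 into one coefficient.
marked = qim_embed(137.3, message=5, capacity_bits=3, step=2.0)
recovered = qim_decode(marked, capacity_bits=3, step=2.0)

# A perturbation smaller than step / 2 (for example, the effect of
# re-quantizing the time-domain signal to 16-bit PCM) leaves the message intact.
recovered_noisy = qim_decode(marked + 0.9, capacity_bits=3, step=2.0)
```

In this sketch the embedding error is at most `step * 2**capacity_bits / 2`, which shows the trade-off the abstract describes: a larger capacity in a sub-band means a larger modification of its coefficients, so the capacity has to stay within what the psychoacoustic model deems inaudible.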

Keywords

psychoacoustic model; data hiding; watermarking; audio processing

Domains

Signal and Image Processing [eess.SP]

https://hal.science/hal-00534502

Submitted on: Tuesday, 9 November 2010, 18:58:04

Last modified on: Thursday, 4 April 2024, 18:19:18

Dates and versions

hal-00534502, version 1 (09-11-2010)

Identifiers

HAL Id: hal-00534502, version 1

Cite

Jonathan Pinel, Laurent Girin, Cléo Baras, Mathieu Parvaix. A high-capacity watermarking technique for audio signals based on MDCT-domain quantization. ICA 2010 - 20th International Congress on Acoustics, Aug 2010, Sydney, Australia. pp.ICA2010. ⟨hal-00534502⟩

Collections

UGA CNRS GIPSA GIPSA-DIS GIPSA-DPC GIPSA-MAGIC GIPSA-CICS
