Conference paper, Year: 2008

Long-Term Flexible 2D Cepstral Modeling of Speech Spectral Amplitudes

Abstract

This paper presents a method for modeling the envelope of spectral amplitude parameters of speech signals in "two dimensions" (2D). It consists of two cascaded modeling stages. The first, along the frequency axis, is the usual cepstrum technique, which models the log-scaled spectral envelope with a Discrete Cosine Model (DCM). The second, along the time axis, models the trajectory of the envelope DCM coefficients with another, similar DCM. An iterative algorithm is proposed to optimally fit this 2D model to the data according to a perceptual criterion based on frequency masking. This approach is shown to provide an efficient and flexible representation of spectral amplitude parameters in terms of coefficient rates while preserving good signal quality, opening new perspectives in very-low-bit-rate sinusoidal speech coding.
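The two cascaded stages described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes log amplitudes sampled on a fixed frequency grid shared by all frames, and it uses plain least squares in place of the iterative, masking-based perceptual fit. The names cosine_basis, fit_2d_dcm and synthesize, and the normalization of both axes to [0, 1], are hypothetical choices made for illustration only.

```python
import numpy as np


def cosine_basis(positions, order):
    """Return a matrix whose column k is cos(k * pi * x), k = 0..order.

    `positions` are assumed to be normalized to the interval [0, 1].
    """
    k = np.arange(order + 1)
    return np.cos(np.pi * np.outer(positions, k))


def fit_2d_dcm(log_amps, freq_order, time_order):
    """Fit a two-stage cosine model to log spectral amplitudes.

    log_amps has shape (n_frames, n_bins). The result has shape
    (time_order + 1, freq_order + 1).
    Assumption: ordinary least squares, not a perceptual criterion.
    """
    n_frames, n_bins = log_amps.shape
    freq_basis = cosine_basis(np.linspace(0.0, 1.0, n_bins), freq_order)
    time_basis = cosine_basis(np.linspace(0.0, 1.0, n_frames), time_order)

    # Stage 1 (frequency axis): one set of cosine coefficients per frame.
    per_frame, *_ = np.linalg.lstsq(freq_basis, log_amps.T, rcond=None)

    # Stage 2 (time axis): model each coefficient trajectory with cosines.
    coeffs, *_ = np.linalg.lstsq(time_basis, per_frame.T, rcond=None)
    return coeffs


def synthesize(coeffs, n_frames, n_bins):
    """Rebuild the (n_frames, n_bins) log-amplitude surface from coeffs."""
    time_order, freq_order = coeffs.shape[0] - 1, coeffs.shape[1] - 1
    freq_basis = cosine_basis(np.linspace(0.0, 1.0, n_bins), freq_order)
    time_basis = cosine_basis(np.linspace(0.0, 1.0, n_frames), time_order)
    return time_basis @ coeffs @ freq_basis.T
```

Under these assumptions, a segment of n_frames by n_bins log amplitudes is summarized by (time_order + 1) × (freq_order + 1) coefficients, so the two orders are the knobs that trade coefficient rate against fit accuracy.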
No file deposited

Dates and versions

hal-00329752, version 1 (13-10-2008)

Identifiers

  • HAL Id: hal-00329752, version 1

Cite

Laurent Girin, Mohammad Firouzmand. Long-Term Flexible 2D Cepstral Modeling of Speech Spectral Amplitudes. ICASSP 2008 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2008, Las Vegas, Nevada, United States. ⟨hal-00329752⟩
