Estimation of Speech Lip Features from Discrete Cosinus Transform
Résumé
This study is a contribution to the field of visual speech processing. It focuses on the automatic extraction of Speech lip features from natural lips. The method is based on the direct prediction of these features from predictors derived from an adequate transformation of the pixels of the lip region of interest. The transformation is made of a 2-D Discrete Cosine Transform combined with a Principal Component Analysis applied to a subset of the DCT coefficients corresponding to about 1% of the total DCTs. The results show the possibility to estimate the geometric lip feature with a good accuracy (a root mean square of 1 to 1.4 mm for the lip aperture and the lip width) using a reduce set of predictors derived from the PCA.
Fichier principal
zuhengBeautempsGangSchmerber_Interspeech2010_v_3004_21h.pdf (120.67 Ko)
Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...