A Comparative Study of Glottal Source Estimation Techniques

Thomas Drugman; Baris Bozkurt; T. Dutoit

doi:10.1016/j.csl.2011.03.003

Article Dans Une Revue Computer Speech and Language Année : 2011

A Comparative Study of Glottal Source Estimation Techniques

, (1) , (2)

1
2

Thomas Drugman

Fonction : Auteur correspondant
PersonId : 931988

Connectez-vous pour contacter l'auteur

Baris Bozkurt

Fonction : Auteur

Department of Electrical & Electronics Engineering

T. Dutoit

Fonction : Auteur

University of Mons [Belgium]

Résumé

Source-tract decomposition (or glottal ow estimation) is one of the basic problems of speech processing. For this, several techniques have been proposed in the literature. However studies comparing difierent approaches are almost nonexistent. Besides, experiments have been systematically performed either on synthetic speech or on sustained vowels. In this study we compare three of the main representative state-of-the-art methods of glottal ow estimation: closed-phase inverse _ltering, iterative and adaptive inverse _ltering, and mixed-phase decomposition. These techniques are _rst submitted to an objective assessment test on synthetic speech signals. Their sensitivity to various factors a_ecting the estimation quality, as well as their robustness to noise are studied. In a second experiment, their ability to label voice quality (tensed, modal, soft) is studied on a large corpus of real connected speech. It is shown that changes of voice quality are reected by signi_cant modi_cations in glottal feature distributions. Techniques based on the mixed-phase decomposition and on a closed-phase inverse _ltering process turn out to give the best results on both clean synthetic and real speech signals. On the other hand, iterative and adaptive inverse _ltering is recommended in noisy environments for its high robustness.

Mots clés

Source-tract Separation Glottal Flow Estimation Inverse Filtering Mixed-Phase Decomposition Voice Quality

Domaines

Linguistique

Fichier principal

PEER_stage2_10.1016%2Fj.csl.2011.03.003.pdf (1.51 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Hal Peer : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00770346

Soumis le : samedi 5 janvier 2013-03:51:27

Dernière modification le : mercredi 27 novembre 2019-16:23:20

Archivage à long terme le : vendredi 31 mars 2017-23:58:16

Dates et versions

hal-00770346 , version 1 (05-01-2013)

Identifiants

HAL Id : hal-00770346 , version 1
DOI : 10.1016/j.csl.2011.03.003

Citer

Thomas Drugman, Baris Bozkurt, T. Dutoit. A Comparative Study of Glottal Source Estimation Techniques. Computer Speech and Language, 2011, 26 (1), pp.20. ⟨10.1016/j.csl.2011.03.003⟩. ⟨hal-00770346⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

PEER

104 Consultations

385 Téléchargements

A Comparative Study of Glottal Source Estimation Techniques

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager