Speech Segmentation using Regression Fusion of Boundary Predictions - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Computer Speech and Language Année : 2009

Speech Segmentation using Regression Fusion of Boundary Predictions

Iosif Mporas
  • Fonction : Auteur correspondant
  • PersonId : 901215

Connectez-vous pour contacter l'auteur
Todor Ganchev
  • Fonction : Auteur
Nikos Fakotakis
  • Fonction : Auteur

Résumé

In the present work we study the appropriateness of a number of linear and non-linear regression methods employed on the task of combining multiple phonetic boundary predictions. The proposed fusion schemes are independent of the implementation of the individual segmentation engines as well as from their number. In order to illustrate the practical significance of the proposed approach, we employ 112 speech segmentation engines based on hidden Markov models (HMMs). These engines differ in the setup of the HMMs as well as in the speech parameterization techniques they employ. In the experimental evaluation we firstly evaluate the performance of various recent speech features, which have not been tested on the speech segmentation task yet. Secondly, we evaluate the performance of several new fusion schemes for phonetic boundary predictions and finally we contrast them to some recently reported methods. Throughout this comparison, on the established for the phonetic segmentation task TIMIT database, we demonstrate that the support vector regression scheme is capable of achieving more accurate predictions, when compared to other fusion schemes reported so far.

Mots clés

Fichier principal
Vignette du fichier
PEER_stage2_10.1016%2Fj.csl.2009.04.004.pdf (553.52 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00593885 , version 1 (18-05-2011)

Identifiants

Citer

Iosif Mporas, Todor Ganchev, Nikos Fakotakis. Speech Segmentation using Regression Fusion of Boundary Predictions. Computer Speech and Language, 2009, 24 (2), pp.273. ⟨10.1016/j.csl.2009.04.004⟩. ⟨hal-00593885⟩

Collections

PEER
133 Consultations
171 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More