Acoustic modeling for under-resourced languages based on vectorial HMM-states representation using Subspace Gaussian Mixture Models

Mohamed Bouallegue; Emmanuel Ferreira; Driss Matrouf; Georges Linarès; Maria Goudi; Pascal Nocera

doi:10.1109/SLT.2012.6424245

Conference paper, Year: 2012


Mohamed Bouallegue

Role: Author
PersonId : 772200
IdRef : 177675128

Laboratoire Informatique d'Avignon

Emmanuel Ferreira

Role: Author
PersonId : 772440
IdRef : 192901885

Laboratoire Informatique d'Avignon

Driss Matrouf

Role: Author
PersonId : 176307
IdHAL : driss-matrouf
IdRef : 137773439

Laboratoire Informatique d'Avignon

Georges Linarès

Role: Author
PersonId : 4977
IdHAL : georges-linares
IdRef : 079368794

Laboratoire Informatique d'Avignon

Maria Goudi

Role: Author

Laboratoire Informatique d'Avignon

Pascal Nocera

Role: Author

Laboratoire Informatique d'Avignon

Abstract

This paper explores a novel method for building context-dependent acoustic models in automatic speech recognition (ASR) for under-resourced languages. We present a simple way to implement state tying, based on a new vectorial representation of the HMM states. Each state is represented by a vector with a small number of parameters, obtained within the Subspace Gaussian Mixture Model (SGMM) paradigm. The proposed method requires neither phonetic knowledge nor a large amount of data, the two requirements that pose the major problems for acoustic modeling of under-resourced languages. This paper shows how this representation can be obtained and used for state tying. Our experiments on Vietnamese show that this approach achieves a stable gain over the classical approach based on decision trees. Furthermore, the method appears to be portable to other languages, as shown in a preliminary study conducted on Berber.
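As a rough illustration of what tying states through a vector representation can look like, the sketch below groups HMM states whose low-dimensional vectors lie close together. It is not the authors' procedure: the abstract does not say how the vectors are clustered, so the greedy centroid rule, the function name tie_states, the threshold value, the state names, and the two-dimensional vectors are all hypothetical choices made for this example.

```python
import math


def tie_states(state_vectors, threshold):
    """Greedily group states whose vectors lie within `threshold`
    (Euclidean distance) of an existing cluster centroid.

    Illustrative assumption only; the paper's clustering rule is not
    given in the abstract.
    """
    centroids = []  # running mean vector of each cluster
    members = []    # state names assigned to each cluster
    for name, vec in state_vectors.items():
        best, best_dist = None, threshold
        for idx, centroid in enumerate(centroids):
            d = math.dist(vec, centroid)
            if d <= best_dist:
                best, best_dist = idx, d
        if best is None:
            # No centroid is close enough: start a new tied state.
            centroids.append(list(vec))
            members.append([name])
        else:
            # Join the nearest cluster and update its running mean.
            members[best].append(name)
            n = len(members[best])
            centroids[best] = [c + (x - c) / n
                               for c, x in zip(centroids[best], vec)]
    return {name: idx for idx, group in enumerate(members) for name in group}


# Hypothetical state vectors (real SGMM state vectors have more dimensions).
state_vectors = {
    "a-b+c.s2": (0.10, 0.20),
    "a-b+d.s2": (0.12, 0.18),
    "e-b+c.s2": (0.90, 0.95),
}
tying = tie_states(state_vectors, threshold=0.1)
```

States that map to the same index would share one set of acoustic parameters. Because the grouping relies only on distances between vectors, a sketch like this needs no phonetic question set, which is the property the abstract highlights.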

Keywords

Acoustic modelling; under-resourced languages; HMM-state vector representation; state-tying; Subspace Gaussian Mixture Models

Domains

Computer Science [cs]


https://hal.science/hal-01313103

Submitted on: Monday, May 9, 2016, 15:34:12

Last modified on: Tuesday, March 22, 2022, 14:40:01

Dates and versions

hal-01313103 , version 1 (09-05-2016)

Identifiers

HAL Id : hal-01313103 , version 1
DOI : 10.1109/SLT.2012.6424245

Cite

Mohamed Bouallegue, Emmanuel Ferreira, Driss Matrouf, Georges Linarès, Maria Goudi, et al. Acoustic modeling for under-resourced languages based on vectorial HMM-states representation using Subspace Gaussian Mixture Models. IEEE Spoken Language Technology Workshop (SLT), Dec 2012, Miami, United States. ⟨10.1109/SLT.2012.6424245⟩. ⟨hal-01313103⟩


Collections

UNIV-AVIGNON LIA

