Which unit for acoustic and language modeling for Khmer Automatic Speech Recognition?

Sopheap Seng; Sethserey Sam; Viet-Bac Le; Brigitte Bigi; Laurent Besacier

Communication Dans Un Congrès Année : 2008

Which unit for acoustic and language modeling for Khmer Automatic Speech Recognition?

(1) , (2) , , (1) , (3)

1
2
3

Sopheap Seng

Fonction : Auteur

Communication Langagière et Interaction Personne-Système

Sethserey Sam

Fonction : Auteur

Department of Mathematics [MIT]

Viet-Bac Le

Fonction : Auteur

Brigitte Bigi

Fonction : Auteur
PersonId : 7990
IdHAL : brigittebigi
ORCID : 0000-0003-1834-6918
IdRef : 079410790

Communication Langagière et Interaction Personne-Système

Laurent Besacier

Fonction : Auteur
PersonId : 1521
IdHAL : laurent-besacier
ORCID : 0000-0001-7411-9125
IdRef : 079377017

Groupe d’Étude en Traduction Automatique/Traitement Automatisé des Langues et de la Parole

Résumé

In this paper we present an overview on the development of a large vocabulary continuous speech recognition system for Khmer language. Methods and tools used for quick language resources collection for the development of an ASR system for a new under-resourced language are presented. Face with the problem of lack of text data and the word error segmentation in language modeling, we investigate how different views of the text data (word and sub-word units) can be exploited for Khmer language modeling. We propose to work both at the model level (by making hybrid vocabularies with both word and sub-word units) as well as at the ASR output level (by using a simple N-best list voting mechanism). For acoustic modeling, we use basic linguistic rules to automatically generate pronunciation dictionaries based on grapheme and phoneme. An experimental framework is setup to evaluate the performance of each modeling units. Index Terms-ASR, Khmer, word and sub-word units, acoustic modeling, language modeling.

Mots clés

Speech ASR Khmer

Domaines

Informatique et langage [cs.CL] Sciences de l'information et de la communication

Fichier principal

seng2008sltu.pdf (193.96 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Brigitte Bigi : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01392526

Soumis le : mardi 13 décembre 2016-14:47:00

Dernière modification le : jeudi 4 avril 2024-20:58:22

Archivage à long terme le : mardi 14 mars 2017-11:51:01

Dates et versions

hal-01392526 , version 1 (13-12-2016)

Licence

Identifiants

HAL Id : hal-01392526 , version 1

Citer

Sopheap Seng, Sethserey Sam, Viet-Bac Le, Brigitte Bigi, Laurent Besacier. Which unit for acoustic and language modeling for Khmer Automatic Speech Recognition?. International Workshop on Spoken Languages Technologies for Under-resourced languages, 2008, Hanoi, Vietnam. pp.33-38. ⟨hal-01392526⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA IMAG CNRS LIG LIG_TDCGE_GETALP CAMPUS-AAR AAI POLYTECH-GRENOBLE LIG_SIDCH

209 Consultations

226 Téléchargements

Which unit for acoustic and language modeling for Khmer Automatic Speech Recognition?

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Partager