Conference paper, Year: 2008

Recent advances in Automatic Speech Recognition for Vietnamese

Abstract

This paper presents our recent work on automatic speech recognition (ASR) for Vietnamese. First, we describe our methods and tools for collecting and processing text data. For language modeling, we investigate word, sub-word, and hybrid word/sub-word models. For acoustic modeling, where only limited Vietnamese speech data are available, we propose several cross-lingual acoustic modeling techniques. Furthermore, since sub-word units can reduce the high out-of-vocabulary rate and compensate for the scarcity of text resources in statistical language modeling, we propose several methods to decompose, normalize, and combine word and sub-word lattices generated by different ASR systems. Experimental results on the VnSpeechCorpus demonstrate the feasibility of our methods.
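To illustrate why sub-word units can lower the out-of-vocabulary (OOV) rate, here is a minimal sketch in Python. It is not the authors' implementation: the function names `oov_rate` and `to_subwords`, the toy word lists, and the convention that the syllables of a multi-syllable word are joined with an underscore are all assumptions made for this example.

```python
def oov_rate(tokens, vocabulary):
    """Return the fraction of tokens that are missing from the vocabulary."""
    if not tokens:
        return 0.0
    missing = sum(1 for token in tokens if token not in vocabulary)
    return missing / len(tokens)


def to_subwords(tokens, sep="_"):
    """Split each word into sub-word units (here: syllables joined by `sep`)."""
    return [unit for token in tokens for unit in token.split(sep)]


# Toy data invented for illustration only (not taken from the paper).
train_words = ["học_sinh", "đi", "học", "sinh_viên"]
test_words = ["sinh_viên", "học_viên", "đi", "học"]

# Vocabularies built from the training text at each granularity.
word_vocab = set(train_words)
subword_vocab = set(to_subwords(train_words))

# OOV rate of the test text at word level and at sub-word level.
word_oov = oov_rate(test_words, word_vocab)
subword_oov = oov_rate(to_subwords(test_words), subword_vocab)

print(f"word-level OOV rate:     {word_oov:.2f}")
print(f"sub-word-level OOV rate: {subword_oov:.2f}")
```

In this toy setup the word "học_viên" never appears in the training text, so it is out of vocabulary at the word level, while both of its syllables were seen inside other words and are therefore covered at the sub-word level.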
Main file
le2008sltu.pdf (151.72 KB)
Origin: Publisher files allowed on an open archive

Dates and versions

hal-01705670, version 1 (14-02-2018)

Identifiers

  • HAL Id: hal-01705670, version 1

Cite

Viet-Bac Le, Laurent Besacier, Sopheap Seng, Brigitte Bigi, Thi-Ngoc-Diep Do. Recent advances in Automatic Speech Recognition for Vietnamese. The first International Workshop on Spoken Languages Technologies for Under-resourced languages, 2008, Hanoi, Vietnam. ⟨hal-01705670⟩
