Recent advances in Automatic Speech Recognition for Vietnamese

Viet-Bac Le; Laurent Besacier; Sopheap Seng; Brigitte Bigi; Thi-Ngoc-Diep Do

Communication Dans Un Congrès Année : 2008

Recent advances in Automatic Speech Recognition for Vietnamese

(1) , (2) , (2) , (2) , (2)

1
2

Viet-Bac Le

Fonction : Auteur

Vocapia Research [Orsay]

Laurent Besacier

Fonction : Auteur
PersonId : 1521
IdHAL : laurent-besacier
ORCID : 0000-0001-7411-9125
IdRef : 079377017

Communication Langagière et Interaction Personne-Système

Sopheap Seng

Fonction : Auteur
PersonId : 992824

Communication Langagière et Interaction Personne-Système

Brigitte Bigi

Fonction : Auteur
PersonId : 7990
IdHAL : brigittebigi
ORCID : 0000-0003-1834-6918
IdRef : 079410790

Communication Langagière et Interaction Personne-Système

Thi-Ngoc-Diep Do

Fonction : Auteur
PersonId : 955335

Communication Langagière et Interaction Personne-Système

Résumé

This paper presents our recent activities for automatic speech recognition for Vietnamese. First, our text data collection and processing methods and tools are described. For language modeling, we investigate word, sub-word and also hybrid word/sub-word models. For acoustic modeling, when only limited speech data are available for Vietnamese, we propose some crosslingual acoustic modeling techniques. Furthermore, since the use of sub-word units can reduce the high out-of-vocabulary rate and improve the lack of text resources in statistical language modeling, we propose several methods to decompose, normalize and combine word and sub-word lattices generated from different ASR systems. Experimental results evaluated on the VnSpeechCorpus demonstrate the feasibility of our methods.

Mots clés

language modeling acoustic modeling word sub-word unit Index Terms – ASR Vietnamese

Domaines

Informatique et langage [cs.CL] Sciences de l'information et de la communication

Fichier principal

le2008sltu.pdf (151.72 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Brigitte Bigi : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01705670

Soumis le : mercredi 14 février 2018-10:29:19

Dernière modification le : jeudi 4 avril 2024-20:55:18

Archivage à long terme le : vendredi 4 mai 2018-10:13:33

Dates et versions

hal-01705670 , version 1 (14-02-2018)

Identifiants

HAL Id : hal-01705670 , version 1

Citer

Viet-Bac Le, Laurent Besacier, Sopheap Seng, Brigitte Bigi, Thi-Ngoc-Diep Do. Recent advances in Automatic Speech Recognition for Vietnamese. The first International Workshop on Spoken Languages Technologies for Under-resourced languages, 2008, Hanoi, Vietnam. ⟨hal-01705670⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA IMAG CNRS POLYTECH-GRENOBLE

138 Consultations

116 Téléchargements

Recent advances in Automatic Speech Recognition for Vietnamese

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager