Development of a Korean speech recognition system with little annotated data

Antoine Laurent; Lori Lamel

Conference paper, Year: 2014

Antoine Laurent

Role: Author
PersonId: 1034318

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Lori Lamel

Role: Author
PersonId: 15965
IdHAL: lori-lamel
ORCID: 0000-0001-7443-9938
IdRef: 127578056

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Abstract

This paper investigates the development of a speech-to-text transcription system for the Korean language in the context of the DGA RAPID Rapmat project. Korean is an alpha-syllabary language spoken by about 78 million people worldwide. As only a small amount of manually transcribed audio data was available, the acoustic models were trained in an unsupervised manner on audio data downloaded from several Korean websites, and the language models were trained on web texts. The reported word and character error rates are estimates, as the development corpus used in these experiments was also constructed from the untranscribed audio data, the web texts and automatic transcriptions. Several variants of unsupervised acoustic model training were compared to assess the influence of the vocabulary size (200k vs 2M), the type of language model (words vs characters), the acoustic unit (phonemes vs half-syllables), and the decoding strategy for the untranscribed audio corpus (incremental batch vs iterative).

Keywords

Speech recognition system; unsupervised acoustic training; Korean; approximative transcripts

Domains

Computer Science [cs]; Computation and Language [cs.CL]

https://hal.science/hal-01843405

Submitted on: Wednesday, July 18, 2018, 16:55:38

Last modified on: Saturday, October 7, 2023, 21:36:20

Dates and versions

hal-01843405, version 1 (18-07-2018)

Identifiers

HAL Id: hal-01843405, version 1

Cite

Antoine Laurent, Lori Lamel. Development of a Korean speech recognition system with little annotated data. International Workshop on Spoken Languages Technologies for Under-resourced languages, May 2014, St Petersburg, Russia. ⟨hal-01843405⟩

Collections

CNRS LIMSI SORBONNE-UNIVERSITE LISN
