Development of a Korean speech recognition system with little annontated data - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2014

Development of a Korean speech recognition system with little annontated data

Résumé

This paper investigates the development of a speech-to-text transcription system for the Korean language in the context of the DGA RAPID Rapmat project. Korean is an alpha-syllabary language spoken by about 78 million people worldwide. As only a small amount of manually transcribed audio data were available, the acoustic models were trained on audio data downloaded from several Korean websites in an unsupervised manner, and the language models were trained on web texts. The reported word and character error rates are estimates, as development corpus used in these experiments was also constructed from the untranscribed audio data, the web texts and automatic transcriptions. Several variants for unsupervised acoustic model training were compared to assess the influence of the vocabulary size (200k vs 2M), the type of language model (words vs characters), the acoustic unit (phonemes vs half-syllables), as well as incremental batch vs iterative decoding of the untranscribed audio corpus.
Fichier non déposé

Dates et versions

hal-01843405 , version 1 (18-07-2018)

Identifiants

  • HAL Id : hal-01843405 , version 1

Citer

Antoine Laurent, Lori Lamel. Development of a Korean speech recognition system with little annontated data. International Workshop on Spoken Languages Technologies for Under-resourced languages, May 2014, St Petersburg, Russia. ⟨hal-01843405⟩
12 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More