Comparison of approaches for an efficient phonetic decoding - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2013

Comparison of approaches for an efficient phonetic decoding

Résumé

This article analyzes the phonetic decoding performance obtained with different choices of linguistic units. The context is to later use such an approach as a support for helping communication with deaf people, and to run it on an embedded decoder on a portable terminal, which introduces constrains on the model size. As a first step, this paper presents and analyses the performance of various approaches. Two baseline systems are considered, one relying on a large vocabulary speech recognizer, and another one relying on a phonetic n-gram language model. Then syllable-based lexicons and language models are investigated. Various lexicon sizes are studied by setting thresholds on their frequency of occurrences in the training data. Evaluations are conducted on the ESTER and ETAPE speech corpora. Keeping only the most frequent syllables leads to a limited-size lexicon and language model, which nevertheless provides good phonetic decoding performance. The phone error rate is only 4% worse (absolute) than the phone error rate obtained with the large vocabulary recognizer, and much better than the phone error rate obtained with the phone n-gram language model.
Fichier principal
Vignette du fichier
articleIS2013-Luiza-Orosanu-final.pdf (137.28 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00834284 , version 1 (25-03-2016)

Identifiants

  • HAL Id : hal-00834284 , version 1

Citer

Luiza Orosanu, Denis Jouvet. Comparison of approaches for an efficient phonetic decoding. InterSpeech - 14th Annual Conference of the International Speech Communication Association - 2013, Aug 2013, Lyon, France. ⟨hal-00834284⟩
197 Consultations
100 Téléchargements

Partager

Gmail Facebook X LinkedIn More