Conference paper, Year: 2005

Variability of Automatic Speech Recognition Systems Using Different Features

Abstract

The paper describes the use of two recognizers fed by different acoustic features. The first recognizer performs Multiple Resolution Analysis (MRA), while the second computes JRASTA Perceptual Linear Prediction coefficients (JRASTAPLP). The two recognizers use the same denoising method but partition their acoustic spaces differently. Experiments with the Italian and Spanish components of the AURORA3 corpus show that, in a significant proportion of cases, the two systems assign substantially different posterior probabilities to the same phoneme in the same time interval. A decision rule is proposed for the cases in which the two recognizers hypothesize different words. It is based on the probability that a hypothesis is correct, given the identity of the competing word hypotheses. Significant word error rate (WER) reductions were found for the CH1 portion of the Italian and Spanish components of the AURORA3 corpus.
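
The abstract does not give the exact form of the decision rule, so the following is only a minimal sketch of one way such a rule could work, not the authors' implementation. It assumes that a development set with reference transcriptions is available, that the probability of a recognizer being correct is estimated from simple counts for each pair of competing words, and that cases where neither recognizer is correct are ignored. The class name `PairwiseArbiter`, its methods, the add-one smoothing, and the toy Italian digit words are all hypothetical.

```python
from collections import defaultdict


class PairwiseArbiter:
    """Picks one of two competing word hypotheses (illustrative sketch only)."""

    def __init__(self, smoothing=1.0):
        # Additive smoothing is an assumption; it keeps unseen pairs at 0.5.
        self.smoothing = smoothing
        # counts[(word_a, word_b)] = [times A was correct, times B was correct]
        self.counts = defaultdict(lambda: [0, 0])

    def observe(self, word_a, word_b, reference):
        """Record one development-set case where the recognizers may disagree."""
        if word_a == word_b:
            return  # no competition, nothing to learn
        pair = self.counts[(word_a, word_b)]
        if reference == word_a:
            pair[0] += 1
        elif reference == word_b:
            pair[1] += 1
        # Cases where both hypotheses are wrong are ignored in this sketch.

    def prob_a_correct(self, word_a, word_b):
        """Estimate P(recognizer A is correct | the competing pair)."""
        a_hits, b_hits = self.counts.get((word_a, word_b), (0, 0))
        return (a_hits + self.smoothing) / (a_hits + b_hits + 2 * self.smoothing)

    def decide(self, word_a, word_b):
        """Return the hypothesis judged more likely to be correct."""
        if word_a == word_b:
            return word_a
        # Ties (including unseen pairs) fall back to recognizer A.
        return word_a if self.prob_a_correct(word_a, word_b) >= 0.5 else word_b


# Toy development data: (hypothesis from A, hypothesis from B, reference word).
arbiter = PairwiseArbiter()
for hyp_a, hyp_b, ref in [
    ("uno", "due", "uno"),
    ("uno", "due", "uno"),
    ("uno", "due", "due"),
    ("sei", "tre", "tre"),
]:
    arbiter.observe(hyp_a, hyp_b, ref)

print(arbiter.decide("uno", "due"))
print(arbiter.decide("sei", "tre"))
```

Keying the counts on the ordered pair (word from A, word from B) lets the estimate capture confusions that are specific to each recognizer, which is the kind of dependence on "the identity of the word hypotheses that are in competition" that the abstract mentions.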
Main file
Barrault-EUROSPEECH2005.pdf (138.66 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00433101, version 1 (18-11-2009)

Identifiers

  • HAL Id: hal-00433101, version 1

Cite

Loïc Barrault, Renato de Mori, Roberto Gemello, Franco Mana, Driss Matrouf. Variability of Automatic Speech Recognition Systems Using Different Features. European Conference on Speech Communication and Technology, Interspeech'05, Sep 2005, Lisbon, Portugal. pp.2CP2a-5. ⟨hal-00433101⟩
282 views
78 downloads
