Variability of Automatic Speech Recognition Systems Using Different Features

Loïc Barrault; Renato de Mori; Roberto Gemello; Franco Mana; Driss Matrouf

Communication Dans Un Congrès Année : 2005

Variability of Automatic Speech Recognition Systems Using Different Features

(1, 2) , (2) , (3) , (3) , (2)

1
2
3

Loïc Barrault

Fonction : Auteur
PersonId : 15276
IdHAL : loicbarrault
ORCID : 0000-0002-0634-6147
IdRef : 131912488

Laboratoire d'Informatique de l'Université du Maine

Laboratoire Informatique d'Avignon

Renato de Mori

Fonction : Auteur

Laboratoire Informatique d'Avignon

Roberto Gemello

Fonction : Auteur

LOQUENDO

Franco Mana

Fonction : Auteur

LOQUENDO

Driss Matrouf

Fonction : Auteur
PersonId : 176307
IdHAL : driss-matrouf
IdRef : 137773439

Laboratoire Informatique d'Avignon

Résumé

The paper describes the use of two recognizers fed by different acoustic features. The first recognizer performs Multiple Resolution Analysis (MRA) while the other recognizer computes JRASTA Perceptual Linear Prediction Coefficients (JRASTAPLP). The two recognizers use the same denoising method but perform different partitions of their acoustic spaces. Experiments with the Italian and Spanish components of the AURORA3 corpus show that the two systems provide, in a significant proportion of cases, substantially different posterior probabilities for the same phoneme in the same time interval. A decision rule is proposed when two different words are hypothesized by the two recognizers. It is based on the probability that a hypothesis is correct, given the identity of the word hypotheses that are in competition. Significant word error rate (WER) reductions have been found for the CH1 proportion of the Italian and Spanish components of the AURORA3 corpus.

Mots clés

feature variability automatic speech recognition

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Fichier principal

Barrault-EUROSPEECH2005.pdf (138.66 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Loïc BARRAULT : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00433101

Soumis le : mercredi 18 novembre 2009-10:55:39

Dernière modification le : vendredi 24 mars 2023-14:52:52

Archivage à long terme le : jeudi 17 juin 2010-18:51:14

Dates et versions

hal-00433101 , version 1 (18-11-2009)

Identifiants

HAL Id : hal-00433101 , version 1

Citer

Loïc Barrault, Renato de Mori, Roberto Gemello, Franco Mana, Driss Matrouf. Variability of Automatic Speech Recognition Systems Using Different Features. European Conference on Speech Communication and Technology, Interspeech'05, Sep 2005, Lisbon, Portugal. pp.2CP2a-5. ⟨hal-00433101⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-AVIGNON CNRS UNIV-LEMANS LIUM LIUM-LST LIA

282 Consultations

78 Téléchargements

Variability of Automatic Speech Recognition Systems Using Different Features

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager