The LIA Speech Recognition System: From 10xRT to 1xRT

Georges Linarès; Pascal Nocera; D Massonié; Driss Matrouf

Conference paper, Year: 2007

Georges Linarès

Role: Author
PersonId : 4977
IdHAL : georges-linares
IdRef : 079368794

Laboratoire Informatique d'Avignon

Pascal Nocera

Role: Author

Laboratoire Informatique d'Avignon

D Massonié

Role: Author

Driss Matrouf

Role: Author
PersonId : 176307
IdHAL : driss-matrouf
IdRef : 137773439

Laboratoire Informatique d'Avignon

Abstract

The LIA developed a speech recognition toolkit providing most of the components required by speech-to-text systems. This toolbox was used to build a Broadcast News (BN) transcription system that took part in the ESTER evaluation campaign ([1]), on both the unconstrained transcription and real-time transcription tasks. In this paper, we describe the techniques we used to reach real time, starting from our baseline 10xRT system. We focus on some aspects of the A* search algorithm that are critical for both efficiency and accuracy. Then, we evaluate the impact of the different system components (lexicon, language models and acoustic models) on the trade-off between efficiency and accuracy. Experiments are carried out in the framework of the ESTER evaluation campaign. Our results show that the real-time system performs about 5.6% absolute WER (Word Error Rate) worse than the standard 10xRT system, with an absolute WER of about 26.8%.
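The two quantities the abstract trades off against each other can be illustrated with a minimal Python sketch. It assumes the conventional definitions: WER as the word-level edit distance divided by the reference length, and the real-time factor (the "x" in 10xRT and 1xRT) as processing time divided by audio duration. The function names and the example sentences are hypothetical and are not taken from the LIA toolkit or the ESTER scoring tools.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] is the edit distance between the reference prefix seen so far and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion of a reference word
                            curr[j - 1] + 1,      # insertion of a hypothesis word
                            prev[j - 1] + cost))  # substitution or exact match
        prev = curr
    return prev[-1] / len(ref)


def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Ratio of decoding time to audio duration: 10.0 means 10xRT, 1.0 means 1xRT."""
    if audio_seconds <= 0:
        raise ValueError("audio duration must be positive")
    return processing_seconds / audio_seconds


if __name__ == "__main__":
    # Hypothetical example: one substitution and one deletion over six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
    # Hypothetical timings: 600 s of decoding for 60 s of audio is a 10xRT system.
    print(real_time_factor(600.0, 60.0))
```

Under these definitions, an "absolute" WER difference is a plain subtraction of two WER percentages, which is how the 5.6% figure in the abstract is expressed.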

Domains

Computer Science [cs]

https://hal.science/hal-01318280

Submitted on: Thursday, May 19, 2016, 14:16:11

Last modified on: Tuesday, March 22, 2022, 14:40:01

Dates and versions

hal-01318280, version 1 (19-05-2016)

Identifiers

HAL Id: hal-01318280, version 1

Cite

Georges Linarès, Pascal Nocera, D Massonié, Driss Matrouf. The LIA Speech Recognition System: From 10xRT to 1xRT. 10th International Conference, TSD, Sep 2007, Pilsen, Czech Republic. ⟨hal-01318280⟩

Collections

UNIV-AVIGNON LIA
