Bag of n-gram driven decoding for LVCSR system harnessing - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2011

Bag of n-gram driven decoding for LVCSR system harnessing

Résumé

—This paper focuses on automatic speech recognition systems combination based on driven decoding paradigms. The driven decoding algorithm (DDA) involves the use of a 1-best hypothesis provided by an auxiliary system as another knowledge source in the search algorithm of a primary system. In previous studies, it was shown that DDA outperforms ROVER when the primary system is guided by a more accurate system. In this paper we propose a new method to manage auxiliary transcriptions which are presented as a bag-of-n-grams (BONG) without temporal matching. These modifications allow to make easier the combination of several hypotheses given by different auxiliary systems. Using BONG combination with hypotheses provided by two auxiliary systems, each of which obtained more than 23% of WER on the same data, our experiments show that a CMU Sphinx based ASR system can reduce its WER from 19.85% to 18.66% which is better than the results reached with DDA or classical ROVER combination.
Fichier non déposé

Dates et versions

hal-01315538 , version 1 (13-05-2016)

Identifiants

  • HAL Id : hal-01315538 , version 1

Citer

Bougares Fethi, Yannick Estevez, Paul Deléglise, Georges Linarès. Bag of n-gram driven decoding for LVCSR system harnessing. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), Dec 2011, Waikoloa, United States. ⟨hal-01315538⟩
107 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More