Skip to Main content Skip to Navigation
Journal articles

Frame-Synchronous and Local Confidence Measures for Automatic Speech recognition

Joseph Razik 1 Odile Mella 2 Dominique Fohr 2 Jean-Paul Haton 2
2 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : In this paper, we introduce two new confidence measures for large vocabulary speech recognition systems. The major feature of these measures is that they can be computed without waiting for the end of the audio stream. We proposed two kinds of confidence measures: frame-synchronous and local. The frame-synchronous ones can be computed as soon as a frame is processed by the recognition engine and are based on a likelihood ratio. The local measures estimate a local posterior probability in the vicinity of the word to analyze. We evaluated our confidence measures within the framework of the automatic transcription of French broadcast news with the EER criterion. Our local measures achieved results very close to the best state-of-the-art measure (EER of 23% compared to 22.0%). We then conducted a preliminary experiment to assess the contribution of our confidence measure in improving the comprehension of an automatic transcription for the hearing impaired. We introduced several modalities to highlight words of low confidence in this transcription. We showed that these modalities used with our local confidence measure improved the comprehension of automatic transcription
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-00579092
Contributor : Dominique Fohr <>
Submitted on : Wednesday, March 23, 2011 - 9:51:14 AM
Last modification on : Thursday, March 5, 2020 - 4:55:47 PM

Identifiers

Collections

Citation

Joseph Razik, Odile Mella, Dominique Fohr, Jean-Paul Haton. Frame-Synchronous and Local Confidence Measures for Automatic Speech recognition. International Journal of Pattern Recognition and Artificial Intelligence, World Scientific Publishing, 2011, 25 (2), pp.1-26. ⟨10.1142/S0218001411008543⟩. ⟨hal-00579092⟩

Share

Metrics

Record views

318