Étude de l'informativité des transcriptions : une approche basée sur le résumé automatique

Abstract : In this paper we propose a new approach to evaluate the informativeness of transcriptions coming from Automatic Speech Recognition systems. This approach, based in the notion of informativeness, is focused on the framework of Automatic Text Summarization performed over these transcriptions. At a first glance we estimate the informative content of the various automatic transcriptions, then we explore the capacity of Automatic Text Summarization to overcome the informative loss. To do this we use an automatic summary evaluation protocol without reference (based on the informative content), which computes the divergence between probability distributions of different textual representations: manual and automatic transcriptions and their summaries. After a set of evaluations this analysis allowed us to judge both the quality of the transcriptions in terms of informativeness and to assess the ability of automatic text summarization to compensate the problems raised during the transcription phase.
Complete list of metadatas

Cited literature [2 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01822585
Contributor : Carlos-Emiliano Gonzalez-Gallardo <>
Submitted on : Monday, June 25, 2018 - 12:26:56 PM
Last modification on : Friday, March 22, 2019 - 11:34:07 AM
Long-term archiving on : Wednesday, September 26, 2018 - 1:55:33 PM

File

coria_final_24_04.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01822585, version 1

Collections

Citation

Carlos-Emiliano González-Gallardo, Malek Hajjem, Eric Sanjuan, Juan-Manuel Torres-Moreno. Étude de l'informativité des transcriptions : une approche basée sur le résumé automatique. Conférence en Recherche d’Information et Applications (CORIA), May 2018, Rennes, France. ⟨hal-01822585⟩

Share

Metrics

Record views

77

Files downloads

57