Conference paper, Year: 2012

The ETAPE corpus for the evaluation of speech-based TV content processing in the French language

Abstract

This paper presents a comprehensive overview of existing data for the evaluation of spoken content processing in a multimedia framework for the French language. We focus on the ETAPE corpus, which will be made publicly available by ELDA at the end of 2012, after completion of the evaluation, and recall existing resources resulting from previous evaluation campaigns. The ETAPE corpus consists of 30 hours of TV and radio broadcasts, selected to cover a wide variety of topics and speaking styles, emphasizing spontaneous speech and multiple speaker areas.
Main file
final-v2.pdf (93.58 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00712591, version 1 (01-07-2012)

Identifiers

  • HAL Id: hal-00712591, version 1

Cite

Guillaume Gravier, Gilles Adda, Niklas Paulson, Matthieu Carré, Aude Giraudel, et al. The ETAPE corpus for the evaluation of speech-based TV content processing in the French language. LREC - Eighth international conference on Language Resources and Evaluation, 2012, Istanbul, Turkey. ⟨hal-00712591⟩
1370 views
1233 downloads
