The third `CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

The third `CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines

Résumé

The CHiME challenge series aims to advance far field speech recognition technology by promoting research at the interface of signal processing and automatic speech recognition. This paper presents the design and outcomes of the 3rd CHiME Challenge, which targets the performance of automatic speech recognition in a real-world, commercially-motivated scenario: a person talking to a tablet device that has been fitted with a six-channel microphone array. The paper describes the data collection, the task definition and the base-line systems for data simulation, enhancement and recognition. The paper then presents an overview of the 26 systems that were submitted to the challenge focusing on the strategies that proved to be most successful relative to the MVDR array processing and DNN acoustic modeling reference system. Challenge findings related to the role of simulated data in system training and evaluation are discussed.
Fichier principal
Vignette du fichier
barker_ASRU15.pdf (171.81 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01211376 , version 1 (05-10-2015)

Identifiants

  • HAL Id : hal-01211376 , version 1

Citer

Jon Barker, Ricard Marxer, Emmanuel Vincent, Shinji Watanabe. The third `CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines. 2015 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2015), Dec 2015, Scottsdale, AZ, United States. ⟨hal-01211376⟩
672 Consultations
5304 Téléchargements

Partager

Gmail Facebook X LinkedIn More