Monaural speech separation and recognition challenge - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Computer Speech and Language Année : 2009

Monaural speech separation and recognition challenge

Martin Cooke
  • Fonction : Auteur correspondant
  • PersonId : 902213

Connectez-vous pour contacter l'auteur
John R. Hershey
  • Fonction : Auteur
Steven J. Rennie
  • Fonction : Auteur

Résumé

Robust speech recognition in everyday conditions requires the solution to a number of challenging problems, not least the ability to handle multiple sound sources. The specific case of speech recognition in the presence of a competing talker has been studied for several decades, resulting in a number of quite distinct algorithmic solutions whose focus ranges from modeling both target and competing speech to speech separation using auditory grouping principles. The purpose of the monaural speech separation and recognition challenge was to permit a large-scale comparison of techniques for the competing talker problem. The task was to identify keywords in sentences spoken by a target talker when mixed into a single channel with a background talker speaking similar sentences. Ten independent sets of results were contributed, alongside a baseline recognition system. Performance was evaluated using common training and test data and common metrics. Listeners' performance in the same task was also measured. This paper describes the challenge problem, compares the performance of the contributed algorithms, and discusses the factors which distinguish the systems. One highlight of the comparison was the finding that several systems achieved near-human performance in some conditions, and one out-performed listeners overall.
Fichier principal
Vignette du fichier
PEER_stage2_10.1016%2Fj.csl.2009.02.006.pdf (390.93 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00598185 , version 1 (05-06-2011)

Identifiants

Citer

Martin Cooke, John R. Hershey, Steven J. Rennie. Monaural speech separation and recognition challenge. Computer Speech and Language, 2009, 24 (1), pp.1. ⟨10.1016/j.csl.2009.02.006⟩. ⟨hal-00598185⟩

Collections

PEER
81 Consultations
721 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More