Using Phonologically Weighted Levenshtein Distances for the Prediction of Microscopic Intelligibility - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

Using Phonologically Weighted Levenshtein Distances for the Prediction of Microscopic Intelligibility

Résumé

This article presents a new method for analyzing Automatic Speech Recognition (ASR) results at the phonological feature level. To this end the Levenshtein distance algorithm is refined in order to take into account the distinctive features opposing substituted phonemes. This method allows to survey features additions or deletions, providing microscopic qualitative information as a complement to word recognition scores. To explore the relevance of the qualitative data gathered by this method, a study is conducted on a speech corpus simulating presbycusis effects on speech perception at eight severity stages. Consonantic features additions and deletions in ASR outputs are analyzed and put in relation with intelligibility data collected in 30 human subjects. ASR results show monotonic trends in most conso- nantic features along the degradation conditions, which appear to be consistent with the misperceptions that could be observed in human subjects.
Fichier principal
Vignette du fichier
fontan_17158.pdf (542.69 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-01474904 , version 1 (23-02-2017)

Identifiants

Citer

Lionel Fontan, Isabelle Ferrané, Jérôme Farinas, Julien Pinquier, Xavier Aumont. Using Phonologically Weighted Levenshtein Distances for the Prediction of Microscopic Intelligibility. Annual conference Interspeech (INTERSPEECH 2016), Sep 2016, San Francisco, CA, United States. pp.650-654, ⟨10.21437/Interspeech.2016-431⟩. ⟨hal-01474904⟩
301 Consultations
1425 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More