Towards Interpreting Deep Learning Models to Understand Loss of Speech Intelligibility in Speech Disorders — Step 2: Contribution of the emergence of phonetic traits

Sondes Abderrazek; Corinne Fredouille; Alain Ghio; Muriel Lalain; Christine Meunier; Virginie Woisard

doi:10.1109/ICASSP43922.2022.9746198

Conference paper. Year: 2022



Sondes Abderrazek

Role: Author
PersonId : 750420
IdHAL : sondes-abderrazek
ORCID : 0000-0003-1353-8152

Laboratoire Informatique d'Avignon

Avignon Université

Corinne Fredouille

Role: Author
PersonId : 173870
IdHAL : corinne-fredouille
ORCID : 0000-0002-0413-8950
IdRef : 079420516

Avignon Université

Laboratoire Informatique d'Avignon

Alain Ghio

Role: Author
PersonId : 6135
IdHAL : alain-ghio
ORCID : 0000-0001-7302-0799
IdRef : 133448975

Laboratoire Parole et Langage

Muriel Lalain

Role: Author

Laboratoire Parole et Langage

Christine Meunier

Role: Author

Laboratoire Parole et Langage

Virginie Woisard

Role: Author

Centre Hospitalier Universitaire de Toulouse

Laboratoire de NeuroPsychoLinguistique

Abstract

Beyond the impressive performance deep learning has achieved on several tasks, one of the most important factors for its continued progress is the growing body of work on interpretability, especially in a medical context. In recent work, we presented the competitive performance of a CNN-based model trained on normal speech for French phone classification, and showed that it correlates well with different perceptual measures when exposed to disordered speech. This paper extends that work by focusing on interpretability. The goal is to gain insight into how neural representations shape the final phone classification task, so that this insight can later be used to explain the loss of intelligibility in disordered speech. To this end, an original framework is proposed. It relies first on the neural activity and a novel per-neuron representation, here defined with respect to phone classification, and second makes it possible to identify a set of neurons devoted to detecting specific phonetic traits in normal speech. When the model is faced with disordered speech, a degradation of that set of neurons is observed, demonstrating a loss of specific phonetic traits in some of the patients involved and the potential of the proposed approach to provide information about speech alteration.

Keywords

Deep learning; Interpretability; Phonetic traits; Intelligibility; Head and Neck Cancer; Speech disorders

Domains

Computer Science [cs]; Artificial Intelligence [cs.AI]

Main file

ICASSP_2022_FINAL.pdf (726.13 KB)

Origin: Files produced by the author(s)


https://hal.science/hal-03843238

Submitted on: Friday, November 11, 2022, 10:49:50

Last modified on: Monday, November 20, 2023, 11:44:24

Long-term archiving on: Sunday, February 12, 2023, 18:15:09

Dates and versions

hal-03843238 , version 1 (11-11-2022)

Identifiers

HAL Id : hal-03843238 , version 1
DOI : 10.1109/ICASSP43922.2022.9746198

Cite

Sondes Abderrazek, Corinne Fredouille, Alain Ghio, Muriel Lalain, Christine Meunier, et al. Towards Interpreting Deep Learning Models to Understand Loss of Speech Intelligibility in Speech Disorders — Step 2: Contribution of the emergence of phonetic traits. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2022, Singapore, Singapore. pp. 7387-7391, ⟨10.1109/ICASSP43922.2022.9746198⟩. ⟨hal-03843238⟩


Collections

UNIV-AVIGNON UNIV-TLSE2 CNRS UNIV-AMU OCTOGONE LPL-AIX LIA ANR INCIAM UNIV-UT3 UT3-TOULOUSEINP

