Skip to Main content Skip to Navigation
Conference papers

Evaluation of a Phone-Based Anomaly Detection Approach for Dysarthric Speech

Abstract : Perceptual evaluation is still the most common method in clinical practice for the diagnosing and the following of the condition progression of people with speech disorders. Many automatic approaches were proposed to provide objective tools to deal with speech disorders and help professionals in the severity evaluation of speech impairments. This paper investigates an automatic phone-based anomaly detection approach implying an automatic text-constrained phone alignment. Here, anomalies are related to speech segments, for which an unexpected acoustic pattern is observed, compared with a normal speech production. This objective tool is applied to French dysarthric speech recordings produced by patients suffering from four different pathologies. The behavior of the anomaly detection approach is studied according to the precision of the automatic phone alignment. Faced with the difficulties of having a gold standard reference, especially for the phone-based anomaly annotation , this behavior is observed on both annotated and non-annotated corpora. As expected, alignment errors (large shifts compared with a manual segmentation) lead to a large amount of anomalies automatically detected. However, about 50% of correctly detected anomalies are not related to alignment errors. This behavior shows that the automatic approach is able to catch irregular acoustic patterns of phones.
Document type :
Conference papers
Complete list of metadata

Cited literature [33 references]  Display  Hide  Download
Contributor : Corinne Fredouille <>
Submitted on : Friday, April 26, 2019 - 1:26:28 PM
Last modification on : Monday, March 29, 2021 - 2:26:07 PM


Publisher files allowed on an open archive




Imed Laaridh, Corinne Fredouille, Christine Meunier. Evaluation of a Phone-Based Anomaly Detection Approach for Dysarthric Speech. Interspeech 2016, Sep 2016, San Francisco, United States. pp.223-227, ⟨10.21437/Interspeech.2016-1077⟩. ⟨hal-02102786⟩



Record views


Files downloads