Speech and Speaker Recognition for Home Automation: Preliminary Results

In voice-controlled multi-room smart homes, ASR and speaker identification systems face distant-speech conditions, which have a significant impact on performance. Regarding voice command recognition, this paper presents an approach that dynamically selects the best channel and adapts models to the environmental conditions. The method was tested on data recorded with 11 elderly and visually impaired participants in a real smart home. The voice command recognition error rate was 3.2% in the off-line condition and 13.2% in the online condition. For speaker identification, performance was very speaker dependent. However, we show a high correlation between performance and training size. The main difficulty was that utterances were too short in comparison with state-of-the-art studies. Moreover, speaker identification performance depends on the size of the adaptation corpus, so users must record enough data before using the system.

Keywords

Speaker recognition; Vocal command; Home Automation; Voice controlled smart home

Domains

Other [cs.OH]

Main file

2015_SPED_Vacher_HAL.pdf (279.96 KB)

Origin: Files produced by the author(s)

Contributor: Michel Vacher

https://hal.science/hal-01207692

Submitted on: Tuesday, October 27, 2015, 15:21:00

Last modified on: Thursday, April 4, 2024, 18:26:59

Long-term archiving on: Friday, April 28, 2017, 05:40:57

Dates and versions

hal-01207692, version 1 (2015-10-01)

hal-01207692, version 2 (2015-10-27)

Identifiers

HAL Id: hal-01207692, version 2

Cite

Michel Vacher, Benjamin Lecouteux, Javier Serrano-Romero, Moez Ajili, François Portet, et al. Speech and Speaker Recognition for Home Automation: Preliminary Results. 8th International Conference Speech Technology and Human-Computer Dialogue "SpeD 2015", Oct 2015, Bucharest, Romania. pp.181-190. ⟨hal-01207692v2⟩

Collections

UGA CNRS LIG LIG_TDCGE_GETALP ANR LIG_SIDCH

197 Views

1370 Downloads