The Usability of Speech and/or Gestures in Multi-Modal Interface Systems

Farzana Alibay; Manolya Kavakli; Jean-Rémy Chardonnet; Muhammad Zeeshan Baig

Communication Dans Un Congrès Année : 2017

The Usability of Speech and/or Gestures in Multi-Modal Interface Systems

(1) , (1) , (2) , (1)

1
2

Farzana Alibay

Fonction : Auteur

Macquarie University

Manolya Kavakli

Fonction : Auteur

Macquarie University

Jean-Rémy Chardonnet

Fonction : Auteur
PersonId : 15245
IdHAL : jean-remy-chardonnet
ORCID : 0000-0002-8926-1359
IdRef : 136736912

Laboratoire Electronique, Informatique et Image [UMR6306]

Muhammad Zeeshan Baig

Fonction : Auteur

Macquarie University

Résumé

Multi-Modal Interface Systems (MMIS) have proliferated in the last few decades, since they provide a direct interface for both Human Computer Interaction (HCI) and face-to-face communication. Our aim is to provide users without any prior 3D modelling experience, with a multi-modal interface to create a 3D object. The system also incorporates help throughout the drawing process and identifies simple words and gestures to accomplish a range of (simple to complex) modeling tasks. We have developed a multi-modal interface that allows users to design objects in 3D, using AutoCAD commands as well as speech and gesture. We have used a microphone to collect speech input and a Leap Motion sensor to collect gesture input in real time. Two sets of experiments were conducted to investigate the usability of the system and evaluate the system performance using Leap Motion versus keyboard and mouse. Our results indicate that performing a task using speech is perceived exhausting, when there is no shared vocabulary between man and machine, and the usability of traditional input devices supersedes the usability of speech and gestures. Only a small ratio of participants, less than 7% in our experiments were able to carry out the tasks with appropriate precision.

Mots clés

Gesture Speech Semantics Emotion recognition Kinect 3D object Leap Motion

Domaines

Interface homme-machine [cs.HC] Synthèse d'image et réalité virtuelle [cs.GR]

Fichier principal

LE2I_ICCAE_2017_CHARDONNET.pdf (584.06 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Compte De Service Administrateur Ensam : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01502555

Soumis le : mercredi 5 avril 2017-17:01:14

Dernière modification le : jeudi 7 septembre 2023-16:08:10

Archivage à long terme le : jeudi 6 juillet 2017-13:53:24

Dates et versions

hal-01502555 , version 1 (05-04-2017)

Identifiants

HAL Id : hal-01502555 , version 1
ENSAM : http://hdl.handle.net/10985/11682

Citer

Farzana Alibay, Manolya Kavakli, Jean-Rémy Chardonnet, Muhammad Zeeshan Baig. The Usability of Speech and/or Gestures in Multi-Modal Interface Systems. International Conference on Computer and Automation Engineering (ICCAE 2017), Feb 2017, Sydney, Australia. pp.1-5. ⟨hal-01502555⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-BOURGOGNE CNRS LE2I HESAM IRENAV LAMPA LCPI LABOMAP LISPEN MSMP

127 Consultations

157 Téléchargements

The Usability of Speech and/or Gestures in Multi-Modal Interface Systems

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager