Soft Spatial Attention-Based Multimodal Driver Action Recognition Using Deep Learning

Imen Jegham; Anouar Ben Khalifa; Ihsen Alouani; Mohamed Ali Mahjoub

doi:10.1109/JSEN.2020.3019258

Journal article, IEEE Sensors Journal, Year: 2021

Imen Jegham

Role: Author

Anouar Ben Khalifa

Role: Author

Ihsen Alouani

Role: Author
PersonId: 748209
IdHAL: ihsen-alouani
ORCID: 0000-0001-5102-8087
IdRef: 193940922

Institut d’Électronique, de Microélectronique et de Nanotechnologie - UMR 8520

COMmunications NUMériques - IEMN

Mohamed Ali Mahjoub

Role: Author
PersonId: 868425

Abstract

Driver behaviors and decisions are crucial factors for on-road driving safety. A precise driver behavior monitoring system can significantly reduce traffic accidents and injuries. However, understanding human behavior in real-world driving settings is challenging because of uncontrolled conditions, including illumination variation, occlusion, and dynamic, cluttered backgrounds. In this paper, a Kinect sensor, which provides multimodal signals, is adopted as a driver monitoring sensor to recognize safe driving and the most distracting common secondary in-vehicle actions. We propose a novel soft spatial attention-based network named the Depth-based Spatial Attention network (DSA), which adds a cognitive process to a deep network by selectively focusing on the driver's silhouette and motion in the cluttered driving scene. Specifically, at each time step t, we introduce a new weighted RGB frame based on an attention model designed using a depth frame. The final classification accuracy is substantially higher than state-of-the-art results, with an improvement of up to 27%. © 2001-2012 IEEE.
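
The abstract describes weighting each RGB frame with an attention model built from the matching depth frame, but it does not give the formula. The sketch below is only an illustration of that general idea, not the DSA network itself. It assumes that pixels closer to the sensor are more likely to belong to the driver, that a depth value of 0 marks a missing measurement, and that a min-max normalized inverse depth is an acceptable soft weight. The function names `depth_attention_map` and `weight_rgb_frame` are hypothetical.

```python
import numpy as np


def depth_attention_map(depth, eps=1e-8):
    """Turn a depth frame (H, W) into soft weights in [0, 1].

    Assumption (not from the paper): nearer pixels get larger weights,
    and a depth of 0 means "no measurement" and gets weight 0.
    """
    depth = depth.astype(np.float64)
    valid = depth > 0
    attention = np.zeros_like(depth)
    if valid.any():
        d_min = depth[valid].min()
        d_max = depth[valid].max()
        # Min-max normalized inverse depth: nearest -> ~1, farthest -> 0.
        attention[valid] = (d_max - depth[valid]) / (d_max - d_min + eps)
    return attention


def weight_rgb_frame(rgb, depth):
    """Scale every color channel of an RGB frame (H, W, 3) by the depth-based weights."""
    attention = depth_attention_map(depth)
    # Broadcast the (H, W) map over the channel axis.
    return rgb.astype(np.float64) * attention[..., None]


if __name__ == "__main__":
    # Toy 2x2 frame at a single time step t: depth in millimeters, uniform gray RGB.
    depth_t = np.array([[0, 1000], [2000, 3000]])
    rgb_t = np.full((2, 2, 3), 100, dtype=np.uint8)
    print(weight_rgb_frame(rgb_t, depth_t)[..., 0])
```

Because the weights are continuous rather than a hard 0/1 mask, background pixels are attenuated instead of removed, which is what "soft" attention usually means. A learned attention model could replace the hand-written normalization without changing the weighting step.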

Keywords

deep learning; driver action recognition; Kinect sensor; multimodal; spatial soft attention

Domains

Physics [physics]; Computer Science [cs]; Engineering Sciences [physics]; Networking and Telecommunications [cs.NI]; Artificial Intelligence [cs.AI]; Electronics; Signal and Image Processing [eess.SP]

https://hal.science/hal-03542171

Submitted on: Tuesday, January 25, 2022, 10:53:21

Last modified on: Wednesday, January 24, 2024, 09:54:25

Dates and versions

hal-03542171, version 1 (25-01-2022)

Identifiers

HAL Id: hal-03542171, version 1
DOI: 10.1109/JSEN.2020.3019258

Cite

Imen Jegham, Anouar Ben Khalifa, Ihsen Alouani, Mohamed Ali Mahjoub. Soft Spatial Attention-Based Multimodal Driver Action Recognition Using Deep Learning. IEEE Sensors Journal, 2021, 21 (2), pp.1918-1925. ⟨10.1109/JSEN.2020.3019258⟩. ⟨hal-03542171⟩

Collections

CNRS UNIV-VALENCIENNES IEMN UNIV-LILLE INSA-GROUPE INSA-HAUTS-DE-FRANCE
