Skip to Main content Skip to Navigation
New interface
Conference papers

Online Spatio-Temporal 3D Convolutional Neural Network for Early Recognition of Handwritten Gestures

Abstract : Inspired by recent spatio-temporal Convolutional Neural Networks in computer vision field, we propose OLT-C3D (Online Long-Term Convolutional 3D), a new architecture based on a 3D Convolutional Neural Network (3D CNN) to address the complex task of early recognition of 2D handwritten gestures in real time. The input signal of the gesture is translated into an image sequence along time with the trajectory history. The image sequence is passed into our 3D CNN OLT-C3D which gives a prediction at each new frame. OLT-C3D is coupled with an integrated temporal reject system to postpone the decision in time if more information is needed. Moreover our system is end-to-end trainable, OLT-C3D and the temporal reject system are jointly trained to optimize the earliness of the decision. Our approach achieves superior performances on two complementary and freely available datasets: ILGDB and MTGSetB.
Complete list of metadata

https://hal.archives-ouvertes.fr/hal-03229957
Contributor : William Mocaër Connect in order to contact the contributor
Submitted on : Wednesday, May 19, 2021 - 1:27:14 PM
Last modification on : Thursday, September 1, 2022 - 3:55:44 AM
Long-term archiving on: : Friday, August 20, 2021 - 6:33:05 PM

File

ICDAR_2021__early_reco2D_VCR.p...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-03229957, version 1

Citation

William Mocaër, Eric Anquetil, Richard Kulpa. Online Spatio-Temporal 3D Convolutional Neural Network for Early Recognition of Handwritten Gestures. ICDAR 2021 - 16th International Conference on Document Analysis and Recognition, Sep 2021, Lausanne, Switzerland. pp.221-236. ⟨hal-03229957⟩

Share

Metrics

Record views

324

Files downloads

77