Skip to Main content Skip to Navigation
Conference papers

Sport Action Recognition with Siamese Spatio-Temporal CNNs: Application to Table Tennis

Abstract : Human action recognition in video is one of the key problems in visual data interpretation. Despite intensive research, the recognition of actions with low inter-class variability remains a challenge. This paper presents a new Siamese Spatio-Temporal Convolutional neural network (SSTC) for this purpose. When applied to table tennis, it is possible to detect and recognize 20 table tennis strokes. The model has been trained on a specific dataset, TTStroke-21, recorded in natural condition (markerless) at the Faculty of Sports of the University of Bordeaux. Our model takes as inputs a RGB image sequence and its computed Optical Flow. After 3 spatio-temporal convolutions, data are fused in a fully connected layer of a proposed siamese network architecture. Our method reaches an accuracy of 91.4% against 43.1% for our baseline.
Document type :
Conference papers
Complete list of metadata

Cited literature [21 references]  Display  Hide  Download
Contributor : Pierre-Etienne Martin Connect in order to contact the contributor
Submitted on : Tuesday, November 12, 2019 - 4:10:41 PM
Last modification on : Tuesday, January 4, 2022 - 5:36:33 AM
Long-term archiving on: : Thursday, February 13, 2020 - 5:24:26 PM


Files produced by the author(s)



Pierre-Etienne Martin, Jenny Benois-Pineau, Renaud Péteri, Julien Morlier. Sport Action Recognition with Siamese Spatio-Temporal CNNs: Application to Table Tennis. 2018 International Conference on Content-Based Multimedia Indexing (CBMI), Sep 2018, La Rochelle, France. pp.1-6, ⟨10.1109/CBMI.2018.8516488⟩. ⟨hal-02360011⟩



Les métriques sont temporairement indisponibles