Fine grained sport action recognition with Twin spatio-temporal convolutional neural networks

Pierre-Etienne Martin; Jenny Benois-Pineau; Renaud Péteri; Julien Morlier

doi:10.1007/s11042-020-08917-3

Journal article, Multimedia Tools and Applications, Year: 2020

Pierre-Etienne Martin

Role: Author
PersonId : 179217
IdHAL : pierre-etienne-martin
ORCID : 0000-0002-9593-4580

Laboratoire Bordelais de Recherche en Informatique

Jenny Benois-Pineau

Role: Author
PersonId : 7842
IdHAL : jenny-benois-pineau
ORCID : 0000-0003-0659-8894
IdRef : 074466992

Laboratoire Bordelais de Recherche en Informatique

Renaud Péteri

Role: Author
PersonId : 179346
IdHAL : renaud-peteri
ORCID : 0000-0002-6584-4189
IdRef : 161560822

Mathématiques, Image et Applications - EA 3165

Julien Morlier

Role: Author
PersonId : 844020

Laboratoire de l'intégration, du matériau au système

Abstract

Human action recognition in video is one of the key problems in visual data interpretation. Despite intensive research, recognizing actions with low inter-class variability remains a challenge. This paper presents a new Siamese Spatio-Temporal Convolutional Neural Network (SSTCNN) for this purpose. Applied to table tennis, it detects and recognizes 20 table tennis strokes. The model was trained on a dedicated dataset, the so-called TTStroke-21, recorded in natural conditions at the Faculty of Sports of the University of Bordeaux. The model takes as input an RGB image sequence and its computed residual optical flow. The proposed siamese network architecture comprises 3 spatio-temporal convolutional layers, followed by a fully connected layer where the data are fused. The method reaches an accuracy of 91.4%, against 43.1% for the baseline.
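The two-branch layout described in the abstract can be illustrated with some shape bookkeeping. The sketch below is not the authors' implementation: it assumes each branch stacks three blocks of 3D convolution plus max pooling, that fusion is a plain concatenation feeding one fully connected layer, and that the classifier has 20 outputs (one per stroke). The class `ConvPool3D`, the function `branch_features`, the channel counts, and the input sizes are all hypothetical values chosen for illustration.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple

# (channels, frames, height, width) of one video tensor
Shape = Tuple[int, int, int, int]


@dataclass(frozen=True)
class ConvPool3D:
    """Hypothetical spatio-temporal block: 3D convolution, then max pooling."""
    out_channels: int
    kernel: int = 3   # assumed cubic kernel, stride 1
    padding: int = 1  # assumed zero padding
    pool: int = 2     # assumed pooling factor on frames, height and width

    def output_shape(self, shape: Shape) -> Shape:
        _, frames, height, width = shape

        def reduce(n: int) -> int:
            # Stride-1 convolution output size, then non-overlapping pooling.
            after_conv = n + 2 * self.padding - self.kernel + 1
            return after_conv // self.pool

        return (self.out_channels, reduce(frames), reduce(height), reduce(width))


def branch_features(input_shape: Shape, blocks: Sequence[ConvPool3D]) -> int:
    """Number of features one branch produces after flattening."""
    shape = input_shape
    for block in blocks:
        shape = block.output_shape(shape)
    channels, frames, height, width = shape
    return channels * frames * height * width


# Assumed hyperparameters, not taken from the paper.
BLOCKS = (ConvPool3D(16), ConvPool3D(32), ConvPool3D(64))
RGB_SHAPE: Shape = (3, 32, 64, 64)   # 3 color channels
FLOW_SHAPE: Shape = (2, 32, 64, 64)  # 2 channels: horizontal and vertical motion
NUM_CLASSES = 20                     # assumed: one output per stroke

rgb_features = branch_features(RGB_SHAPE, BLOCKS)
flow_features = branch_features(FLOW_SHAPE, BLOCKS)

# Assumed fusion: concatenate both branches before the fully connected layer.
fused_features = rgb_features + flow_features
fc_parameters = fused_features * NUM_CLASSES + NUM_CLASSES  # weights + biases

print(rgb_features, flow_features, fused_features, fc_parameters)
```

Under these assumptions both branches end with the same feature count, because the number of input channels only affects the first convolution's weights, not the shape of its output.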

Domains

Computer Vision and Pattern Recognition [cs.CV]; Artificial Intelligence [cs.AI]; Computer Science [cs]; Human-Computer Interaction [cs.HC]; Machine Learning [cs.LG]; Multimedia [cs.MM]; Neural Networks [cs.NE]; Image Processing [eess.IV]; Signal and Image Processing [eess.SP]

Main file

MTAP.pdf (6.31 MB)

Origin: Files produced by the author(s)

Contributor: Pierre-Etienne Martin

https://hal.science/hal-02551019

Submitted on: Tuesday 16 June 2020, 19:12:49

Last modified on: Monday 5 June 2023, 16:52:11

Dates and versions

hal-02551019 , version 1 (16-06-2020)

Identifiers

HAL Id : hal-02551019 , version 1
DOI : 10.1007/s11042-020-08917-3

Cite

Pierre-Etienne Martin, Jenny Benois-Pineau, Renaud Péteri, Julien Morlier. Fine grained sport action recognition with Twin spatio-temporal convolutional neural networks. Multimedia Tools and Applications, 2020, ⟨10.1007/s11042-020-08917-3⟩. ⟨hal-02551019⟩

Collections

CNRS IMS-BORDEAUX MIA IMS-BORDEAUX-FUSION UNIV-ROCHELLE

342 views

311 downloads
