Focused Attention for Action Recognition

Vladyslav Sydorov; Karteek Alahari; Cordelia Schmid

Conference paper, Year: 2019

Vladyslav Sydorov

Role: Author
PersonId : 1054422

Apprentissage de modèles à partir de données massives

Karteek Alahari

Role: Author
PersonId : 19670
IdHAL : karteek
ORCID : 0000-0002-1838-5936
IdRef : 196283892

Apprentissage de modèles à partir de données massives

Cordelia Schmid

Role: Author
PersonId : 831154

Apprentissage de modèles à partir de données massives

Abstract

Current state-of-the-art approaches to action recognition emphasize learning ConvNets on large amounts of training data, using 3D convolutions to process the temporal dimension. This is expensive in terms of memory usage and constitutes a major performance bottleneck of existing approaches. Further, video input data points typically include irrelevant information alongside useful features, which limits the level of detail that networks can process, regardless of the quality of the original video. Hence, models that can focus computational resources on the relevant training signal are desirable. To address this problem, we rely on network-specific saliency outputs to drive an attention model that provides tighter crops around relevant video regions. We experimentally validate this approach and show how this strategy improves performance for the action recognition task.
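
To make the idea of saliency-driven cropping concrete, here is a minimal sketch in Python with NumPy. It is not the authors' attention model: it assumes per-frame saliency maps are already available as an array, and it simply thresholds the time-averaged map and takes the bounding box of the surviving pixels. The function names `saliency_box` and `crop_clip` and the parameter `rel_threshold` are hypothetical, chosen only for illustration.

```python
import numpy as np


def saliency_box(saliency, rel_threshold=0.5):
    """Return (top, left, bottom, right) of a tight box around salient pixels.

    saliency: non-negative array of shape (T, H, W), one map per frame.
    rel_threshold: hypothetical parameter; a pixel counts as relevant when its
        time-averaged saliency is at least this fraction of the maximum.
    """
    mean_map = saliency.mean(axis=0)  # average over time -> (H, W)
    peak = mean_map.max()
    if peak <= 0:
        # No saliency signal at all: fall back to the full frame.
        return 0, 0, mean_map.shape[0], mean_map.shape[1]
    mask = mean_map >= rel_threshold * peak
    rows = np.flatnonzero(mask.any(axis=1))  # rows containing salient pixels
    cols = np.flatnonzero(mask.any(axis=0))  # columns containing salient pixels
    return int(rows[0]), int(cols[0]), int(rows[-1]) + 1, int(cols[-1]) + 1


def crop_clip(clip, box):
    """Crop a clip of shape (T, H, W, C) to the given spatial box."""
    top, left, bottom, right = box
    return clip[:, top:bottom, left:right]


# Toy example: a 4-frame clip whose saliency is concentrated in one region.
saliency = np.zeros((4, 32, 32))
saliency[:, 10:20, 5:15] = 1.0
clip = np.zeros((4, 32, 32, 3))
cropped = crop_clip(clip, saliency_box(saliency))
```

In a pipeline of the kind the abstract describes, the cropped region would typically be resized to the network's input resolution, so that the same input budget covers the relevant region at a higher level of detail.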

Domains

Computer Vision and Pattern Recognition [cs.CV]

Main file

paper.pdf (224.53 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-02292339

Submitted on: Thursday, 19 September 2019, 16:29:49

Last modified on: Thursday, 4 April 2024, 21:40:18

Long-term archiving on: Saturday, 8 February 2020, 21:44:27

Dates and versions

hal-02292339 , version 1 (19-09-2019)

Identifiers

HAL Id : hal-02292339 , version 1

Cite

Vladyslav Sydorov, Karteek Alahari, Cordelia Schmid. Focused Attention for Action Recognition. BMVC 2019 - British Machine Vision Conference, Sep 2019, Cardiff, United Kingdom. pp.1-13. ⟨hal-02292339⟩

Collections

UGA CNRS INRIA LJK LJK_GI INRIA2 LJK-GI-THOTH ANR

660 views

617 downloads
