Focused Attention for Action Recognition

Abstract : Current state-of-the-art approaches to action recognition emphasize learning ConvNets on large amounts of training data, using 3D convolutions to process the temporal dimension. This approach is expensive in terms of memory usage and constitutes a major performance bottleneck. Further, video inputs typically include irrelevant information alongside useful features, which limits the level of detail that networks can process, regardless of the quality of the original video. Hence, models that can focus computational resources on the relevant training signal are desirable. To address this problem, we rely on network-specific saliency outputs to drive an attention model that provides tighter crops around relevant video regions. We experimentally validate this approach and show that this strategy improves performance on the action recognition task.
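
The idea of turning a saliency output into a tighter crop can be illustrated with a minimal sketch. This is not the authors' attention model: the thresholding rule, the `rel_threshold` and `margin` parameters, and the function names `saliency_box` and `crop_clip` are all assumptions made for illustration. It assumes a single non-negative 2D saliency map per clip and a clip stored as a NumPy array of shape (frames, height, width, channels).

```python
import numpy as np


def saliency_box(saliency, rel_threshold=0.5, margin=0):
    """Bounding box (top, bottom, left, right) of the salient region.

    Hypothetical rule: a pixel counts as salient when its value is at
    least rel_threshold times the map's maximum. The box is padded by
    `margin` pixels and clipped to the map; bottom/right are exclusive.
    """
    height, width = saliency.shape
    mask = saliency >= rel_threshold * saliency.max()
    rows = np.flatnonzero(mask.any(axis=1))  # rows with a salient pixel
    cols = np.flatnonzero(mask.any(axis=0))  # columns with a salient pixel
    top = max(int(rows[0]) - margin, 0)
    bottom = min(int(rows[-1]) + 1 + margin, height)
    left = max(int(cols[0]) - margin, 0)
    right = min(int(cols[-1]) + 1 + margin, width)
    return top, bottom, left, right


def crop_clip(clip, box):
    """Apply the same spatial crop to every frame of a (T, H, W, C) clip."""
    top, bottom, left, right = box
    return clip[:, top:bottom, left:right]


# Toy example: an 8x8 saliency map with one 3x3 salient patch.
saliency = np.zeros((8, 8))
saliency[2:5, 3:6] = 1.0
clip = np.zeros((4, 8, 8, 3))

box = saliency_box(saliency, margin=1)
cropped = crop_clip(clip, box)
print(box, cropped.shape)
```

In a real pipeline the cropped region would typically be resized back to the network's input resolution, so that the relevant area is processed at a higher level of detail for the same memory cost.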
Document type : Conference papers

https://hal.archives-ouvertes.fr/hal-02292339
Contributor : Vladyslav Sydorov
Submitted on : Thursday, September 19, 2019 - 4:29:49 PM
Last modification on : Monday, October 19, 2020 - 11:31:00 AM
Long-term archiving on : Saturday, February 8, 2020 - 9:44:27 PM

File

paper.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02292339, version 1

Citation

Vladyslav Sydorov, Karteek Alahari, Cordelia Schmid. Focused Attention for Action Recognition. BMVC 2019 - British Machine Vision Conference, Sep 2019, Cardiff, United Kingdom. pp.1-13. ⟨hal-02292339⟩
