Skip to Main content Skip to Navigation
Poster communications

Fast Cascaded Action Localization In Video Using Frame Alignment

Abstract : Locating human actions in videos is challenging because ofthe complexity and variability of human motions, as well asof the amount of video data to be searched. Different actionscan be composed of similar short motions and only differ bytheir temporal ordering or relative durations. We propose amethod that detects and locates a set of actions in a videodatabase by taking into account their temporal structure at theframe level. While other methods aggregate frames into ac-tion parts, we leverage the complementarity between aggre-gation and frame level comparison of sequences. With theaim to address large scale retrieval, we introduce a two-levelcascade. The first level employs inexpensive aggregation tofilter out a large part of the video. The second level of thecascade applies to the remaining sequences more discrimi-nant comparisons using frame alignment. Evaluations on the?Smoking and Drinking? and ?MSR Action II? datasets showstate of the art results, as well as efficient detection and lowstorage requirements.
Document type :
Poster communications
Complete list of metadata
Contributor : Laboratoire Cedric <>
Submitted on : Friday, March 6, 2015 - 12:00:16 PM
Last modification on : Monday, September 7, 2020 - 12:10:03 PM


  • HAL Id : hal-01126535, version 1


Andrei Stoian, Marin Ferecatu, Jenny Benois-Pineau, Michel Crucianu. Fast Cascaded Action Localization In Video Using Frame Alignment. International Workshop on Computational Intelligence for Multimedia Understanding (IWCIM) 2014, Oct 2014, X, France. 2014. ⟨hal-01126535⟩



Record views