S. Ali and M. Shah, Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.2, pp.288-303, 2010.
DOI : 10.1109/TPAMI.2008.284

W. Brendel and S. Todorovic, Learning spatiotemporal graphs of human activities, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126316

T. Brox and J. Malik, Object Segmentation by Long Term Analysis of Point Trajectories, ECCV, 2010.
DOI : 10.1007/978-3-642-15555-0_21

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005.
DOI : 10.1109/CVPR.2005.177

URL : https://hal.archives-ouvertes.fr/inria-00548512

N. Dalal, B. Triggs, and C. Schmid, Human Detection Using Oriented Histograms of Flow and Appearance, ECCV, 2006.
DOI : 10.1023/A:1008162616689

URL : https://hal.archives-ouvertes.fr/inria-00548587

P. Dollar, V. Rabaud, G. Cottrell, and S. Belongie, Behavior Recognition via Sparse Spatio-Temporal Features, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, 2005.
DOI : 10.1109/VSPETS.2005.1570899

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.77.5712

A. Hervieu, P. Bouthemy, and L. Cadre, A Statistical Video Content Recognition Method Using Invariant Features on Object Trajectories, IEEE Transactions on Circuits and Systems for Video Technology, vol.18, issue.11, pp.1533-1543, 2008.
DOI : 10.1109/TCSVT.2008.2005609

H. Jégou, F. Perronnin, M. Douze, J. Sánchez, P. Pérez et al., Aggregating Local Image Descriptors into Compact Codes, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.9, pp.1704-1716, 2012.
DOI : 10.1109/TPAMI.2011.235

Y. Jiang, Q. Dai, X. Xue, W. Liu, and C. Ngo, Trajectory-Based Modeling of Human Actions with Motion Reference Points, ECCV, 2012.
DOI : 10.1007/978-3-642-33715-4_31

O. Kliper-gross, Y. Gurovich, T. Hassner, and L. Wolf, Motion Interchange Patterns for Action Recognition in Unconstrained Videos, ECCV, 2012.
DOI : 10.1007/978-3-642-33783-3_19

H. Kuehne, H. Jhuang, E. Garrote, T. Poggio, and T. Serre, HMDB: A large video database for human motion recognition, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126543

I. Laptev, M. Marzalek, C. Schmid, and B. Rozenfeld, Learning realistic human actions from movies, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587756

URL : https://hal.archives-ouvertes.fr/inria-00548659

D. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.14.4931

M. Marzalek, I. Laptev, and C. Schmid, Actions in context, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206557

P. Matikainen, M. Hebert, and R. Sukthankar, Trajectons: Action recognition through the motion analysis of tracked features, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, 2009.
DOI : 10.1109/ICCVW.2009.5457659

R. Messing, C. J. Pal, and H. A. Kautz, Activity recognition using the velocity histories of tracked keypoints, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459154

J. C. Niebles, C. Chen, and F. Li, Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification, ECCV, 2010.
DOI : 10.1007/978-3-642-15552-9_29

J. Odobez and P. Bouthemy, Robust Multiresolution Estimation of Parametric Motion Models, Journal of Visual Communication and Image Representation, vol.6, issue.4, pp.348-365, 1995.
DOI : 10.1006/jvci.1995.1029

G. Piriou, P. Bouthemy, and J. Yao, Recognition of Dynamic Video Contents With Global Probabilistic Models of Visual Motion, IEEE Transactions on Image Processing, vol.15, issue.11, pp.153417-3430, 2006.
DOI : 10.1109/TIP.2006.881963

URL : https://hal.archives-ouvertes.fr/hal-00453197

S. Sadanand and J. J. Corso, Action bank: A high-level representation of activity in video, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6247806

C. Schmid and R. Mohr, Local grayvalue invariants for image retrieval, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.19, issue.5, pp.530-534, 1997.
DOI : 10.1109/34.589215

URL : https://hal.archives-ouvertes.fr/inria-00548358

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, pp.1470-1477, 2003.
DOI : 10.1109/ICCV.2003.1238663

J. Sun, X. Wu, S. Yan, L. F. Cheong, T. Chua et al., Hierarchical spatio-temporal context modeling for action recognition, CVPR, 2009.

H. Uemura, S. Ishikawa, and K. Mikolajczyk, Feature Tracking and Motion Compensation for Action Recognition, Procedings of the British Machine Vision Conference 2008, 2008.
DOI : 10.5244/C.22.30

M. M. Ullah, S. N. Parizi, and I. Laptev, Improving bag-offeatures action recognition with non-local cues, BMVC, 2010.
DOI : 10.5244/c.24.95

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.174.6987

E. Vig, M. Dorr, and D. Cox, Saliency-based space-variant descriptor sampling for action recognition, ECCV, 2012.
DOI : 10.1007/978-3-642-33786-4_7

H. Wang, A. Kläser, C. Schmid, and C. Liu, Action recognition by dense trajectories, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995407

URL : https://hal.archives-ouvertes.fr/inria-00583818

H. Wang, M. M. Ullah, A. Kläser, I. Laptev, and C. Schmid, Evaluation of local spatio-temporal features for action recognition, Procedings of the British Machine Vision Conference 2009, 2009.
DOI : 10.5244/C.23.124

URL : https://hal.archives-ouvertes.fr/inria-00439769

G. Willems, T. Tuytelaars, and L. J. , An Efficient Dense and Scale-Invariant Spatio-Temporal Interest Point Detector, ECCV, 2008.
DOI : 10.1007/978-3-540-88688-4_48

S. Wu, O. Oreifej, and M. Shah, Action recognition in videos acquired by a moving camera using motion decomposition of Lagrangian particle trajectories, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126397

J. Yuan, Z. Liu, and Y. Wu, Discriminative subvolume search for efficient action detection, CVPR, 2009.