I. Laptev, On Space-Time Interest Points, International Journal of Computer Vision, vol.17, issue.8, pp.107-123, 2005.
DOI : 10.1007/s11263-005-1838-7

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.58.1419

M. Bregonzio, S. Gong, and T. Xiang, Recognising action as clouds of space-time interest points, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206779

P. Dollár, V. Rabaud, G. Cottrell, and S. Belongie, Behavior Recognition via Sparse Spatio-Temporal Features, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, 2005.
DOI : 10.1109/VSPETS.2005.1570899

G. Willems, T. Tuytelaars, and L. Gool, An Efficient Dense and Scale-Invariant Spatio-Temporal Interest Point Detector, European Conference on Computer Vision, 2008.
DOI : 10.1007/978-3-540-88688-4_48

I. Laptev, M. Marsza?ek, C. Schmid, and B. Rozenfeld, Learning realistic human actions from movies, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587756

URL : https://hal.archives-ouvertes.fr/inria-00548659

C. Schüldt, I. Laptev, and B. Caputo, Recognizing human actions: a local SVM approach, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004., 2004.
DOI : 10.1109/ICPR.2004.1334462

P. Scovanner, S. Ali, and M. Shah, A 3-dimensional sift descriptor and its application to action recognition, Proceedings of the 15th international conference on Multimedia , MULTIMEDIA '07, 2007.
DOI : 10.1145/1291233.1291311

A. Kläser, M. Marsza?ek, and C. Schmid, A Spatio-Temporal Descriptor Based on 3D-Gradients, Procedings of the British Machine Vision Conference 2008, 2008.
DOI : 10.5244/C.22.99

L. Yeffet and L. Wolf, Local Trinary Patterns for human action recognition, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459201

P. Matikainen, M. Hebert, and R. Sukthankar, Trajectons: Action recognition through the motion analysis of tracked features, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, 2009.
DOI : 10.1109/ICCVW.2009.5457659

R. Messing, C. Pal, and H. Kautz, Activity recognition using the velocity histories of tracked keypoints, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459154

J. Sun, X. Wu, S. Yan, L. Cheong, T. Chua et al., Hierarchical spatio-temporal context modeling for action recognition, IEEE Conference on Computer Vision and Pattern Recognition, 2009.

J. Sun, Y. Mu, S. Yan, and L. Cheong, Activity recognition using dense long-duration trajectories, 2010 IEEE International Conference on Multimedia and Expo, 2010.
DOI : 10.1109/ICME.2010.5583046

B. D. Lucas and T. Kanade, An iterative image registration technique with an application to stereo vision, International Joint Conference on Artificial Intelligence, 1981.

L. Fei-fei and P. Perona, A Bayesian Hierarchical Model for Learning Natural Scene Categories, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005.
DOI : 10.1109/CVPR.2005.16

E. Nowak, F. Jurie, and B. Triggs, Sampling Strategies for Bag-of-Features Image Classification, European Conference on Computer Vision, 2006.
DOI : 10.1007/11744085_38

URL : https://hal.archives-ouvertes.fr/hal-00203752

H. Wang, M. M. Ullah, A. Kläser, I. Laptev, and C. Schmid, Evaluation of local spatio-temporal features for action recognition, Procedings of the British Machine Vision Conference 2009, 2009.
DOI : 10.5244/C.23.124

URL : https://hal.archives-ouvertes.fr/inria-00439769

P. Sand and S. Teller, Particle Video: Long-Range Motion Estimation Using Point Trajectories, International Journal of Computer Vision, vol.30, issue.3, pp.72-91, 2008.
DOI : 10.1007/s11263-008-0136-6

T. Brox and J. Malik, Object Segmentation by Long Term Analysis of Point Trajectories, European Conference on Computer Vision, 2010.
DOI : 10.1007/978-3-642-15555-0_21

H. Wang, A. Kläser, C. Schmid, and C. Liu, Action recognition by dense trajectories, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995407

URL : https://hal.archives-ouvertes.fr/inria-00583818

N. Dalal, B. Triggs, and C. Schmid, Human Detection Using Oriented Histograms of Flow and Appearance, European Conference on Computer Vision, 2006.
DOI : 10.1023/A:1008162616689

URL : https://hal.archives-ouvertes.fr/inria-00548587

S. Wong, R. Cipolla, and D. G. Lowe, Extracting spatiotemporal interest points using global information Distinctive image features from scale-invariant keypoints, IEEE International Conference on Computer Vision, pp.91-110, 2004.

H. Bay, T. Tuytelaars, and L. V. , SURF: Speeded up robust features, European Conference on Computer Vision, 2006.

T. Ojala, M. Pietikainen, and T. Maenpaa, Multiresolution gray-scale and rotation invariant texture classification with local binary patterns, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.7, pp.971-987, 2002.
DOI : 10.1109/TPAMI.2002.1017623

M. Raptis and S. Soatto, Tracklet Descriptors for Action Modeling and Video Analysis, European Conference on Computer Vision, 2010.
DOI : 10.1007/978-3-642-15549-9_42

W. Lu, Y. F. Wang, and C. Chen, Learning Dense Optical-Flow Trajectory Patterns for Video Object Extraction, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010.
DOI : 10.1109/AVSS.2010.79

N. Johnson and D. Hogg, Learning the distribution of object trajectories for event recognition, Image and Vision Computing, vol.14, issue.8, pp.609-615, 1996.
DOI : 10.1016/0262-8856(96)01101-8

N. Anjum and A. Cavallaro, Multifeature Object Trajectory Clustering for Video Analysis, IEEE Transactions on Circuits and Systems for Video Technology, vol.18, issue.11, pp.1555-1564, 2008.
DOI : 10.1109/TCSVT.2008.2005603

C. R. Jung, L. Hennemann, and S. R. Musse, Event Detection Using Trajectory Clustering and 4-D Histograms, IEEE Transactions on Circuits and Systems for Video Technology, pp.1565-1575, 2008.
DOI : 10.1109/TCSVT.2008.2005600

A. Hervieu, P. Bouthemy, and J. L. Cadre, A Statistical Video Content Recognition Method Using Invariant Features on Object Trajectories, IEEE Transactions on Circuits and Systems for Video Technology, pp.1533-1543, 2008.
DOI : 10.1109/TCSVT.2008.2005609

X. Wang, K. T. Ma, G. Ng, and W. E. Grimson, Trajectory Analysis and Semantic Region Modeling Using Nonparametric Hierarchical Bayesian Models, IEEE International Conference on Computer Vision, 2008.
DOI : 10.1007/s11263-011-0459-6

G. Piriou, P. Bouthemy, and J. Yao, Recognition of Dynamic Video Contents With Global Probabilistic Models of Visual Motion, IEEE Transactions on Image Processing, vol.15, issue.11, pp.3418-3431, 2006.
DOI : 10.1109/TIP.2006.881963

URL : https://hal.archives-ouvertes.fr/hal-00453197

H. Uemura, S. Ishikawa, and K. Mikolajczyk, Feature Tracking and Motion Compensation for Action Recognition, Procedings of the British Machine Vision Conference 2008, 2008.
DOI : 10.5244/C.22.30

N. Ikizler-cinbis and S. Sclaroff, Object, Scene and Actions: Combining Multiple Features for Human Action Recognition, European Conference on Computer Vision, 2010.
DOI : 10.1007/978-3-642-15549-9_36

S. Wu, O. Oreifej, and M. Shah, Action recognition in videos acquired by a moving camera using motion decomposition of Lagrangian particle trajectories, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126397

J. Shi and C. Tomasi, Good features to track, IEEE Conference on Computer Vision and Pattern Recognition, 1994.

N. Sundaram, T. Brox, and K. Keutzer, Dense Point Trajectories by GPU-Accelerated Large Displacement Optical Flow, European Conference on Computer Vision, 2010.
DOI : 10.1007/978-3-642-15549-9_32

G. Farnebäck, Two-Frame Motion Estimation Based on Polynomial Expansion, Proceedings of the Scandinavian Conference on Image Analysis, 2003.
DOI : 10.1007/3-540-45103-X_50

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005.
DOI : 10.1109/CVPR.2005.177

URL : https://hal.archives-ouvertes.fr/inria-00548512

T. Brox and J. Malik, Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.33, issue.3, pp.500-513, 2011.
DOI : 10.1109/TPAMI.2010.143

J. Zhang, M. Marsza?ek, S. Lazebnik, and C. Schmid, Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study, International Journal of Computer Vision, vol.36, issue.1, pp.213-238, 2007.
DOI : 10.1007/s11263-006-9794-4

URL : https://hal.archives-ouvertes.fr/inria-00548574

M. M. Ullah, S. N. Parizi, and I. Laptev, Improving bag-of-features action recognition with nonlocal cues, British Machine Vision Conference, 2010.

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.68

URL : https://hal.archives-ouvertes.fr/inria-00548585

J. Liu, J. Luo, and M. Shah, Recognizing realistic actions from videos in the wild, IEEE Conference on Computer Vision and Pattern Recognition, 2009.

M. Marsza?ek, I. Laptev, and C. Schmid, Actions in context, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206557

M. D. Rodriguez, J. Ahmed, and M. Shah, Action MACH a spatio-temporal Maximum Average Correlation Height filter for action recognition, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587727

D. Weinland, E. Boyer, and R. Ronfard, Action Recognition from Arbitrary Views using 3D Exemplars, 2007 IEEE 11th International Conference on Computer Vision, 2007.
DOI : 10.1109/ICCV.2007.4408849

URL : https://hal.archives-ouvertes.fr/inria-00544741

J. C. Niebles, C. Chen, and L. Fei-fei, Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification, European Conference on Computer Vision, 2010.
DOI : 10.1007/978-3-642-15552-9_29

D. Tran and A. Sorokin, Human Activity Recognition with Metric Learning, European Conference on Computer Vision, 2008.
DOI : 10.1007/978-3-540-88682-2_42

K. Reddy and M. Shah, Recognizing 50 human action categories of web videos, Machine Vision and Applications, pp.1-11, 2012.
DOI : 10.1007/s00138-012-0450-4

H. Kuehne, H. Jhuang, E. Garrote, T. Poggio, and T. Serre, HMDB: A large video database for human motion recognition, 2011 International Conference on Computer Vision, pp.2556-2563, 2011.
DOI : 10.1109/ICCV.2011.6126543

A. Kovashka and K. Grauman, Learning a hierarchy of discriminative space-time neighborhood features for human action recognition, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539881

G. W. Taylor, R. Fergus, Y. Lecun, and C. Bregler, Convolutional learning of spatio-temporal features Discriminative video pattern search for efficient action detection, European Conference on Computer Vision, pp.1728-1743, 2010.

W. Brendel and S. Todorovic, Activities as Time Series of Human Postures, European Conference on Computer Vision, 2010.
DOI : 10.1007/978-3-642-15552-9_52

Q. V. Le, W. Y. Zou, S. Y. Yeung, and A. Y. Ng, Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995496

A. Gilbert, J. Illingworth, and R. Bowden, Action Recognition Using Mined Hierarchical Compound Features, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.33, issue.5, pp.883-897, 2011.
DOI : 10.1109/TPAMI.2010.144

S. Bhattacharya, R. Sukthankar, R. Jin, and M. Shah, A probabilistic representation for efficient large scale visual recognition tasks, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995746

A. Kläser, M. Marsza?ek, I. Laptev, and C. Schmid, Will person detection help bag-of-features action recognition, 2010.

I. N. Junejo, E. Dexter, I. Laptev, and P. Pérez, View-Independent Action Recognition from Temporal Self-Similarities, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.33, issue.1, pp.172-185, 2011.
DOI : 10.1109/TPAMI.2010.68

URL : https://hal.archives-ouvertes.fr/hal-01064695

W. Brendel and S. Todorovic, Learning spatiotemporal graphs of human activities, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126316

X. Wu, D. Xu, L. Duan, and J. Luo, Action recognition using context and appearance distribution features, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995624

A. Gaidon, Z. Harchaoui, and C. Schmid, Recognizing activities with cluster-trees of tracklets, Procedings of the British Machine Vision Conference 2012, 2012.
DOI : 10.5244/C.26.30

URL : https://hal.archives-ouvertes.fr/hal-00722955

O. Kliper-gross, Y. Gurovich, T. Hassner, and L. Wolf, Motion Interchange Patterns for Action Recognition in Unconstrained Videos, European Conference on Computer Vision, 2012.
DOI : 10.1007/978-3-642-33783-3_19

S. Sadanand and J. J. Corso, Action bank: A high-level representation of activity in video, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6247806

D. Weinland, R. Ronfard, and E. Boyer, Free viewpoint action recognition using motion history volumes, Inria RESEARCH CENTRE GRENOBLE ? RHÔNE-ALPES Inovallée 655 avenue de l'Europe Montbonnot 38334 Saint Ismier Cedex Publisher Inria Domaine de Voluceau -Rocquencourt BP 105 -78153 Le Chesnay Cedex inria.fr ISSN, pp.249-257, 2006.
DOI : 10.1016/j.cviu.2006.07.013

URL : https://hal.archives-ouvertes.fr/inria-00544629