S. Ji, W. Xu, M. Yang, and K. Yu, 3d convolutional neural networks for human action recognition, Pattern Analysis and Machine Intelligence, vol.35, pp.221-231, 2013.

K. Simonyan and A. Zisserman, Two-stream convolutional networks for action recognition in videos, Advances in Neural Information Processing Systems, pp.568-576, 2014.

Q. V. Le, W. Y. Zou, S. Y. Yeung, and A. Y. Ng, Learning hierarchical invariant spatiotemporal features for action recognition with independent subspace analysis, CVPR 2011 IEEE Conference on, pp.3361-3368, 2011.

D. Tran, L. Bourdev, R. Fergus, L. Torresani, and M. Paluri, Learning spatiotemporal features with 3d convolutional networks, 2015 IEEE International Conference on Computer Vision (ICCV), pp.4489-4497, 2015.

I. Laptev, On space-time interest points, International Journal of Computer Vision, vol.64, pp.107-123, 2005.

I. Laptev, M. Marsza-lek, C. Schmid, and B. Rozenfeld, Learning realistic human actions from movies, CVPR 2008. IEEE Conference on, pp.1-8, 2008.
URL : https://hal.archives-ouvertes.fr/inria-00548659

Q. Wei, X. Zhang, Y. Kong, W. Hu, and H. Ling, Group action recognition using space-time interest points, International Symposium on Visual Computing, pp.757-766, 2009.

H. Wang, A. Kläser, C. Schmid, and C. L. Liu, Dense trajectories and motion boundary descriptors for action recognition, International Journal of Computer Vision, vol.103, pp.60-79, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00725627

H. Wang, M. M. Ullah, A. Klaser, I. Laptev, and C. Schmid, Evaluation of local spatio-temporal features for action recognition, BMVC 2009-British Machine Vision Conference, pp.124-125, 2009.
URL : https://hal.archives-ouvertes.fr/inria-00439769

H. Wang and C. Schmid, Action recognition with improved trajectories, Proceedings of the IEEE International Conference on Computer Vision, pp.3551-3558, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00873267

A. Klaser, M. Marsza-lek, and C. Schmid, A spatio-temporal descriptor based on 3d-gradients, BMVC 2008-19th British Machine Vision Conference, British Machine Vision Association, pp.275-276, 2008.
URL : https://hal.archives-ouvertes.fr/inria-00514853

D. G. Lowe, Distinctive image features from scale-invariant keypoints, International Journal of Computer Vision, vol.60, pp.91-110, 2004.

N. Dalal, B. Triggs, and C. Schmid, Human detection using oriented histograms of flow and appearance, European conference on computer vision, pp.428-441, 2006.
URL : https://hal.archives-ouvertes.fr/inria-00548587

L. Yeffet and L. Wolf, Local trinary patterns for human action recognition, Computer Vision, IEEE 12th International Conference on, pp.492-497, 2009.

O. Kliper-gross, Y. Gurovich, T. Hassner, and L. Wolf, Motion interchange patterns for action recognition in unconstrained videos, European Conference on Computer Vision, pp.256-269, 2012.

H. Jégou, M. Douze, C. Schmid, and P. Pérez, Aggregating local descriptors into a compact image representation, CVPR 2010. IEEE Conference on, pp.3304-3311, 2010.

N. S. Vu and A. Caplier, Face recognition with patterns of oriented edge magnitudes, European conference on computer vision, pp.313-326, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00528605

N. S. Vu, Exploring patterns of gradient orientations and magnitudes for face recognition, Information Forensics and Security, vol.8, pp.295-304, 2013.

M. Jain, H. Jégou, and P. Bouthemy, Better exploiting motion for better action recognition, CVPR 2013, pp.2555-2562, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00813014

V. Kantorov and I. Laptev, Efficient feature extraction, encoding and classification for action recognition, Proceedings of the IEEE Conference on CVPR, pp.2593-2600, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01058734

F. Perronnin and C. Dance, Fisher kernels on visual vocabularies for image categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2007.

C. C. Chang and C. J. Lin, Libsvm: a library for support vector machines, ACM TIST, vol.2, p.27, 2011.

C. Schuldt, I. Laptev, and B. Caputo, Recognizing human actions: a local svm approach, Proceedings of the 17th International Conference on, vol.3, pp.32-36, 2004.

M. D. Rodriguez, J. Ahmed, and M. Shah, Action mach a spatio-temporal maximum average correlation height filter for action recognition, CVPR, 2008 IEEE Conference on, pp.1-8, 2008.

S. Sadanand and J. J. Corso, Action bank: A high-level representation of activity in video, Computer Vision and Pattern Recognition (CVPR), pp.1234-1241, 2012.

A. Kovashka and K. Grauman, Learning a hierarchy of discriminative space-time neighborhood features for har, Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on 2010, pp.2046-2053, 2010.

G. W. Taylor, R. Fergus, Y. Lecun, and C. Bregler, Convolutional learning of spatiotemporal features, European conference on computer vision, pp.140-153, 2010.

L. Liu, L. Shao, X. Li, and K. Lu, Learning spatio-temporal representations for action recognition: A genetic programming approach. Cybernetics, IEEE Transactions, vol.46, pp.158-170, 2016.

A. Kläser, Learning human actions in video, 2010.