L. Bourdev, S. Maji, T. Brox, and J. Malik, Detecting People Using Mutually Consistent Poselet Activations, pp.168-181, 2010.
DOI : 10.1007/978-3-642-15567-3_13

L. Bourdev and J. Malik, Poselets: Body part detectors trained using 3D human pose annotations, 2009 IEEE 12th International Conference on Computer Vision, pp.1365-1372, 2009.
DOI : 10.1109/ICCV.2009.5459303

L. Campbell and A. Bobick, Recognition of human body motion using phase space constraints, Proceedings of IEEE International Conference on Computer Vision, pp.624-630, 1995.
DOI : 10.1109/ICCV.1995.466880

C. Chang and C. Lin, LIBSVM, ACM Transactions on Intelligent Systems and Technology, vol.2, issue.3, pp.27-28, 2011.
DOI : 10.1145/1961189.1961199

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.886-893, 2005.
DOI : 10.1109/CVPR.2005.177
URL : https://hal.archives-ouvertes.fr/inria-00548512

N. Dalal, B. Triggs, and C. Schmid, Human Detection Using Oriented Histograms of Flow and Appearance, pp.428-441, 2006.
DOI : 10.1007/11744047_33
URL : https://hal.archives-ouvertes.fr/inria-00548587

M. Dantone, J. Gall, C. Leistner, and L. Van Gool, Human Pose Estimation Using Body Parts Dependent Joint Regressors, 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp.3041-3048, 2013.
DOI : 10.1109/CVPR.2013.391

M. Eichner and V. Ferrari, Better appearance models for pictorial structures, Proceedings of the British Machine Vision Conference 2009, pp.3-4, 2009.
DOI : 10.5244/C.23.3

G. Farneback, Two-Frame Motion Estimation Based on Polynomial Expansion, Image Analysis, issue.4, pp.363-370, 2003.
DOI : 10.1007/3-540-45103-X_50

V. Ferrari, M. Marin-Jimenez, and A. Zisserman, Progressive search space reduction for human pose estimation, 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2008.
DOI : 10.1109/CVPR.2008.4587468

C. Ionescu, D. Papava, V. Olaru, and C. Sminchisescu, Human3.6M: Large scale datasets and predictive methods for 3D human sensing in natural environments, p.3, 2014.

S. Johnson and M. Everingham, Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation, Proceedings of the British Machine Vision Conference 2010, pp.12-13, 2010.
DOI : 10.5244/C.24.12

H. Kuehne, H. Jhuang, E. Garrote, T. Poggio, and T. Serre, HMDB: A large video database for human motion recognition, 2011 International Conference on Computer Vision, pp.2556-2563, 2011.
DOI : 10.1109/ICCV.2011.6126543

F. De la Torre, J. Hodgins, J. Montano, S. Valcarcel, R. Forcada et al., Guide to the Carnegie Mellon University multimodal activity (CMU-MMAC) database, p.3, 2009.

I. Laptev, M. Marszalek, C. Schmid, and B. Rozenfeld, Learning realistic human actions from movies, 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2008.
DOI : 10.1109/CVPR.2008.4587756
URL : https://hal.archives-ouvertes.fr/inria-00548659

J. Niebles, C. Chen, and L. Fei-Fei, Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification, pp.392-405, 2010.
DOI : 10.1007/978-3-642-15552-9_29

F. Ofli, R. Chaudhry, G. Kurillo, R. Vidal, and R. Bajcsy, Berkeley MHAD: A comprehensive Multimodal Human Action Database, 2013 IEEE Workshop on Applications of Computer Vision (WACV), pp.53-60, 2013.
DOI : 10.1109/WACV.2013.6474999

D. Parikh and C. L. Zitnick, The role of features, algorithms and data in visual recognition, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.2328-2335, 2010.
DOI : 10.1109/CVPR.2010.5539920

K. Reddy and M. Shah, Recognizing 50 human action categories of web videos, Machine Vision and Applications, vol.24, issue.5, pp.971-981, 2013.
DOI : 10.1007/s00138-012-0450-4

M. Rohrbach, S. Amin, M. Andriluka, and B. Schiele, A database for fine grained activity detection of cooking activities, 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp.1194-1201, 2012.
DOI : 10.1109/CVPR.2012.6247801

C. Rother, V. Kolmogorov, and A. Blake, GrabCut: Interactive foreground extraction using iterated graph cuts, pp.309-314, 2004.

B. Sapp, D. Weiss, and B. Taskar, Parsing human motion with stretchable models, CVPR 2011, pp.1281-1288, 2011.
DOI : 10.1109/CVPR.2011.5995607

L. Sigal, A. Balan, and M. Black, HumanEva: Synchronized Video and Motion Capture Dataset and Baseline Algorithm for Evaluation of Articulated Human Motion, International Journal of Computer Vision, vol.87, issue.1-2, pp.4-27, 2010.
DOI : 10.1007/s11263-009-0273-6

V. K. Singh and R. Nevatia, Action recognition in cluttered dynamic scenes using Pose-Specific Part Models, 2011 International Conference on Computer Vision, pp.113-120, 2011.
DOI : 10.1109/ICCV.2011.6126232

D. Sun, S. Roth, and M. Black, A quantitative analysis of current practices in optical flow estimation and the principles behind them, IJCV, to appear, p.5, 2013.

M. Tenorth, J. Bandouch, and M. Beetz, The TUM Kitchen Data Set of everyday manipulation activities for motion tracking and action recognition, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, pp.1089-1096, 2009.
DOI : 10.1109/ICCVW.2009.5457583

K. Tran, I. Kakadiaris, and S. Shah, Modeling Motion of Body Parts for Action Recognition, Proceedings of the British Machine Vision Conference 2011, pp.64-65, 2011.
DOI : 10.5244/C.25.64

H. Wang, A. Kläser, C. Schmid, and C. Liu, Dense Trajectories and Motion Boundary Descriptors for Action Recognition, International Journal of Computer Vision, vol.103, issue.1, pp.60-79, 2013.
DOI : 10.1007/s11263-012-0594-8
URL : https://hal.archives-ouvertes.fr/hal-00725627

D. Weinland, R. Ronfard, and E. Boyer, A survey of vision-based methods for action representation, segmentation and recognition, CVIU, vol.115, issue.2, pp.224-241, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00459653

Y. Yacoob and M. Black, Parameterized modeling and recognition of activities, CVIU, vol.73, issue.2, pp.232-247, 1999.

Y. Yang and D. Ramanan, Articulated human detection with flexible mixtures of parts, PAMI, to appear, p.7.

A. Yao, J. Gall, and L. Van Gool, Coupled Action Recognition and Pose Estimation from Multiple Views, International Journal of Computer Vision, vol.100, issue.1, pp.16-37, 2012.
DOI : 10.1007/s11263-012-0532-9
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.308.5466

S. Zuffi and M. Black, Puppet flow, p.3, 2013.

S. Zuffi, O. Freifeld, and M. Black, From Pictorial Structures to deformable structures, 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp.3546-3553, 2012.
DOI : 10.1109/CVPR.2012.6248098