M. Andriluka, L. Pishchulin, P. Gehler, and B. Schiele, 2D Human Pose Estimation: New Benchmark and State of the Art Analysis, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.471

URL : http://ps.is.tue.mpg.de/publications/168/get_file/

M. Andriluka, S. Roth, and B. Schiele, Pictorial structures revisited: People detection and articulated pose estimation, 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp.1014-1021, 2009.
DOI : 10.1109/CVPR.2009.5206754

URL : http://www.gris.informatik.tu-darmstadt.de/~sroth/pubs/cvpr09andriluka.pdf

F. Baradel, C. Wolf, and J. Mille, Pose-conditioned spatio-temporal attention for human action recognition . arxiv, 1703.
URL : https://hal.archives-ouvertes.fr/hal-01593548

C. Cao, Y. Zhang, C. Zhang, and H. Lu, Body joint guided 3d deep convolutional descriptors for action recognition, 1704.
DOI : 10.1109/tcyb.2017.2756840

URL : http://arxiv.org/pdf/1704.07160

J. Carreira, P. Agrawal, K. Fragkiadaki, and J. Malik, Human Pose Estimation with Iterative Error Feedback, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
DOI : 10.1109/CVPR.2016.512

URL : http://arxiv.org/pdf/1507.06550

G. Ch-'eron, I. Laptev, and C. Schmid, P-CNN: Posebased CNN Features for Action Recognition, IEEE International Conference on Computer Vision (ICCV), 2015.

X. Chu, W. Yang, W. Ouyang, C. Ma, A. L. Yuille et al., Multi-context Attention for Human Pose Estimation, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
DOI : 10.1109/CVPR.2017.601

URL : http://arxiv.org/pdf/1702.07432

M. Dantone, J. Gall, C. Leistner, and L. V. , Human Pose Estimation Using Body Parts Dependent Joint Regressors, 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp.3041-3048, 2013.
DOI : 10.1109/CVPR.2013.391

URL : https://lirias.kuleuven.be/bitstream/123456789/398648/2/3601_open+access.pdf

S. Herath, M. Harandi, and F. Porikli, Going deeper into action recognition: A survey, Image and Vision Computing, vol.60, pp.4-21, 2017.
DOI : 10.1016/j.imavis.2017.01.010

URL : http://arxiv.org/pdf/1605.04988

C. Ionescu, D. Papava, V. Olaru, and C. Sminchisescu, Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.36, issue.7, pp.1325-1339, 2014.
DOI : 10.1109/TPAMI.2013.248

U. Iqbal, M. Garbade, and J. Gall, Pose for Action - Action for Pose, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017), 2017.
DOI : 10.1109/FG.2017.61

I. Kokkinos, UberNet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision Using Diverse Datasets and Limited Memory, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.2017
DOI : 10.1109/CVPR.2017.579

I. Lifshitz, E. Fetaya, and S. Ullman, Human Pose Estimation Using Deep Consensus Voting, pp.246-260
DOI : 10.1109/CVPR.2011.5995741

URL : http://arxiv.org/pdf/1603.08212

J. Liu, A. Shahroudy, D. Xu, and G. Wang, Spatiotemporal lstm with trust gates for 3d human action recognition, European Conference on Computer Vision (ECCV), pp.816-833, 2016.

J. Liu, G. Wang, P. Hu, L. Duan, and A. C. Kot, Global Context-Aware Attention LSTM Networks for 3D Action Recognition, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
DOI : 10.1109/CVPR.2017.391

D. C. Luvizon, H. Tabia, and D. Picard, Learning features combination for human action recognition from skeleton sequences, Pattern Recognition Letters, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01515376

J. Martinez, R. Hossain, J. Romero, and J. J. Little, A Simple Yet Effective Baseline for 3d Human Pose Estimation, 2017 IEEE International Conference on Computer Vision (ICCV), 2017.
DOI : 10.1109/ICCV.2017.288

D. Mehta, H. Rhodin, D. Casas, O. Sotnychenko, W. Xu et al., Monocular 3d human pose estimation using transfer learning and improved CNN supervision, 2016.

D. Mehta, S. Sridhar, O. Sotnychenko, H. Rhodin, M. Shafiei et al., VNect, ACM Transactions on Graphics, vol.36, issue.4, 2017.
DOI : 10.1145/2601097.2601165

URL : http://arxiv.org/pdf/1705.01583

A. Newell, K. Yang, and J. Deng, Stacked Hourglass Networks for Human Pose Estimation. European Conference on Computer Vision (ECCV), pp.483-499, 2016.

G. Ning, Z. Zhang, and Z. He, Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation, IEEE Transactions on Multimedia, vol.20, issue.5, pp.1-1, 2017.
DOI : 10.1109/TMM.2017.2762010

G. Pavlakos, X. Zhou, K. G. Derpanis, and K. Daniilidis, Coarse-to-fine volumetric prediction for singleimage 3D human pose, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.

L. Pishchulin, M. Andriluka, P. Gehler, and B. Schiele, Poselet Conditioned Pictorial Structures, 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp.588-595, 2013.
DOI : 10.1109/CVPR.2013.82

URL : http://www.cv-foundation.org/openaccess/content_cvpr_2013/papers/Pishchulin_Poselet_Conditioned_Pictorial_2013_CVPR_paper.pdf

L. Pishchulin, E. Insafutdinov, S. Tang, B. Andres, M. Andriluka et al., DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
DOI : 10.1109/CVPR.2016.533

U. Rafi, I. Kostrikov, J. Gall, and B. Leibe, An Efficient Convolutional Network for Human Pose Estimation, Procedings of the British Machine Vision Conference 2016, 2016.
DOI : 10.5244/C.30.109

N. Sarafianos, B. Boteanu, B. Ionescu, and I. A. , 3D Human pose estimation: A review of the literature and analysis of covariates, Computer Vision and Image Understanding, vol.152, pp.1-20, 2016.
DOI : 10.1016/j.cviu.2016.09.002

A. Shahroudy, J. Liu, T. Ng, and G. Wang, NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
DOI : 10.1109/CVPR.2016.115

]. A. Shahroudy, T. Ng, Y. Gong, and G. Wang, Deep Multimodal Feature Analysis for Action Recognition in RGB+D Videos, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.40, issue.5, 2017.
DOI : 10.1109/TPAMI.2017.2691321

J. Shotton, A. Fitzgibbon, M. Cook, T. Sharp, M. Finocchio et al., Real-time Human Pose Recognition in Parts from Single Depth Images, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), CVPR '11, pp.1297-1304, 2011.

S. Song, C. Lan, J. Xing, W. Z. , and J. Liu, An end-to-end spatio-temporal attention model for human action recognition from skeleton data, AAAI Conference on Artificial Intelligence, 2017.

X. Sun, J. Shang, S. Liang, and Y. Wei, Compositional Human Pose Regression, 2017 IEEE International Conference on Computer Vision (ICCV), 2017.
DOI : 10.1109/ICCV.2017.284

URL : http://arxiv.org/pdf/1704.00159

C. Szegedy, S. Ioffe, and V. Vanhoucke, Inception- v4, inception-resnet and the impact of residual connections on learning, 1602.

J. Tompson, R. Goroshin, A. Jain, Y. Lecun, and C. Bregler, Efficient object localization using Convolutional Networks, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.648-656, 2015.
DOI : 10.1109/CVPR.2015.7298664

URL : http://arxiv.org/pdf/1411.4280

A. Toshev and C. Szegedy, DeepPose: Human Pose Estimation via Deep Neural Networks, 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp.1653-1660, 2014.
DOI : 10.1109/CVPR.2014.214

URL : http://arxiv.org/pdf/1312.4659

G. Varol, I. Laptev, and C. Schmid, Long-term Temporal Convolutions for Action Recognition. TPAMI, 2017.
DOI : 10.1109/tpami.2017.2712608

URL : https://hal.archives-ouvertes.fr/hal-01241518

B. Xiaohan-nie, C. Xiong, and S. Zhu, Joint action recognition and pose estimation from video, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.

W. Yang, S. Li, W. Ouyang, H. Li, and X. Wang, Learning Feature Pyramids for Human Pose Estimation, 2017 IEEE International Conference on Computer Vision (ICCV), 2017.
DOI : 10.1109/ICCV.2017.144

URL : http://arxiv.org/pdf/1708.01101

K. M. Yi, E. Trulls, V. Lepetit, and P. Fua, LIFT: Learned Invariant Feature Transform. European Conference on Computer Vision (ECCV), 2016.

H. Zou and T. Hastie, Regularization and variable selection via the elastic net, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.5, issue.2, pp.301-320, 2005.
DOI : 10.1073/pnas.201162998