N. C. Camgoz, A. A. Kindiroglu, and L. Akarun, Gesture Recognition Using Template Based Random Forest Classifiers, 2014.
DOI : 10.1007/978-3-319-16178-5_41

J. Y. Chang, Nonparametric Gesture Labeling from Multi-modal Data, 2014.
DOI : 10.1007/978-3-319-16178-5_35

R. Chaudhry, F. Ofli, G. Kurillo, R. Bajcsy, and R. Vidal, Bio-inspired Dynamic 3D Discriminative Skeletal Features for Human Action Recognition, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2013.
DOI : 10.1109/CVPRW.2013.153

G. Chen, D. Clarke, D. Weikersdorfer, M. Giuliani, A. Gaschler et al., Multi-modality Gesture Detection and Recognition with Un-supervision, Randomization and Discrimination, 2014.
DOI : 10.1007/978-3-319-16178-5_43

S. Escalera, X. Bar, J. Gonzlez, M. A. Bautista, M. Madadi et al., ChaLearn Looking at People Challenge 2014: Dataset and Results, 2014.
DOI : 10.1007/978-3-319-16178-5_32

URL : https://hal.archives-ouvertes.fr/hal-01381162

G. Evangelidis and C. Bauckhage, Efficient Subframe Video Alignment Using Short Descriptors, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.10, pp.2371-2386, 2013.
DOI : 10.1109/TPAMI.2013.56

URL : https://hal.archives-ouvertes.fr/hal-00862002

G. Evangelidis, G. Singh, and R. Horaud, Skeletal Quads: Human Action Recognition Using Joint Quadruples, 2014 22nd International Conference on Pattern Recognition, p.ICPR, 2014.
DOI : 10.1109/ICPR.2014.772

URL : https://hal.archives-ouvertes.fr/hal-00989725

G. D. Evangelidis and C. Bauckhage, Efficient and Robust Alignment of Unsynchronized Video Sequences, In: DAGM, 2011.
DOI : 10.1007/978-3-642-23123-0_29

URL : https://hal.archives-ouvertes.fr/hal-00864392

M. Hoai, Z. Z. Lan, and F. De-la-torre, Joint segmentation and classification of human actions in video, CVPR 2011, p.CVPR, 2011.
DOI : 10.1109/CVPR.2011.5995470

T. Jaakola and D. Haussler, Exploiting generative models in discriminative classifiers, p.NIPS, 1999.

K. Kulkarni, G. Evangelidis, J. Cech, and R. Horaud, Continuous action recognition based on sequence alignment, IJCV, 2014.
DOI : 10.1007/s11263-014-0758-9

URL : https://hal.archives-ouvertes.fr/hal-01058732

D. Lang, D. W. Hogg, K. Mierle, M. Blanton, and S. Roweis, Astrometry.net: Blind astrometric calibration of arbitrary astronomical images. The astronomical journal 137, pp.1782-2800, 2010.

B. Liang and L. Zheng, Multi-modal Gesture Recognition Using Skeletal Joints and Motion Trail Model, 2014.
DOI : 10.1007/978-3-319-16178-5_44

F. Lv and R. Nevatia, Recognition and Segmentation of 3-D Human Action Using HMM and Multi-class AdaBoost, p.ECCV, 2006.
DOI : 10.1007/11744085_28

C. Monnier, S. German, and A. Ost, A Multi-scale Boosted Detector for Efficient and Robust Gesture Recognition, 2014.
DOI : 10.1007/978-3-319-16178-5_34

N. Neverova, C. Wolf, G. W. Taylor, and F. Nebout, Multi-scale Deep Learning for Gesture Detection and Localization, p.ECCV Workshops, 2014.
DOI : 10.1007/978-3-319-16178-5_33

URL : https://hal.archives-ouvertes.fr/hal-01419792

E. Ohn-bar and M. M. Trivedi, Joint angles similiarities and hog 2 for action recognition, In: Computer Vision and Pattern Recognition Workshops (CVPRW), 2013.
DOI : 10.1109/cvprw.2013.76

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.676.3540

O. Oreifej and Z. Liu, HON4D: Histogram of Oriented 4D Normals for Activity Recognition from Depth Sequences, 2013 IEEE Conference on Computer Vision and Pattern Recognition, p.CVPR, 2013.
DOI : 10.1109/CVPR.2013.98

X. Peng, L. Wang, and Z. Cai, Action and Gesture Temporal Spotting with Super Vector Representation, 2014.
DOI : 10.1007/978-3-319-16178-5_36

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, p.ECCV, 2010.
DOI : 10.1007/978-3-642-15561-1_11

URL : https://hal.archives-ouvertes.fr/inria-00548630

L. Pigou, S. Dieleman, P. J. Kindermans, and B. Schrauwen, Sign Language Recognition Using Convolutional Neural Networks, 2014.
DOI : 10.1007/978-3-319-16178-5_40

Q. Shi, L. Cheng, L. Wang, and A. Smola, Human Action Segmentation and Recognition Using Discriminative Semi-Markov Models, International Journal of Computer Vision, vol.6, issue.4???5, pp.22-32, 2011.
DOI : 10.1007/s11263-010-0384-0

J. Shotton, A. Fitzgibbon, M. Cook, T. Sharp, M. Finocchio et al., Real-time human pose recognition in parts from single depth images, p.CVPR, 2011.

C. Sminchisescu, A. Kanaujia, and D. Metaxas, Conditional models for contextual human motion recognition, CVIU, vol.104, issue.2, pp.210-220, 2006.
DOI : 10.1109/iccv.2005.59

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.128.9287

T. Starner, J. Weaver, and A. Pentland, Real-time American sign language recognition using desk and wearable computer based video, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.20, issue.12, pp.1371-1375, 1998.
DOI : 10.1109/34.735811

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.125.8443

R. Vemulapalli, F. Arrate, and R. Chellappa, Human Action Recognition by Representing 3D Skeletons as Points in a Lie Group, 2014 IEEE Conference on Computer Vision and Pattern Recognition, p.CVPR, 2014.
DOI : 10.1109/CVPR.2014.82

A. W. Vieira, E. R. Nascimento, G. L. Oliveira, Z. Liu, and M. F. Campos, On the improvement of human action recognition from depth map sequences using Space???Time Occupancy Patterns, Pattern Recognition Letters, vol.36, pp.221-227, 2014.
DOI : 10.1016/j.patrec.2013.07.011

C. Vogler and D. Metaxas, ASL recognition based on a coupling between HMMs and 3D motion analysis, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271), p.ICCV, 1998.
DOI : 10.1109/ICCV.1998.710744

C. Wang, Y. Wang, and A. L. Yuille, An Approach to Pose-Based Action Recognition, 2013 IEEE Conference on Computer Vision and Pattern Recognition, p.CVPR, 2013.
DOI : 10.1109/CVPR.2013.123

H. Wang and C. Schmid, Action Recognition with Improved Trajectories, 2013 IEEE International Conference on Computer Vision, p.ICCV, 2013.
DOI : 10.1109/ICCV.2013.441

URL : https://hal.archives-ouvertes.fr/hal-00873267

J. Wang, Z. Liu, Y. Wu, and J. Yuan, Mining actionlet ensemble for action recognition with depth cameras, 2012 IEEE Conference on Computer Vision and Pattern Recognition, p.CVPR, 2012.
DOI : 10.1109/CVPR.2012.6247813

S. B. Wang, A. Quattoni, L. Morency, D. Demirdjian, and T. Darrell, Hidden conditional random fields for gesture recognition, p.CVPR, 2006.

D. Wu and L. Shao, Deep Dynamic Neural Networks for Gesture Segmentation and Recognition, In: ECCV Workshops, 2014.
DOI : 10.1007/978-3-319-16178-5_39

T. F. Wu, C. J. Lin, and R. C. Weng, Probability estimates for multi-class classification by pairwise coupling, The Journal of Machine Learning Research, vol.5, pp.975-1005, 2004.

L. Xia and J. Aggarwal, Spatio-temporal Depth Cuboid Similarity Feature for Activity Recognition Using Depth Camera, 2013 IEEE Conference on Computer Vision and Pattern Recognition, p.CVPR, 2013.
DOI : 10.1109/CVPR.2013.365

X. Yang and Y. Tian, Eigenjoints-based action recognition using naive-bayes-nearestneighbor, In: CVPR Workshops (CVPRW), 2012.
DOI : 10.1109/cvprw.2012.6239232

X. Yang and Y. Tian, Super Normal Vector for Activity Recognition Using Depth Sequences, 2014 IEEE Conference on Computer Vision and Pattern Recognition, p.CVPR, 2014.
DOI : 10.1109/CVPR.2014.108

M. Zanfir, M. Leordeanu, and C. Sminchisescu, The Moving Pose: An Efficient 3D Kinematics Descriptor for Low-Latency Action Recognition and Detection, 2013 IEEE International Conference on Computer Vision, pp.2752-2759, 2013.
DOI : 10.1109/ICCV.2013.342

Y. Zhu, W. Chen, and G. Guo, Fusing Spatiotemporal Features and Joints for 3D Action Recognition, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp.486-491, 2013.
DOI : 10.1109/CVPRW.2013.78