Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014. ,
DOI : 10.1109/CVPR.2014.81
OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks, ICLR, 2014. ,
ImageNet Classification with Deep Convolutional Neural Networks, NIPS, 2012. ,
Learning Hierarchical Features for Scene Labeling, PAMI, 2013. ,
DOI : 10.1109/TPAMI.2012.231
URL : https://hal.archives-ouvertes.fr/hal-00742077
Indoor Semantic Segmentation using depth information, ICLR, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00805105
Gradient-based learning applied to document recognition, Proceedings of the IEEE, vol.86, issue.11, pp.2278-2324, 1998. ,
DOI : 10.1109/5.726791
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.138.1115
Combining modality specific deep neural networks for emotion recognition in video, ICMI, 2013. ,
DeepFace: Closing the Gap to Human-Level Performance in Face Verification, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014. ,
DOI : 10.1109/CVPR.2014.220
Spatio-Temporal Convolutional Sparse Auto-Encoder for Sequence Classification, Procedings of the British Machine Vision Conference 2012, 2012. ,
DOI : 10.5244/C.26.124
URL : https://hal.archives-ouvertes.fr/hal-01353046
Large-Scale Video Classification with Convolutional Neural Networks, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014. ,
DOI : 10.1109/CVPR.2014.223
Two-Stream Convolutional Networks for Action Recognition in Videos, 2014. ,
MoDeep: A Deep Learning Framework Using Motion Features for Human Pose Estimation, ACCV, 2014. ,
DOI : 10.1007/978-3-319-16808-1_21
ChaLearn Looking at People Challenge 2014: Dataset and Results, ECCVW, 2014. ,
DOI : 10.1007/978-3-319-16178-5_32
URL : https://hal.archives-ouvertes.fr/hal-01381162
Dense Trajectories and Motion Boundary Descriptors for Action Recognition, International Journal of Computer Vision, vol.73, issue.2, 2013. ,
DOI : 10.1007/s11263-012-0594-8
URL : https://hal.archives-ouvertes.fr/hal-00725627
Evaluation of local spatio-temporal features for action recognition, Procedings of the British Machine Vision Conference 2009, 2009. ,
DOI : 10.5244/C.23.124
URL : https://hal.archives-ouvertes.fr/inria-00439769
Behavior Recognition via Sparse Spatio-Temporal Features, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, 2005. ,
DOI : 10.1109/VSPETS.2005.1570899
Learning realistic human actions from movies, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008. ,
DOI : 10.1109/CVPR.2008.4587756
URL : https://hal.archives-ouvertes.fr/inria-00548659
A Spatio-Temporal Descriptor Based on 3D-Gradients, Procedings of the British Machine Vision Conference 2008, 2008. ,
DOI : 10.5244/C.22.99
An Efficient Dense and Scale-Invariant Spatio-Temporal Interest Point Detector, ECCV, 2008. ,
DOI : 10.1007/978-3-540-88688-4_48
Real-time human pose recognition in parts from single depth images, CVPR 2011, 2011. ,
DOI : 10.1109/CVPR.2011.5995316
Real time hand pose estimation using depth sensors, ICCV Workshop, 2011. ,
Real-Time Articulated Hand Pose Estimation Using Semi-supervised Transductive Regression Forests, 2013 IEEE International Conference on Computer Vision, 2013. ,
DOI : 10.1109/ICCV.2013.400
Real-Time Continuous Pose Recovery of Human Hands Using Convolutional Networks, ACM Transaction on Graphics, 2014. ,
DOI : 10.1145/2629500
Hand Segmentation with Structured Convolutional Learning, ACCV, 2014. ,
DOI : 10.1007/978-3-319-16811-1_45
URL : https://hal.archives-ouvertes.fr/hal-01419789
Efficient model-based 3D tracking of hand articulations using Kinect, Procedings of the British Machine Vision Conference 2011, 2011. ,
DOI : 10.5244/C.25.101
Realtime and Robust Hand Tracking from Depth, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014. ,
DOI : 10.1109/CVPR.2014.145
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.454.4572
Latent Regression Forest: Structured Estimation of 3D Articulated Hand Posture, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014. ,
DOI : 10.1109/CVPR.2014.490
Beyond Physical Connections: Tree Models in Human Pose Estimation, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013. ,
DOI : 10.1109/CVPR.2013.83
Detect What You Can: Detecting and Representing Objects Using Holistic Models and Body Parts, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014. ,
DOI : 10.1109/CVPR.2014.254
URL : http://arxiv.org/abs/1406.2031
Mining actionlet ensemble for action recognition with depth cameras, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012. ,
DOI : 10.1109/CVPR.2012.6247813
Unstructured Human Activity Detection from RGBD Images, ICRA, 2012. ,
Online RGB-D gesture recognition with extreme learning machines, Proceedings of the 15th ACM on International conference on multimodal interaction, ICMI '13, 2013. ,
DOI : 10.1145/2522848.2532591
A Multi-scale Boosted Detector for Efficient and Robust Gesture Recognition, ECCVW, 2014. ,
DOI : 10.1007/978-3-319-16178-5_34
Nonparametric Gesture Labeling from Multi-modal Data, ECCV Workshop, 2014. ,
DOI : 10.1007/978-3-319-16178-5_35
A Multi-modal Gesture Recognition System Using Audio, Video, and Skeletal Joint Data Categories and Subject Descriptors, ICMI Workshop, 2013. ,
Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis, CVPR 2011, 2011. ,
DOI : 10.1109/CVPR.2011.5995496
URL : http://ai.stanford.edu/~quocle/LeZouYeungNg11_appendix.pdf
Unsupervised Learning of Invariant Feature Hierarchies with Applications to Object Recognition, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007. ,
DOI : 10.1109/CVPR.2007.383157
Deep learning of invariant Spatio-Temporal Features from Video, NIPSW, 2010. ,
On feature combination for multiclass object classification, 2009 IEEE 12th International Conference on Computer Vision, 2009. ,
DOI : 10.1109/ICCV.2009.5459169
Robust Late Fusion With Rank Minimization, CVPR, 2012. ,
Sample-Specific Late Fusion for Visual Category Recognition, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013. ,
DOI : 10.1109/CVPR.2013.109
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.394.155
Feature Weighting via Optimal Thresholding for Video Analysis, 2013 IEEE International Conference on Computer Vision, 2013. ,
DOI : 10.1109/ICCV.2013.427
URL : https://opus.lib.uts.edu.au/bitstream/10453/29571/1/2013004175OK.pdf
Multimodal feature fusion for robust event detection in web videos, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012. ,
DOI : 10.1109/CVPR.2012.6247814
Multimodal learning with Deep Boltzmann Machines A multi-scale approach to gesture detection and recognition, NIPS, 2013. [48] ICCV Workshop, 2013. ,
Multi-scale Deep Learning for Gesture Detection and Localization, ECCVW, 2014. ,
DOI : 10.1007/978-3-319-16178-5_33
URL : https://hal.archives-ouvertes.fr/hal-01419792
The Moving Pose: An Efficient 3D Kinematics Descriptor for Low-Latency Action Recognition and Detection, 2013 IEEE International Conference on Computer Vision, 2013. ,
DOI : 10.1109/ICCV.2013.342
Recent advances in deep learning for speech recognition at Microsoft, ICASSP, 2013. ,
On combining classifiers using sum and product rules, Pattern Recognition Letters, pp.1283-1289, 2001. ,
DOI : 10.1016/S0167-8655(01)00073-3
The dropout learning algorithm, Artificial Intelligence, vol.210, pp.78-122, 2014. ,
DOI : 10.1016/j.artint.2014.02.004
Improving neural networks by preventing coadaptation of feature detectors, 2012. ,
Fast dropout training, ICML, 2013. ,
Elements of Large-Sample Theory, ICML, 1998. ,
DOI : 10.1007/b98855
Extremely randomized trees, Machine learning, pp.3-42, 2006. ,
DOI : 10.1007/s10994-006-6226-1
URL : https://hal.archives-ouvertes.fr/hal-00341932
Julius -an open source realtime large vocabulary recognition engine, Interspeech, 2001. ,
Gesture Recognition Using Template Based Random Forest Classifiers, ECCVW, 2014. ,
DOI : 10.1007/978-3-319-16178-5_41
Continuous Gesture Recognition from Articulated Poses, ECCV Workshop, 2014. ,
DOI : 10.1007/978-3-319-16178-5_42
URL : https://hal.archives-ouvertes.fr/hal-01082981
Action and Gesture Temporal Spotting with Super Vector Representation, ECCVW, 2014. ,
DOI : 10.1007/978-3-319-16178-5_36
Multi-modality Gesture Detection and Recognition With Unsupervision , Randomization and Discrimination, ECCVW, 2014. ,
DOI : 10.1007/978-3-319-16178-5_43
Sign Language Recognition Using Convolutional Neural Networks, ECCVW, 2014. ,
DOI : 10.1007/978-3-319-16178-5_40
URL : http://hdl.handle.net/1854/LU-5796137
Deep Dynamic Neural Networks for Gesture Segmentation and Recognition, ECCV Workshop, 2014. ,
DOI : 10.1007/978-3-319-16178-5_39
Gradient-based learning applied to document recognition, Proceedings of the IEEE, pp.2278-2324, 1998. ,
DOI : 10.1109/5.726791