A. , A. Tombari, F. Rusu, R. And, and M. Vincze, OUR-CVFH?Oriented, Unique and Repeatable Clustered Viewpoint Feature Histogram for Object Recognition and 6DOF Pose Estimation, 2012.

A. , A. Vincze, M. Blodow, N. Gossow, D. Gedikli et al., Cad-model recognition and 6dof pose estimation using 3d cues, Computer Vision Workshops (ICCV Workshops) IEEE International Conference on, pp.585-592, 2011.

A. , S. Thome, N. Cord, M. Valle, E. And et al., Extended bow formalism for image classification, 18th IEEE International Conference on Image Processing, pp.2909-2912, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00625533

B. , J. Nie, J. And-paradis, and F. , Using language models for text classification, Proceedings of the Asia Information Retrieval Symposium, 2004.

J. K. Basu, D. Bhattacharyya, and T. And-kim, Use of artificial neural network in pattern recognition, International Journal of Software Engineering and Its Applications, vol.4, issue.2, 2010.

B. , H. Ess, A. Tuytelaars, T. And, and L. Van-gool, Speeded-up robust features (surf) Computer vision and image understanding, pp.346-359, 2008.

B. , H. Tuytelaars, T. And, and L. Van-gool, Surf: Speeded up robust features, Computer vision?ECCV 2006, pp.404-417, 2006.

Y. Bengio, Learning deep architectures for ai. Foundations and trends R in, Machine Learning, vol.2, issue.1, pp.1-127, 2009.
DOI : 10.1561/2200000006

B. , L. Ren, X. And-fox, and D. , Depth kernel descriptors for object recognition, Intelligent Robots and Systems (IROS) IEEE/RSJ International Conference on, pp.821-826, 2011.

B. , A. Pratikakis, I. And-perantonis, and S. , Bag of spatio-visual words for context inference in scene classification, Pattern Recognition, vol.46, issue.3, pp.1039-1053, 2013.

B. , B. E. Guyon, I. M. And-vapnik, and V. N. , A training algorithm for optimal margin classifiers, Proceedings of the fifth annual workshop on Computational learning theory, pp.144-152, 1992.

C. , G. Dance, C. Fan, L. Willamowski, J. And-bray et al., Visual categorization with bags of keypoints, Workshop on statistical learning in computer vision, ECCV, pp.1-2, 2004.

D. , M. Corke, P. Vasilescu, I. And-rus, and D. , Data muling over underwater wireless sensor networks using an autonomous underwater vehicle, Proceedings 2006 IEEE International Conference on Robotics and Automation, pp.2091-2098, 2006.

E. , A. Springenberg, J. T. Spinello, L. Riedmiller, M. And-burgard et al., Multimodal deep learning for robust rgb-d object recognition, Intelligent Robots and Systems (IROS), 2015 IEEE/RSJ International Conference on, pp.681-687, 2015.

F. , B. Ng, W. S. Chauhan, S. And-kwoh, and C. K. , The safety issues of medical robotics, Reliability Engineering & System Safety, vol.73, issue.2, pp.183-192, 2001.

F. , R. Perona, P. And-zisserman, and A. , Object class recognition by unsupervised scale-invariant learning, Computer Vision and Pattern Recognition Proceedings. 2003 IEEE Computer Society Conference on, p.264, 2003.

F. , J. And-disalvo, and C. , Service robots in the domestic environment: a study of the roomba vacuum in the home, Proceedings of the 1st ACM SIGCHI/SIGART conference on Human-robot interaction, pp.258-265, 2006.

G. , J. Burghouts, G. J. And-smeulders, and A. W. , The amsterdam library of object images, International Journal of Computer Vision, vol.61, issue.1, pp.103-112, 2005.

H. , G. E. Osindero, S. And-teh, and Y. , A fast learning algorithm for deep belief nets, Neural computation, vol.18, issue.7, pp.1527-1554, 2006.

H. , F. Xia, G. Wang, Z. Huang, X. Zhang et al., Unsupervised feature learning via spectral clustering of multidimensional patches for remotely sensed scene classification, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol.8, issue.5, 2015.

J. , A. Karayev, S. Jia, Y. Barron, J. T. Fritz et al., A category-level 3d object dataset: Putting the kinect to work, Consumer Depth Cameras for Computer Vision, pp.141-165, 2013.

J. , A. , A. Hebert, and M. , Using spin images for efficient object recognition in cluttered 3d scenes. Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol.21, issue.5, pp.433-449, 1999.

K. , R. Barat, C. Muselet, D. And-ducottet, and C. , Spatial orientations of visual word pairs to improve bag-of-visual-words model, Proceedings of the British Machine Vision Conference, pp.89-90, 2012.
URL : https://hal.archives-ouvertes.fr/ujm-00738708

L. , K. Bo, L. Ren, X. And-fox, and D. , A large-scale hierarchical multi-view rgb-d object dataset, Robotics and Automation (ICRA), 2011 IEEE International Conference on, pp.1817-1824, 2011.

L. , K. Bo, L. Ren, X. And-fox, and D. , A large-scale hierarchical multi-view rgb-d object dataset, Robotics and Automation (ICRA), 2011 IEEE International Conference on, pp.1817-1824, 2011.

L. , D. Verbeek, J. And-jurie, and F. , Category level object segmentation by combining bag-of-words models with dirichlet processes and random fields, International Journal of Computer Vision, vol.88, issue.2, pp.238-253, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00439303

L. , Y. Huang, F. J. And-bottou, and L. , Learning methods for generic object recognition with invariance to pose and lighting, Computer Vision and Pattern Recognition Proceedings of the 2004 IEEE Computer Society Conference on, p.97, 2004.

L. , M. Ma, W. Li, Z. And-wu, and L. , Visual language modeling for image classification, US Patent, vol.8126, p.274, 2012.

L. , T. Mei, T. Kweon, I. And-hua, and X. , Contextual bag-of-words for visual categorization . Circuits and Systems for Video Technology, IEEE Transactions on, vol.21, issue.4, pp.381-392, 2011.

M. , L. Otte, S. Hanten, R. And-zell, and A. , Revisiting deep convolutional neural networks for rgb-d based object recognition, International Conference on Artificial Neural Networks, pp.29-37, 2016.

M. , M. Ek, C. H. Detry, R. Hang, K. And-kragic et al., Improving generalization for 3d object categorization with global structure histograms, Intelligent Robots and Systems (IROS), 2012 IEEE/RSJ International Conference on, pp.1379-1386, 2012.

M. Donald and K. R. , Discrete language models for video retrieval, 2005.

M. , S. And-lowe, and D. G. , Local naive bayes nearest neighbor for image classification, Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, pp.3650-3656, 2012.

M. , A. Bennamoun, M. And-owens, and R. , On the repeatability and quality of keypoints for local feature-based 3d object retrieval from cluttered scenes, International Journal of Computer Vision, vol.89, pp.2-3, 2010.

N. , V. And-hinton, and G. , 3d object recognition with deep belief nets, Advances in Neural Information Processing Systems, pp.1339-1347, 2009.

O. , F. Z. Zrira, N. Bouyakhf, E. H. And-himmi, and M. , M. 3d object categorization and recognition based on deep belief networks and point clouds, Proceedings of the 13th International Conference on Informatics in Control, Automation and Robotics, pp.311-318, 2016.

P. , J. Chum, O. Isard, M. Sivic, J. And-zisserman et al., Object retrieval with large vocabularies and fast spatial matching, Computer Vision and Pattern Recognition, pp.1-8, 2007.

R. , R. Blodow, N. And-beetz, and M. , Fast point feature histograms (fpfh) for 3d registration, Robotics and Automation, 2009. ICRA'09. IEEE International Conference on, pp.3212-3217, 2009.

R. , R. Blodow, N. Marton, Z. And-beetz, and M. , Aligning point cloud views using persistent feature histograms, Intelligent Robots and Systems IEEE/RSJ International Conference on, pp.3384-3391, 2008.

R. , R. Bradski, G. Thibaux, R. And-hsu, and J. , Fast 3d recognition and pose using the viewpoint feature histogram, Intelligent Robots and Systems (IROS) IEEE/RSJ International Conference on, pp.2155-2162, 2010.

R. , R. And-cousins, and S. , 3D is here: Point Cloud Library (PCL), IEEE International Conference on Robotics and Automation (ICRA), 2011.

S. , S. And-fei-fei, and L. , 3d generic object categorization, localization and pose estimation, Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on, pp.1-8, 2007.

S. , M. Schulz, H. And-behnke, and S. , Rgb-d object recognition and pose estimation based on pre-trained convolutional neural network features, Robotics and Automation (ICRA), 2015 IEEE International Conference on, pp.1329-1335, 2015.

S. , P. Ali, S. And, and M. Shah, A 3-dimensional sift descriptor and its application to action recognition, Proceedings of the 15th international conference on Multimedia, pp.357-360, 2007.

S. , J. And-zisserman, and A. , Video google: A text retrieval approach to object matching in videos, Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on, pp.1470-1477, 2003.

S. , R. Huval, B. Bath, B. Manning, C. D. And-ng et al., Convolutionalrecursive deep learning for 3d object classification, Advances in Neural Information Processing Systems (2012), pp.665-673

T. , S. Wang, X. Lv, X. Han, T. X. Keller et al., Histogram of oriented normal vectors for object recognition with a depth sensor, Asian conference on computer vision, pp.525-538, 2012.

T. , R. Castellani, U. And-fusiello, and A. , A bag of words approach for 3d object categorization, Computer Vision/Computer Graphics CollaborationTechniques, pp.116-127, 2009.

T. , F. Salti, S. , A. D. Stefano, and L. , Unique signatures of histograms for local surface description, Computer Vision?ECCV 2010, pp.356-369, 2010.

T. , F. Salti, S. And-stefano, and L. , A combined texture-shape descriptor for enhanced 3d feature matching, Image Processing (ICIP), 2011 18th IEEE International Conference on, pp.809-812, 2011.

T. , A. Murphy, K. P. Freeman, W. T. And-rubin, and M. A. , Context-based vision system for place and object recognition, Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on, pp.273-280, 2003.

D. A. Vigo, F. S. Khan, J. Van-de-weijer, and T. And-gevers, The Impact of Color on Bag-of-Words Based Object Recognition, 2010 20th International Conference on Pattern Recognition, pp.1549-1553, 2010.
DOI : 10.1109/ICPR.2010.383

G. Visentin, M. Van-winnendael, and P. And-putz, Advanced mechatronics in ESA's space robotics developments, 2001 IEEE/ASME International Conference on Advanced Intelligent Mechatronics. Proceedings (Cat. No.01TH8556), pp.1261-1266, 2001.
DOI : 10.1109/AIM.2001.936901

W. , W. And, and M. Vincze, Ensemble of shape functions for 3d object classification, Robotics and Biomimetics (ROBIO), 2011 IEEE International Conference on, pp.2987-2992, 2011.

W. , L. Hoi, S. C. And, Y. , and N. , Semantics-preserving bag-of-words models and applications, Image Processing IEEE Transactions on, vol.19, issue.7, pp.1908-1920, 2010.

Z. , H. Berg, A. C. Maire, M. And-malik, and J. , Svm-knn: Discriminative nearest neighbor classification for visual category recognition, IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06), pp.2126-2136, 2006.

Z. , L. Wang, S. Liu, Z. And-tian, and Q. , Packing and padding: Coupled multi-index for accurate image retrieval, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.1939-1946, 2014.

Z. , L. Rao, A. B. And-zhang, and A. , Theory of keyblock-based image retrieval, ACM Transactions on Information Systems (TOIS), vol.20, issue.2, pp.224-257, 2002.