M. Aubry, D. Maturana, A. A. Efros, B. C. Russell, and J. Sivic, Seeing 3D Chairs: Exemplar Part-Based 2D-3D Alignment Using a Large Dataset of CAD Models, 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp.3762-3769, 2014.
DOI : 10.1109/CVPR.2014.487

URL : https://hal.archives-ouvertes.fr/hal-01057240

V. Badrinarayanan, A. Kendall, and R. Cipolla, SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.39, issue.12, 2015.
DOI : 10.1109/TPAMI.2016.2644615

J. T. Barron and J. Malik, Intrinsic scene properties from a single RGB-D image, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.17-24, 2013.

M. Gschwandtner, R. Kwitt, A. Uhl, and W. Pree, BlenSor: Blender Sensor Simulation Toolbox, International Symposium on Visual Computing, pp.199-208, 2011.
DOI : 10.1007/s11721-008-0014-4

S. Gupta, P. Arbeláez, R. Girshick, and J. Malik, Aligning 3D models to RGB-D images of cluttered scenes, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.4731-4740, 2015.
DOI : 10.1109/CVPR.2015.7299105

S. Gupta, P. Arbeláez, and J. Malik, Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.
DOI : 10.1109/CVPR.2013.79

S. Gupta, R. Girshick, P. Arbeláez, and J. Malik, Learning Rich Features from RGB-D Images for Object Detection and Segmentation, European Conference on Computer Vision, pp.345-360, 2014.
DOI : 10.1007/978-3-319-10584-0_23

A. Handa, V. Patraucean, V. Badrinarayanan, S. Stent, and R. Cipolla, SceneNet: Understanding real world indoor scenes with synthetic data, 2015.
DOI : 10.1109/cvpr.2016.442

A. Handa, T. Whelan, J. McDonald, and A. J. Davison, A benchmark for RGB-D visual odometry, 3D reconstruction and SLAM, 2014 IEEE International Conference on Robotics and Automation (ICRA), pp.1524-1531, 2014.
DOI : 10.1109/ICRA.2014.6907054

Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long et al., Caffe, Proceedings of the ACM International Conference on Multimedia, MM '14, pp.675-678, 2014.
DOI : 10.1145/2647868.2654889

A. Levin, D. Lischinski, and Y. Weiss, Colorization using optimization, ACM Transactions on Graphics, vol.23, issue.3, pp.689-694, 2004.
DOI : 10.1145/1015706.1015780

J. J. Lim, H. Pirsiavash, and A. Torralba, Parsing IKEA objects: Fine pose estimation, Proceedings of the IEEE International Conference on Computer Vision, pp.2992-2999, 2013.

J. Long, E. Shelhamer, and T. Darrell, Fully convolutional networks for semantic segmentation, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.3431-3440, 2015.
DOI : 10.1109/CVPR.2015.7298965

J. McCormac, A. Handa, A. Davison, and S. Leutenegger, SemanticFusion: Dense 3D semantic mapping with convolutional neural networks, 2017 IEEE International Conference on Robotics and Automation (ICRA), 2016.
DOI : 10.1109/ICRA.2017.7989538

H. Noh, S. Hong, and B. Han, Learning Deconvolution Network for Semantic Segmentation, 2015 IEEE International Conference on Computer Vision (ICCV), pp.1520-1528, 2015.
DOI : 10.1109/ICCV.2015.178

X. Ren, L. Bo, and D. Fox, RGB-(D) scene labeling: Features and algorithms, Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, pp.2759-2766, 2012.

S. Sengupta, E. Greveson, A. Shahrokni, and P. H. S. Torr, Urban 3D semantic modelling using stereo vision, 2013 IEEE International Conference on Robotics and Automation, pp.580-585, 2013.
DOI : 10.1109/ICRA.2013.6630632

E. Shelhamer, J. Long, and T. Darrell, Fully Convolutional Networks for Semantic Segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.39, issue.4, 2016.
DOI : 10.1109/TPAMI.2016.2572683

N. Silberman, D. Hoiem, P. Kohli, and R. Fergus, Indoor Segmentation and Support Inference from RGBD Images, ECCV, 2012.
DOI : 10.1007/978-3-642-33715-4_54

S. Song and J. Xiao, Sliding Shapes for 3D Object Detection in Depth Images, European conference on computer vision, pp.634-651, 2014.
DOI : 10.1007/978-3-319-10599-4_41

A. Valada, J. Vertens, A. Dhall, and W. Burgard, AdapNet: Adaptive semantic segmentation in adverse environmental conditions, 2017 IEEE International Conference on Robotics and Automation (ICRA), 2017.
DOI : 10.1109/ICRA.2017.7989540

Z. Wu, S. Song, A. Khosla, F. Yu, L. Zhang, X. Tang, and J. Xiao, 3D ShapeNets: A deep representation for volumetric shapes, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.1912-1920, 2015.