K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition, Proceedings of the IEEE conference on computer vision and pattern recognition, pp.770-778, 2016.

S. Gupta, P. Arbelaez, and J. Malik, Perceptual organization and recognition of indoor scenes from rgb-d images, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.564-571, 2013.

C. Couprie, C. Farabet, L. Najman, and Y. Lecun, Indoor semantic segmentation using depth information, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00805105

J. Long, E. Shelhamer, and T. Darrell, Fully convolutional networks for semantic segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.3431-3440, 2015.

G. Lin, A. Milan, C. Shen, and I. Reid, Refinenet: Multi-path refinement networks with identity mappings for high-resolution semantic segmentation, 2016.

B. Ummenhofer, H. Zhou, J. Uhrig, N. Mayer, E. Ilg et al., Demon: Depth and motion network for learning monocular stereo, 2016.

A. Kendall, M. Grimes, and R. Cipolla, Posenet: A convolutional network for real-time 6-dof camera relocalization, Proceedings of the IEEE international conference on computer vision, pp.2938-2946, 2015.

R. Gomez-ojeda, J. Briales, and J. Gonzalez-jimenez, Pl-svo: Semidirect monocular visual odometry by combining points and line segments, Intelligent Robots and Systems (IROS), pp.4211-4216, 2016.

L. Ma, C. Kerl, J. Stückler, and D. Cremers, Cpa-slam: Consistent plane-model alignment for direct rgb-d slam, Robotics and Automation (ICRA), 2016 IEEE International Conference on, pp.1285-1291, 2016.

R. F. Salas-moreno, R. A. Newcombe, H. Strasdat, P. H. Kelly, and A. J. Davison, Slam++: Simultaneous localisation and mapping at the level of objects, Proceedings of the IEEE conference on computer vision and pattern recognition, pp.1352-1359, 2013.

D. Nister, O. Naroditsky, and J. Bergen, Visual odometry, IEEE Conference on Computer Vision and Pattern Recognition, vol.1, 2004.

C. A. , E. Malis, and P. Rives, Accurate quadri-focal tracking for robust 3D visual odometry, IEEE International Conference on Robotics and Automation, 2007.
URL : https://hal.archives-ouvertes.fr/hal-01357382

C. Audras, A. I. Comport, M. Meilland, and P. Rives, Real-time dense rgb-d localisation and mapping, Australian Conference on Robotics and Automation, 2011.
URL : https://hal.archives-ouvertes.fr/hal-01357372

F. Steinbrücker, J. Sturm, and D. Cremers, Real-time visual odometry from dense rgb-d images, Computer Vision Workshops (ICCV Workshops, pp.719-722, 2011.

R. A. Newcombe, S. Izadi, O. Hilliges, D. Molyneaux, D. Kim et al., Kinectfusion: Real-time dense surface mapping and tracking, 10th IEEE international symposium on, pp.127-136, 2011.

T. Tykkälä, C. Audras, and A. I. Comport, Direct iterative closest point for real-time visual odometry, Computer Vision Workshops (ICCV Workshops, pp.2050-2056, 2011.

I. Melekhov, J. Kannala, and E. Rahtu, Relative camera pose estimation using convolutional neural networks, 2017.

S. Wang, R. Clark, H. Wen, and N. Trigoni, Deepvo: Towards end-toend visual odometry with deep recurrent convolutional neural networks, IEEE, pp.2043-2050, 2017.

S. Vijayanarasimhan, S. Ricco, C. Schmid, R. Sukthankar, and K. Fragkiadaki, Sfm-net: Learning of structure and motion from video, 2017.

T. Zhou, M. Brown, N. Snavely, and D. G. Lowe, Unsupervised learning of depth and ego-motion from video, 2017.

R. Li, S. Wang, Z. Long, and D. Gu, Undeepvo: Monocular visual odometry through unsupervised deep learning, 2017.

V. Peretroukhin and J. Kelly, Dpc-net: Deep pose correction for visual localization, 2017.

J. Czarnowski, S. Leutenegger, and A. Davison, Semantic texture for robust dense tracking, 2017.

S. Baker and I. Matthews, Lucas-kanade 20 years on: A unifying framework, International journal of computer vision, vol.56, issue.3, pp.221-255, 2004.

P. Krähenbühl and V. Koltun, Efficient inference in fully connected crfs with gaussian edge potentials, Advances in neural information processing systems, pp.109-117, 2011.

N. Silberman, D. Hoiem, P. Kohli, and R. Fergus, Indoor segmentation and support inference from rgbd images, Computer Vision-ECCV 2012, pp.746-760, 2012.

G. Blais and M. D. Levine, Registering multiview range data to create 3d computer objects, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.17, issue.8, pp.820-824, 1995.

L. Morency and T. Darrell, Stereo tracking using icp and normal flow constraint, Pattern Recognition, 2002. Proceedings. 16th International Conference on, vol.4, pp.367-372, 2002.

A. Dai, A. X. Chang, M. Savva, M. Halber, T. Funkhouser et al., Scannet: Richly-annotated 3d reconstructions of indoor scenes, Proc. Computer Vision and Pattern Recognition (CVPR), 2017.

J. Sturm, N. Engelhard, F. Endres, W. Burgard, and D. Cremers, A benchmark for the evaluation of rgb-d slam systems, Proc. of the International Conference on Intelligent Robot Systems (IROS), 2012.