T. Bogdan-alexe, V. Deselaers, and . Ferrari, What is an Object ?, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010.

T. Bogdan-alexe, V. Deselaers, and . Ferrari, Measuring the Objectness of Image Windows, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.11, pp.2189-2202, 2012.
DOI : 10.1109/TPAMI.2012.28

D. Anguelov, C. Dulong, D. Filip, C. Frueh, S. Lafon et al., Google Street View: Capturing the World at Street Level, Computer, vol.43, issue.6, pp.32-38, 2010.
DOI : 10.1109/MC.2010.170

E. Matthew, S. Antone, and . Teller, Automatic recovery of relative camera rotations for urban scenes, Computer Vision and Pattern Recognition Proceedings. IEEE Conference on, pp.282-289, 2000.

R. Arandjelovic and A. Zisserman, All About VLAD, 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp.1578-1585, 2013.
DOI : 10.1109/CVPR.2013.207

A. Armagan, M. Hirzer, M. Peter, V. Roth, and . Lepetit, Accurate camera registration in urban environments using high-level feature matching, British Machine Vision Conference, 2017.

A. Armagan, M. Hirzer, M. Peter, V. Roth, and . Lepetit, Learning to align semantic segmentation and 2.5 d maps for geolocalization, Conference on Computer Vision and Pattern Recognition, 2017.
DOI : 10.1109/cvpr.2017.488

C. Arth, C. Pirchheim, J. Ventura, D. Schmalstieg, and V. Lepetit, Instant Outdoor Localization and SLAM Initialization from 2.5D Maps, Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, pp.1309-1318, 2015.
DOI : 10.1109/TVCG.2015.2459772

V. Badrinarayanan, A. Handa, and R. Cipolla, SegNet : A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling, 2015.
DOI : 10.1109/tpami.2016.2644615

URL : https://doi.org/10.1109/tpami.2016.2644615

S. Baker and I. Matthews, Equivalence and efficiency of image alignment algorithms, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, 2001.
DOI : 10.1109/CVPR.2001.990652

A. Banerjee, S. Inderjit, J. Dhillon, S. Ghosh, and . Sra, Clustering on the unit hypersphere using von mises-fisher distributions, Journal of Machine Learning Research, vol.6, pp.1345-1382, 2005.

A. Bansal, B. Russell, and A. Gupta, Marr Revisited: 2D-3D Alignment via Surface Normal Prediction, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.5965-5974, 2016.
DOI : 10.1109/CVPR.2016.642

URL : http://arxiv.org/pdf/1604.01347

O. Barinova, V. Lempitsky, E. Tretiak, and P. Kohli, Geometric Image Parsing in Man-Made Environments, Computer Vision?ECCV, pp.57-70, 2010.
DOI : 10.1007/978-3-642-15552-9_5

T. Stephen and . Barnard, Interpreting perspective images, Artificial intelligence, vol.21, issue.4, pp.435-462, 1983.

H. Bay, T. Tuytelaars, and L. Van-gool, Surf : Speeded up robust features. Computer vision?ECCV, pp.404-417, 2006.
DOI : 10.1007/11744023_32

J. Bazin and M. Pollefeys, 3-line RANSAC for orthogonal vanishing point detection, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp.4282-4287, 2012.
DOI : 10.1109/IROS.2012.6385802

J. Bazin, Y. Seo, C. Demonceaux, P. Vasseur, K. Ikeuchi et al., Globally optimal line clustering and vanishing point estimation in Manhattan world, 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp.638-645, 2012.
DOI : 10.1109/CVPR.2012.6247731

URL : https://hal.archives-ouvertes.fr/hal-00697707

S. Benhimane and E. Malis, Real-time image-based tracking of planes using efficient second-order minimization, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566), pp.943-948, 2004.
DOI : 10.1109/IROS.2004.1389474

J. Burochin, B. Vallet, M. Brédif, C. Mallet, T. Brosset et al., Detecting blind building fa??ades from highly overlapping wide angle aerial imagery, ISPRS Journal of Photogrammetry and Remote Sensing, vol.96, pp.193-209, 2014.
DOI : 10.1016/j.isprsjprs.2014.07.011

URL : https://hal.archives-ouvertes.fr/hal-01559518/file/1-s2.0-S0924271614001944-main.pdf

M. Calonder, V. Lepetit, C. Strecha, and P. Fua, BRIEF: Binary Robust Independent Elementary Features, Computer Vision?ECCV, pp.778-792, 2010.
DOI : 10.1007/978-3-642-15561-1_56

URL : http://cvlab.epfl.ch/publications/publications/2010/LepetitF10.pdf

K. Chatfield, K. Simonyan, A. Vedaldi, and A. Zisserman, Return of the Devil in the Details: Delving Deep into Convolutional Nets, Proceedings of the British Machine Vision Conference 2014, 2014.
DOI : 10.5244/C.28.6

G. Liang-chieh-chen, I. Papandreou, K. Kokkinos, A. L. Murphy, and . Yuille, Deeplab : Semantic image segmentation with deep convolutional nets, atrous convolution , and fully connected crfs. arXiv preprint, 2016.

S. Chopra, R. Hadsell, and Y. Lecun, Learning a Similarity Metric Discriminatively, with Application to Face Verification, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.539-546, 2005.
DOI : 10.1109/CVPR.2005.202

H. Chu, S. Wang, R. Urtasun, and S. Fidler, HouseCraft: Building Houses from Rental Ads and Street Views, Proceedings of the European Conference on Computer Vision, pp.500-516, 2016.
DOI : 10.1109/ICCV.2013.51

D. Conrad, N. Guilherme, and . Desouza, Homography-based ground plane detection for mobile robot navigation using a Modified EM algorithm, 2010 IEEE International Conference on Robotics and Automation, pp.910-915, 2010.
DOI : 10.1109/ROBOT.2010.5509457

G. Csurka, C. Dance, L. Fan, J. Willamowski, and C. Bray, Visual categorization with bags of keypoints, Workshop on statistical learning in computer vision, ECCV, number 1-22 in 1, pp.1-2, 2004.

M. Cummins and P. Newman, FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance, The International Journal of Robotics Research, vol.2, issue.6, pp.647-665, 2008.
DOI : 10.1109/TRO.2004.835453

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.886-893, 2005.
DOI : 10.1109/CVPR.2005.177

URL : https://hal.archives-ouvertes.fr/inria-00548512

A. Dame and E. Marchand, Accurate real-time tracking using mutual information, 2010 IEEE International Symposium on Mixed and Augmented Reality, pp.47-56, 2010.
DOI : 10.1109/ISMAR.2010.5643550

URL : https://hal.archives-ouvertes.fr/inria-00544786

P. Arthur, . Dempster, M. Nan, . Laird, B. Donald et al., Maximum likelihood from incomplete data via the em algorithm, Journal of the royal statistical society. Series B (methodological ), pp.1-38, 1977.

P. Denis, H. James, . Elder, J. Francisco, and . Estrada, Efficient Edge-Based Methods for Estimating Manhattan Frames in Urban Imagery, European conference on computer vision, pp.197-210, 2008.
DOI : 10.1109/34.689301

D. Detone, T. Malisiewicz, and A. Rabinovich, Deep image homography estimation. arXiv preprint, 2016.

D. Eigen and R. Fergus, Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-scale Convolutional Architecture, 2015 IEEE International Conference on Computer Vision (ICCV), pp.2650-2658, 2015.
DOI : 10.1109/ICCV.2015.304

D. Georgios, R. Evangelidis, and . Horaud, Joint alignment of multiple point sets with batch and incremental expectation-maximization, IEEE transactions on pattern analysis and machine intelligence, 2017.

C. Farabet, C. Couprie, L. Najman, and Y. Lecun, Learning Hierarchical Features for Scene Labeling, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.8, pp.1915-1929, 2013.
DOI : 10.1109/TPAMI.2012.231

URL : https://hal.archives-ouvertes.fr/hal-00742077

A. Fond, M. Berger, and G. Simon, Prior-Based Facade Rectification for AR in Urban Environment, 2015 IEEE International Symposium on Mixed and Augmented Reality Workshops, pp.94-99, 2015.
DOI : 10.1109/ISMARW.2015.25

URL : https://hal.archives-ouvertes.fr/hal-01235842

A. Fond, M. Berger, and G. Simon, Facade Proposals for Urban Augmented Reality, 2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), 2017.
DOI : 10.1109/ISMAR.2017.20

URL : https://hal.archives-ouvertes.fr/hal-01562392

J. Franco and E. Boyer, Learning temporally consistent rigidities, CVPR 2011, pp.1241-1248, 2011.
DOI : 10.1109/CVPR.2011.5995440

URL : https://hal.archives-ouvertes.fr/inria-00583131

B. Fröhlich, E. Rodner, and J. Denzler, A Fast Approach for Pixelwise Labeling of Facade Images, 2010 20th International Conference on Pattern Recognition, pp.3029-3032, 2010.
DOI : 10.1109/ICPR.2010.742

K. Fukushima and S. Miyake, Neocognitron: A Self-Organizing Neural Network Model for a Mechanism of Visual Pattern Recognition, Competition and cooperation in neural nets, pp.267-285, 1982.
DOI : 10.1007/978-3-642-46466-9_18

R. Gadde, V. Jampani, R. Marlet, and P. Gehler, Efficient 2D and 3D Facade Segmentation using Auto-Context. CoRR, abs, 1606.
DOI : 10.1109/tpami.2017.2696526

URL : https://hal.archives-ouvertes.fr/hal-01743579

J. Gauvain and C. Lee, Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains, IEEE Transactions on Speech and Audio Processing, vol.2, issue.2, pp.291-298, 1994.
DOI : 10.1109/89.279278

B. Ghanem, A. Thabet, J. C. Niebles, and F. C. Heilbron, Robust Manhattan Frame estimation from a single RGB-D image, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.3772-3780, 2015.
DOI : 10.1109/CVPR.2015.7299001

URL : http://repository.kaust.edu.sa/kaust/bitstream/10754/556139/1/robust_layout_estimation_CVPR2015.pdf

R. Girshick, Fast R-CNN, 2015 IEEE International Conference on Computer Vision (ICCV), pp.1440-1448, 2015.
DOI : 10.1109/ICCV.2015.169

R. Girshick, J. Donahue, T. Darrell, and J. Malik, Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp.580-587, 2014.
DOI : 10.1109/CVPR.2014.81

URL : http://arxiv.org/pdf/1311.2524

I. Goodfellow, J. Pouget-abadie, M. Mirza, B. Xu, D. Warde-farley et al., Generative adversarial nets, Advances in neural information processing systems, pp.2672-2680, 2014.

K. Greff, J. Sjoerd-van-steenkiste, and . Schmidhuber, Neural expectation maximization, Advances in neural information processing systems, 2017.

D. Gregory, . Hager, N. Peter, and . Belhumeur, Efficient region tracking with parametric models of geometry and illumination, IEEE transactions on pattern analysis and machine intelligence, vol.20, issue.10, pp.1025-1039, 1998.

R. Hartley and A. Zisserman, Multiple view geometry in computer vision, 2003.
DOI : 10.1017/CBO9780511811685

K. He, X. Zhang, S. Ren, and J. Sun, Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition, Proceedings of the European Conference on Computer Vision, pp.346-361, 2014.

R. Horaud, F. Forbes, M. Yguel, G. Dewaele, and J. Zhang, Rigid and Articulated Point Registration with Expectation Conditional Maximization, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.33, issue.3, pp.587-602, 2011.
DOI : 10.1109/TPAMI.2010.94

URL : https://hal.archives-ouvertes.fr/inria-00435772

J. Hosang, R. Benenson, and B. Schiele, How good are detection proposals, really?, Proceedings of the British Machine Vision Conference 2014, 2014.
DOI : 10.5244/C.28.24

URL : http://www.bmva.org/bmvc/2014/files/abstract082.pdf

G. Andrew, M. Howard, B. Zhu, D. Chen, W. Kalenichenko et al., Mobilenets : Efficient convolutional neural networks for mobile vision applications. arXiv preprint, 2017.

H. David, . Hubel, N. Torsten, and . Wiesel, Receptive fields, binocular interaction and functional architecture in the cat's visual cortex, The Journal of physiology, vol.160, issue.1, pp.106-154, 1962.

Q. Du and . Huynh, Metrics for 3d rotations : Comparison and analysis, Journal of Mathematical Imaging and Vision, vol.35, issue.2, pp.155-164, 2009.

S. Iizuka, E. Simo-serra, and H. Ishikawa, Globally and locally consistent image completion, ACM Transactions on Graphics, vol.36, issue.4, p.107, 2017.
DOI : 10.1109/CVPR.2017.434

Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long et al., Caffe, Proceedings of the ACM International Conference on Multimedia, MM '14, 2014.
DOI : 10.1145/2647868.2654889

F. Jurie and M. Dhome, Real Time Robust Template Matching, Procedings of the British Machine Vision Conference 2002, pp.1-10, 2002.
DOI : 10.5244/C.16.10

URL : https://hal.archives-ouvertes.fr/inria-00548254

J. Karlekar, S. Z. Zhou, W. Lu, Y. Loh-zhi-chang, D. Nakayama et al., Positioning, tracking and mapping for outdoor augmentation, 2010 IEEE International Symposium on Mixed and Augmented Reality, pp.175-184, 2010.
DOI : 10.1109/ISMAR.2010.5643567

A. Kendall, M. Grimes, and R. Cipolla, PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization, 2015 IEEE International Conference on Computer Vision (ICCV), pp.2938-2946, 2015.
DOI : 10.1109/ICCV.2015.336

J. Kim, A. Jeffrey, and . Fessler, Intensity-Based Image Registration Using Robust Correlation Coefficients, IEEE Transactions on Medical Imaging, vol.23, issue.11, pp.1430-1444, 2004.
DOI : 10.1109/TMI.2004.835313

N. Kobyshev, H. Riemenschneider, and L. Van-gool, Matching Features Correctly through Semantic Understanding, 2014 2nd International Conference on 3D Vision, pp.472-479, 2014.
DOI : 10.1109/3DV.2014.15

J. Ko?ecká and W. Zhang, Extraction, matching, and pose recovery based on dominant rectangular structures, Computer Vision and Image Understanding, vol.100, issue.3, pp.274-293, 2005.
DOI : 10.1016/j.cviu.2005.04.005

M. Kozinski, R. Gadde, S. Zagoruyko, G. Obozinski, and R. Marlet, A MRF shape prior for facade parsing with occlusions, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.2820-2828, 2015.
DOI : 10.1109/CVPR.2015.7298899

URL : https://hal.archives-ouvertes.fr/hal-01232598

A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks, Advances in neural information processing systems, pp.1097-1105, 2012.
DOI : 10.1162/neco.2009.10-08-881

URL : http://dl.acm.org/ft_gateway.cfm?id=3065386&type=pdf

A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks, Communications of the ACM, vol.60, issue.6
DOI : 10.1162/neco.2009.10-08-881

URL : http://dl.acm.org/ft_gateway.cfm?id=3065386&type=pdf

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.2169-2178, 2006.
DOI : 10.1109/CVPR.2006.68

URL : https://hal.archives-ouvertes.fr/inria-00548585

Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, Gradient-based learning applied to document recognition, Proceedings of the IEEE, vol.86, issue.11, pp.2278-2324, 1998.
DOI : 10.1109/5.726791

URL : http://www.cs.berkeley.edu/~daf/appsem/Handwriting/papers/00726791.pdf

C. Ledig, L. Theis, F. Huszár, J. Caballero, A. Cunningham et al., Photorealistic single image super-resolution using a generative adversarial network. arXiv preprint, 2016.
DOI : 10.1109/cvpr.2017.19

URL : http://arxiv.org/pdf/1609.04802

J. Lezama, R. Grompone-von-gioi, G. Randall, and J. Morel, Finding Vanishing Points via Point Alignments in Image Primal and Dual Domains, 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp.509-515, 2014.
DOI : 10.1109/CVPR.2014.72

Y. Li, N. Snavely, P. Daniel, and . Huttenlocher, Location Recognition Using Prioritized Feature Matching, European conference on computer vision, pp.791-804, 2010.
DOI : 10.1007/978-3-642-15552-9_57

URL : http://www.cs.cornell.edu/%7Edph/papers/localization.pdf

F. Liu and S. Seipel, Detection of Facade Regions in Street View Images from Splitand-Merge of Perspective Patches, Journal of Image and Graphics, vol.2, issue.1, pp.8-14, 2014.

J. Liu and Y. Liu, Local Regularity-Driven City-Scale Facade Detection from Aerial Images, 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp.3778-3785, 2014.
DOI : 10.1109/CVPR.2014.489

URL : http://vision.cse.psu.edu/publications/pdfs/2014liuUrban/

J. Long, E. Shelhamer, and T. Darrell, Fully convolutional networks for semantic segmentation, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.3431-3440, 2015.
DOI : 10.1109/CVPR.2015.7298965

URL : http://arxiv.org/pdf/1411.4038

G. David and . Lowe, Distinctive image features from scale-invariant keypoints, Int. J. Comput. Vision, vol.60, issue.2, pp.91-110, 2004.

S. Lowry, N. Sünderhauf, P. Newman, J. John, D. Leonard et al., Visual Place Recognition: A Survey, IEEE Transactions on Robotics, vol.32, issue.1, pp.1-19, 2016.
DOI : 10.1109/TRO.2015.2496823

D. Bruce, T. Lucas, and . Kanade, An iterative image registration technique with an application to stereo vision, Proceedings of the 7th International Joint Conference on Artificial Intelligence (IJCAI), pp.647-679, 1981.

S. Mallat, Group Invariant Scattering, Communications on Pure and Applied Mathematics, vol.37, issue.10, pp.1331-1398, 2012.
DOI : 10.1137/S0036141002404838

URL : http://arxiv.org/pdf/1101.2286

S. Mallat, Understanding deep convolutional networks, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, vol.374, issue.2065, p.20150203, 2016.
DOI : 10.1109/CVPR.2015.7298904

URL : http://rsta.royalsocietypublishing.org/content/roypta/374/2065/20150203.full.pdf

E. Marchand, H. Uchiyama, and F. Spindler, Pose Estimation for Augmented Reality: A Hands-On Survey, IEEE Transactions on Visualization and Computer Graphics, vol.22, issue.12, pp.2633-2651, 2016.
DOI : 10.1109/TVCG.2015.2513408

URL : https://hal.archives-ouvertes.fr/hal-01246370

A. Martinovic, M. Mathias, J. Weissenberg, and L. J. Van-gool, A Three-Layered Approach to Facade Parsing, Proceedings of the European Conference on Computer Vision, pp.416-429, 2012.
DOI : 10.1007/978-3-642-33786-4_31

D. Mattes, R. David, H. Haynor, . Vesselle, K. Thomas et al., Nonrigid multimodality image registration, Medical imaging, vol.4322, issue.1, pp.1609-1620, 2001.
DOI : 10.1117/12.431046

B. Micusík, H. Wildenauer, and J. Kosecka, Detection and matching of rectilinear structures, 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-7, 2008.
DOI : 10.1109/CVPR.2008.4587488

J. Michael, . Milford, F. Gordon, and . Wyeth, Seqslam : Visual route-based navigation for sunny summer days and stormy winter nights, Robotics and Automation (ICRA), 2012 IEEE International Conference on, pp.1643-1649, 2012.

M. Faraz, . Mirzaei, I. Stergios, and . Roumeliotis, Optimal estimation of vanishing points in a manhattan world, Computer Vision (ICCV), 2011 IEEE International Conference on, pp.2454-2461, 2011.

M. Mohan, D. Gálvez-lópez, C. Monteleoni, and G. Sibley, Environment selection and hierarchical place recognition, 2015 IEEE International Conference on Robotics and Automation (ICRA), pp.5487-5494, 2015.
DOI : 10.1109/ICRA.2015.7139966

F. Monti, D. Boscaini, J. Masci, E. R. Svoboda, M. Michael et al., Geometric deep learning on graphs and manifolds using mixture model cnns. arXiv preprint, 2016.
DOI : 10.1109/cvpr.2017.576

URL : http://arxiv.org/pdf/1611.08402

D. Ok, M. Kozinski, R. Marlet, and N. Paragios, High-Level Bottom-Up Cues for Top-Down Parsing of Facade Images, 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission, pp.128-135, 2012.
DOI : 10.1109/3DIMPVT.2012.25

URL : https://hal.archives-ouvertes.fr/hal-00743043

A. Oliva and A. Torralba, Modeling the shape of the scene : A holistic representation of the spatial envelope, International Journal of Computer Vision, vol.42, issue.3, pp.145-175, 2001.
DOI : 10.1023/A:1011139631724

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, Computer Vision?ECCV, pp.143-156, 2010.
DOI : 10.1007/978-3-642-15561-1_11

URL : https://hal.archives-ouvertes.fr/inria-00548630

P. Josien, . Pluim, M. A. Jb-antoine-maintz, and . Viergever, Mutual-information-based registration of medical images : a survey, IEEE transactions on medical imaging, vol.22, issue.8, pp.986-1004, 2003.

B. Srinivasa, R. Biswanath, and N. Chatterji, An fft-based technique for translation, rotation, and scale-invariant image registration, IEEE Transactions on Image Processing, vol.5, issue.8, pp.1266-1271, 1996.

G. Reitmayr and T. Drummond, Going out: robust model-based tracking for outdoor augmented reality, 2006 IEEE/ACM International Symposium on Mixed and Augmented Reality, pp.109-118, 2006.
DOI : 10.1109/ISMAR.2006.297801

URL : http://mi.eng.cam.ac.uk/~gr281/docs/ReitmayrIsmar06GoingOut.pdf

I. Rocco, R. Arandjelovi?, and J. Sivic, Convolutional neural network architecture for geometric matching. arXiv preprint, 2017.
DOI : 10.1109/cvpr.2017.12

URL : https://hal.archives-ouvertes.fr/hal-01513001

O. Ronneberger, P. Fischer, and T. Brox, U-Net: Convolutional Networks for Biomedical Image Segmentation, International Conference on Medical Image Computing and Computer-Assisted Intervention, pp.234-241, 2015.
DOI : 10.1007/978-3-319-24574-4_28

URL : http://arxiv.org/pdf/1505.04597

E. Rosten and T. Drummond, Machine Learning for High-Speed Corner Detection, Computer Vision?ECCV, vol.1, pp.430-443, 2006.
DOI : 10.1109/ICNN.1995.489004

URL : http://mi.eng.cam.ac.uk/~er258/work/rosten_2006_machine.ps.gz

E. Rublee, V. Rabaud, K. Konolige, and G. Bradski, ORB: An efficient alternative to SIFT or SURF, 2011 International Conference on Computer Vision, pp.2564-2571, 2011.
DOI : 10.1109/ICCV.2011.6126544

URL : http://www.willowgarage.com/sites/default/files/orb_final.pdf

O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh et al., ImageNet Large Scale Visual Recognition Challenge, International Journal of Computer Vision, vol.1010, issue.1, pp.211-252, 2015.
DOI : 10.1007/978-3-642-15555-0_11

URL : http://dspace.mit.edu/bitstream/1721.1/104944/1/11263_2015_Article_816.pdf

G. Schindler, M. Brown, and R. Szeliski, City-Scale Location Recognition, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-7, 2007.
DOI : 10.1109/CVPR.2007.383150

H. Peter and . Schönemann, A generalized solution of the orthogonal procrustes problem, Psychometrika, vol.31, issue.1, pp.1-10, 1966.

G. Simon, A. Fond, and M. Berger, A Simple and Effective Method to Detect Orthogonal Vanishing Points in Uncalibrated Images of Man-Made Environments, Proceedings of Eurographics, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01275628

K. Simonyan and A. Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition. CoRR, abs, 1409.

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, p.1470, 2003.
DOI : 10.1109/ICCV.2003.1238663

R. Smriti, . Stredney, B. Schmalbrock, and . Clymer, Image registration using rigid registration and maximization of mutual information, MMVR13. The 13th Annual Medicine Meets Virtual Reality Conference, p.74, 2005.

N. Suenderhauf, S. Shirazi, A. Jacobson, F. Dayoub, E. Pepperell et al., Place Recognition with ConvNet Landmarks: Viewpoint-Robust, Condition-Robust, Training-Free, Robotics: Science and Systems XI, 2015.
DOI : 10.15607/RSS.2015.XI.022

J. Tardif, Non-iterative approach for fast and accurate vanishing point detection, 2009 IEEE 12th International Conference on Computer Vision, pp.1250-1257, 2009.
DOI : 10.1109/ICCV.2009.5459328

URL : http://www-etud.iro.umontreal.ca/~tardifj/fichiers/Tardif_ICCV2009.pdf

O. Teboul, I. Kokkinos, L. Simon, P. Koutsourakis, and N. Paragios, Parsing Facades with Shape Grammars and Reinforcement Learning, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.7, pp.1744-1756, 2013.
DOI : 10.1109/TPAMI.2012.252

URL : https://hal.archives-ouvertes.fr/hal-00855609

C. Toft, C. Olsson, F. Kahl, D. D. Gregorio, T. Cavallari et al., Long-Term 3D Localization and Pose from Semantic Labellings, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW), pp.650-659, 2017.
DOI : 10.1109/ICCVW.2017.83

A. Torii, J. Sivic, T. Pajdla, and M. Okutomi, Visual place recognition with repetitive structures, Proceedings of the IEEE conference on computer vision and pattern recognition, pp.883-890, 2013.
DOI : 10.1109/cvpr.2013.119

URL : https://hal.archives-ouvertes.fr/hal-00934288

J. R. Uijlings, K. E. Van-de-sande, T. Gevers, and A. W. Smeulders, Selective Search for Object Recognition, International Journal of Computer Vision, vol.57, issue.1, p.154, 2013.
DOI : 10.1023/B:VISI.0000013087.49260.fb

URL : http://www.science.uva.nl/research/publications/2011/vandeSandeICCV2011/vandesande_iccv2011.pdf

D. Ulyanov, A. Vedaldi, and V. Lempitsky, Deep image prior. arXiv preprint, 2017.

P. Viola and M. J. Jones, Robust Real-Time Face Detection, International Journal of Computer Vision, vol.57, issue.2, pp.137-154, 2004.
DOI : 10.1023/B:VISI.0000013087.49260.fb

URL : http://csdl.computer.org/comp/proceedings/iccv/2001/1143/02/114320747.pdf

P. Viola, M. William, and I. Wells, Alignment by maximization of mutual information, International Journal of Computer Vision, vol.24, issue.2, pp.137-154, 1997.
DOI : 10.1023/A:1007958904918

R. G. , V. Gioi, J. Jakubowicz, J. Morel, and G. Randall, Lsd : A fast line segment detector with a false detection control, IEEE transactions on pattern analysis and machine intelligence, vol.32, issue.4, pp.722-732, 2010.

S. Wang, M. Bai, G. Mattyus, H. Chu, W. Luo et al., TorontoCity: Seeing the World with a Million Eyes, 2017 IEEE International Conference on Computer Vision (ICCV), pp.3009-3017, 2017.
DOI : 10.1109/ICCV.2017.327

Y. Xu, S. Oh, and A. Hoogs, A Minimum Error Vanishing Point Detection Approach for Uncalibrated Monocular Images of Man-Made Environments, 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp.1376-1383, 2013.
DOI : 10.1109/CVPR.2013.181

C. Yang, T. Han, L. Quan, and C. Tai, Parsing facade with rank-one approximation, CVPR, pp.1720-1727, 2012.

E. Kwang-moo-yi, V. Trulls, P. Lepetit, and . Fua, Lift : Learned invariant feature transform, European Conference on Computer Vision, pp.467-483, 2016.

M. Zhai, S. Workman, and N. Jacobs, Detecting vanishing points using global image context in a non-manhattan world, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.5657-5665, 2016.

Z. Zhang, A. Ganesh, X. Liang, and Y. Ma, TILT: Transform Invariant Low-Rank Textures, International Journal of Computer Vision, vol.21, issue.1, pp.1-24, 2012.
DOI : 10.1137/100781894

J. Zhao, M. Mathieu, R. Goroshin, and Y. Lecun, Stacked what-where auto-encoders. arXiv preprint, 2015.

C. , L. Zitnick, and P. Dollár, Edge Boxes : Locating Object Proposals from Edges, Proceedings of the European Conference on Computer Vision, pp.391-405, 2014.
DOI : 10.1007/978-3-319-10602-1_26

URL : http://research.microsoft.com/en-us/um/people/larryz/ZitnickDollarECCV14edgeBoxes.pdf

S. Zokai and G. Wolberg, Image registration using log-polar mappings for recovery of large-scale similarity and projective transformations, IEEE Transactions on Image Processing, vol.14, issue.10, pp.1422-1434, 2005.
DOI : 10.1109/TIP.2005.854501