S. Avila, N. Thome, M. Cord, E. E. Valle, and A. D. Araújo, Pooling in image representation: The visual codeword point of view, Computer Vision and Image Understanding, vol.117, issue.5, pp.453-465, 2013.
DOI : 10.1016/j.cviu.2012.09.007

URL : https://hal.archives-ouvertes.fr/hal-01172709

H. Badino, D. Huber, and T. Kanade, Real-time topometric localization, 2012 IEEE International Conference on Robotics and Automation, pp.1635-1642, 2012.
DOI : 10.1109/ICRA.2012.6224716

M. Brubaker, A. Geiger, and R. Urtasun, Lost! Leveraging the Crowd for Probabilistic Visual Self-Localization, 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp.3057-3064, 2013.
DOI : 10.1109/CVPR.2013.393

G. Company, Pittsburgh dataset provided by google for research purposes

M. Cummins and P. Newman, FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance, The International Journal of Robotics Research, vol.27, issue.6, pp.647-665, 2008.
DOI : 10.1177/0278364908090961

M. Cummins and P. Newman, Appearance-only SLAM at large scale with FAB-MAP 2.0, The International Journal of Robotics Research, vol.2, issue.11, pp.1100-1123, 2011.
DOI : 10.1177/0278364910385483

C. Doersch, S. Singh, A. Gupta, J. Sivic, and A. A. Efros, What makes paris look like paris?, ACM Transactions on Graphics (SIGGRAPH), issue.4, pp.31-2012
URL : https://hal.archives-ouvertes.fr/hal-01053876

M. Fischler and R. Bolles, Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography, Communications of the ACM, vol.24, issue.6, pp.381-395, 1981.
DOI : 10.1145/358669.358692

M. Gebel and C. Weihs, Calibrating classifier scores into probabilities Advances in Data Analysis, 2007.

P. Gronat, G. Obozinski, J. Sivic, and T. Pajdla, Learning and calibrating per-location classifiers for visual place recognition, Proceedings of the Computer Vision and Pattern Recognition conference, pp.907-914, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00934332

J. Hays and A. Efros, IM2GPS: estimating geographic information from a single image, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2008.4587784

H. Jegou, F. Perronnin, M. Douze, J. Sanchez, P. Perez et al., Aggregating Local Image Descriptors into Compact Codes, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.9, pp.1704-1716, 2012.
DOI : 10.1109/TPAMI.2011.235

URL : https://hal.archives-ouvertes.fr/inria-00633013

J. Knopp and J. S. Pajdla, Avoiding Confusing Features in Place Recognition, Proceedings of the European Conference on Computer Vision, pp.748-761, 2010.
DOI : 10.1007/978-3-642-15549-9_54

M. Law, N. Thome, and M. Cord, Fantope Regularization in Metric Learning, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.138

URL : https://hal.archives-ouvertes.fr/hal-01094074

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.2169-2178, 2006.
DOI : 10.1109/CVPR.2006.68

URL : https://hal.archives-ouvertes.fr/inria-00548585

C. Lebarz, N. Thome, M. Cord, S. Herbin, and M. Sanfourche, Global robot ego-localization combining image retrieval and hmm-based filtering, 6th workshop on Planning Perception and Navigation for Autonomous Navigation, 2014.

W. Maddern, M. Milford, and G. Wyeth, CAT-SLAM: probabilistic localisation and mapping using a continuous appearance-based trajectory, The International Journal of Robotics Research, vol.25, issue.4, pp.429-451, 2012.
DOI : 10.1177/0278364912438273

A. Majdik, Y. Albers-schoenberg, and D. Scaramuzza, MAV urban localization from Google street view data, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp.3979-3986, 2013.
DOI : 10.1109/IROS.2013.6696925

B. Mcfee and G. Lanckriet, Metric learning to rank, ICML, 2010.

C. Mcmanus, B. Upcroft, and P. Newmann, Scene Signatures: Localised and Point-less Features for Localisation, Robotics: Science and Systems X, 2014.
DOI : 10.15607/RSS.2014.X.023

M. Milford and G. Wyeth, SeqSLAM: Visual route-based navigation for sunny summer days and stormy winter nights, 2012 IEEE International Conference on Robotics and Automation, pp.1643-1649, 2012.
DOI : 10.1109/ICRA.2012.6224623

E. Pepperell, P. Corke, and M. Milford, All-environment visual place recognition with SMART, 2014 IEEE International Conference on Robotics and Automation (ICRA), pp.1612-1618, 2014.
DOI : 10.1109/ICRA.2014.6907067

L. R. Rabiner, A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition, Proceedings of the IEEE, pp.257-286, 1989.
DOI : 10.1016/B978-0-08-051584-7.50027-9

S. Razavian, H. Azizpour, J. Sullivan, and S. Carlsson, CNN Features Off-the-Shelf: An Astounding Baseline for Recognition, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2014.
DOI : 10.1109/CVPRW.2014.131

G. Schindler, M. Brown, and R. Szeliski, City-Scale Location Recognition, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-7, 2007.
DOI : 10.1109/CVPR.2007.383150

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, pp.1470-1477, 2003.
DOI : 10.1109/ICCV.2003.1238663

G. Vaca-castano, A. Zamir, and M. Shah, City scale geospatial trajectory estimation of a moving camera, Proceedings of the Computer Vision and Pattern Recognition conference, pp.1186-1193, 2012.

A. Vedaldi and B. Fulkerson, Vlfeat, Proceedings of the international conference on Multimedia, MM '10, 2008.
DOI : 10.1145/1873951.1874249

A. Zamir and M. Shah, Accurate Image Localization Based on Google Maps Street View, Proceedings of the European Conference on Computer Vision, pp.255-268, 2010.
DOI : 10.1007/978-3-642-15561-1_19

J. Zhang, A. Hallquist, E. Liang, and A. Zakhor, Locationbased image retrieval for urban environments, Proceedings of the International Conference on Image Processing, pp.3677-3680, 2011.