Visual geo-localization of non-photographic depictions via 2D-3D alignment

Mathieu Aubry 1, 2, 3 Bryan C. Russell 4 Josef Sivic 5, 6
6 WILLOW - Models of visual object recognition and scene understanding
DI-ENS - Département d'informatique de l'École normale supérieure, ENS Paris - École normale supérieure - Paris, Inria Paris-Rocquencourt, CNRS - Centre National de la Recherche Scientifique : UMR8548
Abstract : This chapter describes a technique that can geo-localize arbitrary 2D depictions of architectural sites, including drawings, paintings and historical pho-tographs. This is achieved by aligning the input depiction with a 3D model of the corresponding site. The task is very difficult as the appearance and scene structure in the 2D depictions can be very different from the appearance and geometry of the 3D model, e.g., due to the specific rendering style, drawing error, age, lighting or change of seasons. In addition, we face a hard search problem: the number of possible alignments of the depiction to a set of 3D models from different architec-tural sites is huge. To address these issues, we develop a compact representation of complex 3D scenes. 3D models of several scenes are represented by a set of discrim-inative visual elements that are automatically learnt from rendered views. Similar to object detection, the set of visual elements, as well as the weights of individual features for each element, are learnt in a discriminative fashion. We show that the learnt visual elements are reliably matched in 2D depictions of the scene despite large variations in rendering style (e.g. watercolor, sketch, historical photograph) and structural changes (e.g. missing scene parts, large occluders) of the scene. We demonstrate that the proposed approach can automatically identify the correct archi-tectural site as well as recover an approximate viewpoint of historical photographs and paintings with respect to the 3D model of the site. Fig. 1 Our system automatically geo-localizes paintings, drawings, and historical photographs by recovering their viewpoint with respect to a geo-referenced 3D model of the depicted architec-tural site. Here geo-localized paintings of Notre Dame in Paris are visualized in the Google Earth geobrowser.
Type de document :
Chapitre d'ouvrage
Springer. Visual Analysis and Geolocalization of Large-Scale Imagery, 2015
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-01119203
Contributeur : Mathieu Aubry <>
Soumis le : samedi 21 février 2015 - 23:37:16
Dernière modification le : vendredi 10 février 2017 - 12:57:05
Document(s) archivé(s) le : mardi 26 mai 2015 - 10:21:15

Fichier

chapter02.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01119203, version 1

Collections

Citation

Mathieu Aubry, Bryan C. Russell, Josef Sivic. Visual geo-localization of non-photographic depictions via 2D-3D alignment. Springer. Visual Analysis and Geolocalization of Large-Scale Imagery, 2015. <hal-01119203>

Partager

Métriques

Consultations de
la notice

543

Téléchargements du document

365