Semantic-only Visual Odometry based on dense class-level segmentation

Abstract : This paper proposes a novel approach called Semantic Visual Odometry (SemVO) which incorporates class-level consistency priors into the problem of 6-DoF Visual Odometry. Dense class-level labels are learnt for each pixel of the image using a CNN trained for semantic segmentation. A semantic error is formulated penalising the sum of squared differences (SSD) on class-level feature maps extracted from the decoder of a RefineNet. It will be shown how the proposed approach allows dense RGB-D camera tracking using solely a semantic error term. SemVO is evaluated on the ScanNet dataset and the results demonstrate how the number of classes affects performance. Results are also provided showing how best to fuse the new error function with classic dense photometric and geometric methods. Finally, it is demonstrated that SemVO improves over standard approaches for large camera motion applications.
Liste complète des métadonnées
Contributeur : Andrew Comport <>
Soumis le : vendredi 14 septembre 2018 - 14:00:02
Dernière modification le : jeudi 7 février 2019 - 17:48:32
Document(s) archivé(s) le : samedi 15 décembre 2018 - 14:08:49


Fichiers produits par l'(les) auteur(s)


  • HAL Id : hal-01874544, version 1



Howard Mahe, Denis Marraud, Andrew I. Comport. Semantic-only Visual Odometry based on dense class-level segmentation. International Conference on Pattern Recognition (ICPR 2018), Aug 2018, Pékin, China. 〈〉. 〈hal-01874544〉



Consultations de la notice


Téléchargements de fichiers