Deformable Part-based Fully Convolutional Network for Object Detection

Taylor Mordan; Nicolas Thome; Matthieu Cord; Gilles Henaff

Communication Dans Un Congrès Année : 2017

Deformable Part-based Fully Convolutional Network for Object Detection

(1, 2) , (3) , (2) , (1)

1
2
3

Taylor Mordan

Fonction : Auteur

Thales Optronique S.A.S.

Machine Learning and Information Access

Nicolas Thome

Fonction : Auteur
PersonId : 181803
IdHAL : nicolas-thome
ORCID : 0000-0003-4871-3045
IdRef : 12878332X

Centre d'études et de recherche en informatique et communications

Matthieu Cord

Fonction : Auteur
PersonId : 13617
IdHAL : matthieucord
ORCID : 0000-0002-0627-5844
IdRef : 132968126

Machine Learning and Information Access

Gilles Henaff

Fonction : Auteur

Thales Optronique S.A.S.

Résumé

Existing region-based object detectors are limited to regions with fixed box geometry to represent objects, even if those are highly non-rectangular. In this paper we introduce DP-FCN, a deep model for object detection which explicitly adapts to shapes of objects with deformable parts. Without additional annotations, it learns to focus on discriminative elements and to align them, and simultaneously brings more invariance for classification and geometric information to refine localization. DP-FCN is composed of three main modules: a Fully Convolutional Network to efficiently maintain spatial resolution, a deformable part-based RoI pooling layer to optimize positions of parts and build invariance, and a deformation-aware localization module explicitly exploiting displacements of parts to improve accuracy of bounding box regression. We experimentally validate our model and show significant gains. DP-FCN achieves state-of-the-art performances of 83.1% and 80.9% on PASCAL VOC 2007 and 2012 with VOC data only.

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV] Apprentissage [cs.LG] Intelligence artificielle [cs.AI]

Taylor Mordan : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01637070

Soumis le : vendredi 17 novembre 2017-11:47:00

Dernière modification le : mardi 11 avril 2023-15:16:28

Dates et versions

hal-01637070 , version 1 (17-11-2017)

Identifiants

HAL Id : hal-01637070 , version 1
ARXIV : 1707.06175

Citer

Taylor Mordan, Nicolas Thome, Matthieu Cord, Gilles Henaff. Deformable Part-based Fully Convolutional Network for Object Detection. British Machine Vision Conference (BMVC), Sep 2017, London, United Kingdom. ⟨hal-01637070⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UPMC CNRS CNAM LIP6 SORBONNE-UNIVERSITE CEDRIC-CNAM SU-SCIENCES HESAM

163 Consultations

0 Téléchargements

Deformable Part-based Fully Convolutional Network for Object Detection

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager