Leveraging Large-Scale Uncurated Data for Unsupervised Pre-training of Visual Features

Mathilde Caron; Piotr Bojanowski; Julien Mairal; Armand Joulin

Preprint, Working Paper. Year: 2019

Mathilde Caron

Role: Author
PersonId: 1046708

Facebook AI Research [Paris]

Apprentissage de modèles à partir de données massives

Piotr Bojanowski

Role: Author
PersonId: 948453

Facebook AI Research [Paris]

Julien Mairal

Role: Author
PersonId: 1034832
ORCID: 0000-0001-6991-2110
IdRef: 152125256

Department of Statistics [Berkeley]

Apprentissage de modèles à partir de données massives

Armand Joulin

Role: Author
PersonId: 915272

Facebook AI Research [Paris]

Abstract

Pre-training general-purpose visual features with convolutional neural networks without relying on annotations is a challenging and important task. Most recent efforts in unsupervised feature learning have focused on either small or highly curated datasets like ImageNet, whereas using uncurated raw datasets was found to decrease the feature quality when evaluated on a transfer task. Our goal is to bridge the performance gap between unsupervised methods trained on curated data, which are costly to obtain, and massive raw datasets that are easily available. To that effect, we propose a new unsupervised approach which leverages self-supervision and clustering to capture complementary statistics from large-scale data. We validate our approach on 96 million images from YFCC100M, achieving state-of-the-art results among unsupervised methods on standard benchmarks, which confirms the potential of unsupervised learning when only uncurated data are available. We also show that pre-training a supervised VGG-16 with our method achieves 74.6% top-1 accuracy on the validation set of ImageNet classification, which is an improvement of +0.7% over the same network trained from scratch.

Domains

Computer Science [cs]; Computer Vision and Pattern Recognition [cs.CV]; Machine Learning [cs.LG]

Main file

main.pdf (2.18 MB)

Origin: Files produced by the author(s)

https://hal.science/hal-02119564

Submitted on: Friday, May 3, 2019, 19:30:52

Last modified on: Wednesday, April 3, 2024, 12:50:03

Dates and versions

hal-02119564, version 1 (2019-05-03)

hal-02119564, version 2 (2019-09-09)

Identifiers

HAL Id: hal-02119564, version 1

Cite

Mathilde Caron, Piotr Bojanowski, Julien Mairal, Armand Joulin. Leveraging Large-Scale Uncurated Data for Unsupervised Pre-training of Visual Features. 2019. ⟨hal-02119564v1⟩
