Weakly-Supervised Semantic Segmentation using Motion Cues

Pavel Tokmakov; Karteek Alahari; Cordelia Schmid

doi:10.1007/978-3-319-46493-0_24

Conference paper, Year: 2016


Pavel Tokmakov

Role: Author

Apprentissage de modèles à partir de données massives

Karteek Alahari

Role: Author
PersonId : 19670
IdHAL : karteek
ORCID : 0000-0002-1838-5936
IdRef : 196283892

Apprentissage de modèles à partir de données massives

Microsoft Research - Inria Joint Centre

Cordelia Schmid

Role: Author

Apprentissage de modèles à partir de données massives

Abstract

Fully convolutional neural networks (FCNNs) trained on a large number of images with strong pixel-level annotations have become the new state of the art for the semantic segmentation task. While there have been recent attempts to learn FCNNs from image-level weak annotations, they need additional constraints, such as the size of an object, to obtain reasonable performance. To address this issue, we present motion-CNN (M-CNN), a novel FCNN framework which incorporates motion cues and is learned from video-level weak annotations. Our learning scheme to train the network uses motion segments as soft constraints, thereby handling noisy motion information. When trained on weakly-annotated videos, our method outperforms the state-of-the-art approach on the PASCAL VOC 2012 image segmentation benchmark. We also demonstrate that the performance of M-CNN learned with 150 weak video annotations is on par with state-of-the-art weakly-supervised methods trained with thousands of images. Finally, M-CNN substantially outperforms recent approaches in a related task of video co-localization on the YouTube-Objects dataset.
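The idea of treating motion segments as soft rather than hard constraints can be illustrated with a small sketch. This is not the formulation used in the paper, which the abstract does not spell out. It is a minimal example under assumed conventions: class index 0 is background, the motion cue is a per-pixel boolean mask, and the function name `soft_motion_labels` and its `strength` parameter are hypothetical. The motion mask only adds a bias to the network's log-scores, so a confident network prediction can still overrule a noisy motion segment.

```python
import math


def soft_motion_labels(probs, motion_fg, video_class, strength=1.0):
    """Estimate per-pixel labels from network scores and a noisy motion mask.

    probs:       list of per-pixel probability lists; index 0 is background
                 (an assumed convention, not taken from the paper).
    motion_fg:   list of bools, True where motion segmentation says "foreground".
    video_class: index (>= 1) of the class named by the video-level label.
    strength:    hypothetical weight of the motion cue; 0 ignores motion.
    """
    labels = []
    for p, fg in zip(probs, motion_fg):
        # Work in log space so the motion cue acts as an additive bias.
        scores = [math.log(max(q, 1e-12)) for q in p]
        # Moving pixels favour the video-level class, static pixels favour background.
        favoured = video_class if fg else 0
        scores[favoured] += strength
        labels.append(max(range(len(scores)), key=scores.__getitem__))
    return labels


# Three pixels, two classes (0 = background, 1 = the video-level class).
probs = [[0.6, 0.4], [0.95, 0.05], [0.3, 0.7]]
motion_fg = [True, True, False]
print(soft_motion_labels(probs, motion_fg, video_class=1))
```

In this toy input the first pixel flips to the object class because of the motion cue, while the second stays background: the network is confident enough there to override a motion segment that is presumably noisy.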

Keywords

Semantic segmentation; Weakly-supervised learning

Domains

Computer Vision and Pattern Recognition [cs.CV]; Computer Science [cs]

Main file

mcnn.pdf (1.25 MB)

Origin: Files produced by the author(s)

https://hal.science/hal-01292794

Submitted on: Tuesday, August 2, 2016, 17:26:12

Last modified on: Thursday, April 4, 2024, 20:52:23

Long-term archiving on: Tuesday, November 8, 2016, 20:45:09

Dates and versions

hal-01292794 , version 1 (23-03-2016)

hal-01292794 , version 2 (28-07-2016)

hal-01292794 , version 3 (02-08-2016)

Identifiers

HAL Id : hal-01292794 , version 3
DOI : 10.1007/978-3-319-46493-0_24

Cite

Pavel Tokmakov, Karteek Alahari, Cordelia Schmid. Weakly-Supervised Semantic Segmentation using Motion Cues. ECCV - European Conference on Computer Vision, Oct 2016, Amsterdam, Netherlands. pp.388-404, ⟨10.1007/978-3-319-46493-0_24⟩. ⟨hal-01292794v3⟩

Collections

UGA CNRS INRIA LJK LJK_GI INRIA2 LJK-GI-THOTH

1774 views

1863 downloads
