Unsupervised Object Discovery and Tracking in Video Collections

Suha Kwak; Minsu Cho; Ivan Laptev; Jean Ponce; Cordelia Schmid

doi:10.1109/ICCV.2015.363

Conference paper, Year: 2015

Suha Kwak

Role: Author

Laboratoire d'informatique de l'école normale supérieure

Models of visual object recognition and scene understanding

Minsu Cho

Role: Author
PersonId: 182032
IdHAL: minsu-cho
IdRef: 253128862

Laboratoire d'informatique de l'école normale supérieure

Models of visual object recognition and scene understanding

Ivan Laptev

Role: Author

Laboratoire d'informatique de l'école normale supérieure

Models of visual object recognition and scene understanding

Jean Ponce

Role: Author

Laboratoire d'informatique de l'école normale supérieure

Models of visual object recognition and scene understanding

Cordelia Schmid

Role: Author

Learning and recognition in vision

Abstract

This paper addresses the problem of automatically localizing dominant objects as spatio-temporal tubes in a noisy collection of videos with minimal or even no supervision. We formulate the problem as a combination of two complementary processes: discovery and tracking. The first one establishes correspondences between prominent regions across videos, and the second one associates successive similar object regions within the same video. Interestingly, our algorithm also discovers the implicit topology of frames associated with instances of the same object class across different videos, a role normally left to supervisory information in the form of class labels in conventional image and video understanding methods. Indeed, as demonstrated by our experiments, our method can handle video collections featuring multiple object classes, and substantially outperforms the state of the art in colocalization, even though it tackles a broader problem with much less supervision.

Subject areas

Computer Vision and Pattern Recognition [cs.CV]

Main file

video_obj_local.pdf (8.24 MB)

Origin: Files produced by the author(s)

https://hal.science/hal-01153017

Submitted on: Monday, December 7, 2015, 14:25:33

Last modified on: Saturday, April 27, 2024, 03:09:59

Long-term archiving on: Tuesday, March 8, 2016, 13:34:12

Dates and versions

hal-01153017, version 1 (19-05-2015)

hal-01153017, version 2 (07-12-2015)

Identifiers

HAL Id: hal-01153017, version 2
arXiv: 1505.03825
DOI: 10.1109/ICCV.2015.363

Cite

Suha Kwak, Minsu Cho, Ivan Laptev, Jean Ponce, Cordelia Schmid. Unsupervised Object Discovery and Tracking in Video Collections. ICCV - IEEE International Conference on Computer Vision, Dec 2015, Santiago, Chile. pp.3173-3181, ⟨10.1109/ICCV.2015.363⟩. ⟨hal-01153017v2⟩

Collections

ENS-PARIS UGA CNRS INRIA INSMI LJK LJK_GI LJK_GI_LEAR INRIA2 PSL
