G. Papandreou, L.-C. Chen, K. Murphy, and A. L. Yuille, Weakly- and semi-supervised learning of a DCNN for semantic image segmentation, 2015.

A. Vezhnevets, V. Ferrari, and J. Buhmann, Weakly supervised structured output learning for semantic segmentation, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6247757

P. O. Pinheiro and R. Collobert, From image-level to pixel-level labeling with Convolutional Networks, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
DOI : 10.1109/CVPR.2015.7298780

G. Hartmann, M. Grundmann, J. Hoffman, D. Tsai, V. Kwatra et al., Weakly Supervised Learning of Object Segmentations from Web-Scale Video, 2012.
DOI : 10.1007/978-3-642-33863-2_20

A. Monroy and B. Ommer, Beyond Bounding-Boxes: Learning Object Shape by Model-Driven Grouping, In: ECCV, 2012.
DOI : 10.1007/978-3-642-33712-3_42

J. Wu, Y. Zhao, J. Zhu, S. Luo, and Z. Tu, MILCut: A Sweeping Line Multiple Instance Learning Paradigm for Interactive Image Segmentation, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.40

T. Brox and J. Malik, Object Segmentation by Long Term Analysis of Point Trajectories, In: ECCV, 2010.
DOI : 10.1007/978-3-642-15555-0_21

L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille, Semantic image segmentation with deep convolutional nets and fully connected CRFs, In: ICLR, 2015.

C. Farabet, C. Couprie, L. Najman, and Y. LeCun, Learning Hierarchical Features for Scene Labeling, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.8, pp.1915-1929, 2013.
DOI : 10.1109/TPAMI.2012.231
URL : https://hal.archives-ouvertes.fr/hal-00742077

J. Long, E. Shelhamer, and T. Darrell, Fully convolutional networks for semantic segmentation, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
DOI : 10.1109/CVPR.2015.7298965

S. Zheng, S. Jayasumana, B. Romera-Paredes, V. Vineet, Z. Su et al., Conditional Random Fields as Recurrent Neural Networks, 2015 IEEE International Conference on Computer Vision (ICCV), 2015.
DOI : 10.1109/ICCV.2015.179

D. Pathak, E. Shelhamer, J. Long, and T. Darrell, Fully convolutional multi-class multiple instance learning, In: ICLR, 2015.

D. Pathak, P. Krähenbühl, and T. Darrell, Constrained Convolutional Neural Networks for Weakly Supervised Segmentation, 2015 IEEE International Conference on Computer Vision (ICCV), 2015.
DOI : 10.1109/ICCV.2015.209

A. Papazoglou and V. Ferrari, Fast Object Segmentation in Unconstrained Video, 2013 IEEE International Conference on Computer Vision, 2013.
DOI : 10.1109/ICCV.2013.223

A. Prest, C. Leistner, J. Civera, C. Schmid, and V. Ferrari, Learning object class detectors from weakly annotated video, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6248065
URL : https://hal.archives-ouvertes.fr/hal-00695940

M. Everingham, L. Van Gool, C. K. Williams, J. Winn, and A. Zisserman, The PASCAL Visual Object Classes Challenge 2012 (VOC2012) Results.

S. Kwak, M. Cho, I. Laptev, J. Ponce, and C. Schmid, Unsupervised Object Discovery and Tracking in Video Collections, 2015 IEEE International Conference on Computer Vision (ICCV), 2015.
DOI : 10.1109/ICCV.2015.363
URL : https://hal.archives-ouvertes.fr/hal-01153017

J. Carreira and C. Sminchisescu, CPMC: Automatic Object Segmentation Using Constrained Parametric Min-Cuts, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.7, pp.1312-1328, 2012.
DOI : 10.1109/TPAMI.2011.231

J. Carreira, R. Caseiro, J. Batista, and C. Sminchisescu, Semantic Segmentation with Second-Order Pooling, In: ECCV, 2012.
DOI : 10.1007/978-3-642-33786-4_32

G. Lin, C. Shen, A. van den Hengel, and I. Reid, Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
DOI : 10.1109/CVPR.2016.348

Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard et al., Backpropagation Applied to Handwritten Zip Code Recognition, Neural Computation, vol.1, issue.4, pp.541-551, 1989.
DOI : 10.1162/neco.1989.1.4.541

A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks, In: NIPS, 2012.

R. G. Cinbis, J. Verbeek, and C. Schmid, Multi-fold MIL Training for Weakly Supervised Object Localization, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.309
URL : https://hal.archives-ouvertes.fr/hal-00975746

O. Russakovsky, Y. Lin, K. Yu, and L. Fei-Fei, Object-Centric Spatial Pooling for Image Classification, In: ECCV, 2012.
DOI : 10.1007/978-3-642-33709-3_1

X. Chen, A. Shrivastava, and A. Gupta, NEIL: Extracting Visual Knowledge from Web Data, 2013 IEEE International Conference on Computer Vision, 2013.
DOI : 10.1109/ICCV.2013.178

S. K. Divvala, A. Farhadi, and C. Guestrin, Learning Everything about Anything: Webly-Supervised Visual Concept Learning, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.412

X. Chen and A. Gupta, Webly Supervised Learning of Convolutional Networks, 2015 IEEE International Conference on Computer Vision (ICCV), 2015.
DOI : 10.1109/ICCV.2015.168

X. Liang, S. Liu, Y. Wei, L. Liu, L. Lin et al., Towards Computational Baby Learning: A Weakly-Supervised Approach for Object Detection, 2015 IEEE International Conference on Computer Vision (ICCV), 2015.
DOI : 10.1109/ICCV.2015.120

C. Rother, T. Minka, A. Blake, and V. Kolmogorov, Cosegmentation of Image Pairs by Histogram Matching - Incorporating a Global Constraint into MRFs, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 1 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.91

A. Joulin, K. Tang, and L. Fei-Fei, Efficient Image and Video Co-localization with Frank-Wolfe Algorithm, In: ECCV, 2014.
DOI : 10.1007/978-3-319-10599-4_17

K. D. Tang, R. Sukthankar, J. Yagnik, and F. Li, Discriminative Segment Annotation in Weakly Labeled Video, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.
DOI : 10.1109/CVPR.2013.321

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, In: ICLR, 2015.

C. Rother, V. Kolmogorov, and A. Blake, "GrabCut", ACM Transactions on Graphics, vol.23, issue.3, pp.309-314, 2004.
DOI : 10.1145/1015706.1015720

Y. Boykov and M.-P. Jolly, Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001, 2001.
DOI : 10.1109/ICCV.2001.937505

Y. Boykov, O. Veksler, and R. Zabih, Fast approximate energy minimization via graph cuts, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.23, issue.11, pp.1222-1239, 2001.
DOI : 10.1109/34.969114

B. Hariharan, P. Arbelaez, L. Bourdev, S. Maji, and J. Malik, Semantic contours from inverse detectors, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126343

Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long et al., Caffe, Proceedings of the ACM International Conference on Multimedia, MM '14, 2014.
DOI : 10.1145/2647868.2654889

M. Mostajabi, P. Yadollahpour, and G. Shakhnarovich, Feedforward semantic segmentation with zoom-out features, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
DOI : 10.1109/CVPR.2015.7298959

R. Achanta, A. Shaji, K. Smith, A. Lucchi, P. Fua et al., SLIC Superpixels Compared to State-of-the-Art Superpixel Methods, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.11, pp.2274-2282, 2012.
DOI : 10.1109/TPAMI.2012.120