A. Antoniou, A. J. Storkey, and H. Edwards, Augmenting Image Classifiers Using Data Augmentation Generative Adversarial Networks, International Conference on Artificial Neural Networks and Machine Learning (ICANN), vol.11141, pp.594-603, 2018.

A. Ayvaci, M. Raptis, and S. Soatto, Occlusion Detection and Motion Estimation with Convex Optimization, Advances in Neural Information Processing Systems (NIPS), pp.100-108, 2010.

A. Ayvaci, M. Raptis, and S. Soatto, Sparse Occlusion Detection with Optical Flow, International Journal of Computer Vision (IJCV), vol.97, issue.3, pp.322-338, 2012.

V. Badrinarayanan, A. Kendall, and R. Cipolla, SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol.39, issue.12, pp.2481-2495, 2017.

M. Bai and R. Urtasun, Deep Watershed Transform for Instance Segmentation, Conference on Computer Vision and Pattern Recognition (CVPR), pp.2858-2866, 2017.

A. Batra, S. Singh, G. Pang, S. Basu, C. Jawahar et al., Improved Road Connectivity by Joint Learning of Orientation and Segmentation, Conference on Computer Vision and Pattern Recognition (CVPR), pp.10385-10393, 2019.

S. Ben-David, J. Blitzer, K. Crammer, A. Kulesza, F. Pereira et al., A theory of learning from different domains, Machine Learning, vol.79, issue.1-2, pp.151-175, 2010.

S. Ben-David, T. Lu, T. Luu, and D. Pál, Impossibility Theorems for Domain Adaptation, International Conference on Artificial Intelligence and Statistics (AISTATS), JMLR.org, JMLR Proceedings, vol.9, pp.129-136, 2010.

Blender Online Community, Blender - a 3D modelling and rendering package, Blender Foundation, Blender Institute, 2016.

R. Brégier, F. Devernay, L. Leyrit, and J. L. Crowley, Symmetry Aware Evaluation of 3D Object Detection and Pose Estimation in Scenes of Many Parts in Bulk, International Conference on Computer Vision Workshops (ICCVW), pp.2209-2218, 2017.

H. Caesar, J. Uijlings, and V. Ferrari, COCO-Stuff: Thing and Stuff Classes in Context, Conference on Computer Vision and Pattern Recognition (CVPR), pp.1209-1218, 2018.

H. Cai, L. Zhu, and S. Han, ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware, International Conference on Learning Representations (ICLR), 2019.

L. C. Chen, Y. Zhu, G. Papandreou, F. Schroff, and H. Adam, Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation, European Conference on Computer Vision (ECCV) Part VII, vol.11211, pp.833-851, 2018.

E. D. Cubuk, B. Zoph, D. Mané, V. Vasudevan, and Q. V. Le, AutoAugment: Learning Augmentation Strategies From Data, Conference on Computer Vision and Pattern Recognition (CVPR), pp.113-123, 2019.

J. Dai, K. He, and J. Sun, Instance-Aware Semantic Segmentation via Multi-task Network Cascades, Conference on Computer Vision and Pattern Recognition (CVPR), pp.3150-3158, 2016.

R. Deng, C. Shen, S. Liu, H. Wang, and X. Liu, Learning to Predict Crisp Boundaries, European Conference on Computer Vision (ECCV) Part VI, vol.11210, pp.570-586, 2018.

T. T. Do, A. Nguyen, and I. D. Reid, AffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection, International Conference on Robotics and Automation (ICRA), pp.1-5, 2018.

X. Dong, Y. Yan, W. Ouyang, and Y. Yang, Style Aggregated Network for Facial Landmark Detection, Conference on Computer Vision and Pattern Recognition (CVPR), pp.379-388, 2018.

D. Eigen, C. Puhrsch, and R. Fergus, Depth Map Prediction from a Single Image using a Multi-Scale Deep Network, Advances in Neural Information Processing Systems (NIPS), pp.2366-2374, 2014.

M. Everingham, S. M. A. Eslami, L. Van Gool, C. K. I. Williams, J. Winn et al., The Pascal Visual Object Classes Challenge: A Retrospective, International Journal of Computer Vision (IJCV), vol.111, issue.1, pp.98-136, 2015.

R. Fan, M. M. Cheng, Q. Hou, T. J. Mu, J. Wang et al., S4Net: Single Stage Salient-Instance Segmentation, Conference on Computer Vision and Pattern Recognition (CVPR), pp.6103-6112, 2019.

P. Follmann, T. Böttger, P. Härtinger, R. König, and M. Ulrich, MVTec D2S: Densely Segmented Supermarket Dataset, European Conference on Computer Vision (ECCV) Part X, vol.11214, pp.581-597, 2018.

P. Follmann, R. König, P. Härtinger, M. Klostermann, and T. Böttger, Learning to See the Invisible: End-to-End Trainable Amodal Instance Segmentation, Winter Conference on Applications of Computer Vision (WACV), pp.1328-1336, 2019.

H. Fu, C. Wang, D. Tao, and M. J. Black, Occlusion Boundary Detection via Deep Exploration of Context, Conference on Computer Vision and Pattern Recognition (CVPR), pp.241-250, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01578439

H. Fu, M. Gong, C. Wang, K. Batmanghelich, and D. Tao, Deep Ordinal Regression Network for Monocular Depth Estimation, Conference on Computer Vision and Pattern Recognition (CVPR), pp.2002-2011, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01741163

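A. Gaidon, Q. Wang, Y. Cabon, and E. Vig, Virtual Worlds as Proxy for Multi-Object Tracking Analysis, Conference on Computer Vision and Pattern Recognition (CVPR), pp.4340-4349, 2016.

Y. Gan, X. Xu, W. Sun, and L. Lin, Monocular Depth Estimation with Affinity, Vertical Pooling, and Label Enhancement, European Conference on Computer Vision (ECCV) Part III, vol.11207, pp.232-247, 2018.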

A. Geiger, P. Lenz, C. Stiller, and R. Urtasun, Vision meets robotics: The KITTI dataset, International Journal of Robotics Research (IJRR), vol.32, issue.11, pp.1231-1237, 2013.

D. Geiger, B. Ladendorf, and A. L. Yuille, Occlusions and binocular stereo, International Journal of Computer Vision (IJCV), vol.14, issue.3, pp.211-226, 1995.

X. Glorot and Y. Bengio, Understanding the difficulty of training deep feedforward neural networks, International Conference on Artificial Intelligence and Statistics (AISTATS), JMLR.org, JMLR Proceedings, vol.9, pp.249-256, 2010.

N. Grammalidis and M. G. Strintzis, Disparity and occlusion estimation in multiocular systems and their coding for the communication of multiview image sequences, IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol.8, pp.328-344, 1998.

M. Grard, R. Brégier, F. Sella, E. Dellandréa, and L. Chen, Object Segmentation in Depth Maps with One User Click and a Synthetically Trained Fully Convolutional Network, 2017 International Workshop on Human-Friendly Robotics, vol.7, pp.207-221, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01674511

S. Guan, A. A. Khan, S. Sikdar, and P. V. Chitnis, Fully Dense UNet for 2D Sparse Photoacoustic Tomography Artifact Removal, Conference on Computer Vision and Pattern Recognition (CVPR), pp.587-595, 2017.

K. He, G. Gkioxari, P. Dollár, and R. B. Girshick, Mask R-CNN, International Conference on Computer Vision (ICCV), pp.2980-2988, 2017.

X. He and A. Yuille, Occlusion Boundary Detection Using Pseudo-depth, European Conference on Computer Vision (ECCV) Part IV, vol.6314, pp.539-552, 2010.

G. Huang, Z. Liu, L. van der Maaten, and K. Q. Weinberger, Densely Connected Convolutional Networks, Conference on Computer Vision and Pattern Recognition (CVPR), pp.2261-2269, 2017.

A. Humayun, O. Mac Aodha, and G. J. Brostow, Learning to find occlusion regions, Conference on Computer Vision and Pattern Recognition (CVPR), pp.2161-2168, 2011.

Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long et al., Caffe: Convolutional Architecture for Fast Feature Embedding, International Conference on Multimedia, ACM, MM'14, pp.675-678, 2014.

A. Kendall, Y. Gal, and R. Cipolla, Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics, Conference on Computer Vision and Pattern Recognition (CVPR), pp.7482-7491, 2018.

D. P. Kingma and J. Ba, Adam: A Method for Stochastic Optimization, International Conference on Learning Representations (ICLR), 2015.

A. Kirillov, E. Levinkov, B. Andres, B. Savchynskyy, and C. Rother, InstanceCut: From Edges to Instances with MultiCut, Conference on Computer Vision and Pattern Recognition (CVPR), pp.7322-7331, 2017.

A. Kirillov, Y. Wu, K. He, and R. B. Girshick, PointRend: Image Segmentation as Rendering, arXiv preprint arXiv:1912.08193, 2019.

S. Kong and C. C. Fowlkes, Recurrent Pixel Embedding for Instance Grouping, Conference on Computer Vision and Pattern Recognition (CVPR), pp.9018-9028, 2018.

W. Lee, J. Na, and G. Kim, Multi-Task Self-Supervised Object Detection via Recycling of Bounding Box Annotations, Conference on Computer Vision and Pattern Recognition (CVPR), pp.4984-4993, 2019.

B. Li, C. Shen, Y. Dai, A. van den Hengel, and M. He, Depth and surface normal estimation from monocular images using regression on deep features and hierarchical CRFs, Conference on Computer Vision and Pattern Recognition (CVPR), pp.1119-1127, 2015.

G. Li, Y. Xie, L. Lin, and Y. Yu, Instance-Level Salient Object Segmentation, Conference on Computer Vision and Pattern Recognition (CVPR), pp.247-256, 2017.

T. Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona et al., Microsoft COCO: Common Objects in Context, European Conference on Computer Vision (ECCV) Part V, vol.8693, pp.740-755, 2014.

T. Y. Lin, P. Goyal, R. B. Girshick, K. He, and P. Dollár, Focal Loss for Dense Object Detection, International Conference on Computer Vision (ICCV), pp.2999-3007, 2017.

F. Liu, C. Shen, G. Lin, and I. D. Reid, Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol.38, pp.2024-2039, 2016.

G. Liu, J. Si, Y. Hu, and S. Li, Photographic image synthesis with improved U-net, International Conference on Advanced Computational Intelligence (ICACI), pp.402-407, 2018.

R. Liu, J. Lehman, P. Molino, F. P. Such, E. Frank et al., An Intriguing Failing of Convolutional Neural Networks and the CoordConv Solution, Advances in Neural Information Processing Systems (NeurIPS), pp.9628-9639, 2018.

S. Liu, L. Qi, H. Qin, J. Shi, and J. Jia, Path Aggregation Network for Instance Segmentation, Conference on Computer Vision and Pattern Recognition (CVPR), pp.8759-8768, 2018.

S. Liu, E. Johns, and A. J. Davison, End-to-End Multi-Task Learning with Attention, Conference on Computer Vision and Pattern Recognition (CVPR), pp.1871-1880, 2019.

Y. Liu, M. M. Cheng, X. Hu, K. Wang, and X. Bai, Richer Convolutional Features for Edge Detection, Conference on Computer Vision and Pattern Recognition (CVPR), pp.5872-5881, 2017.

P. Luo, G. Wang, L. Lin, and X. Wang, Deep Dual Learning for Semantic Image Segmentation, International Conference on Computer Vision (ICCV), pp.2737-2745, 2017.

K. K. Maninis, J. Pont-Tuset, P. A. Arbeláez, and L. Van Gool, Convolutional Oriented Boundaries, European Conference on Computer Vision (ECCV) Part I, vol.9905, pp.580-596, 2016.

D. Martin, C. Fowlkes, D. Tal, and J. Malik, A Database of Human Segmented Natural Images and its Application to Evaluating Segmentation Algorithms and Measuring Ecological Statistics, International Conference on Computer Vision (ICCV), pp.416-423, 2001.

J. McCormac, A. Handa, S. Leutenegger, and A. J. Davison, SceneNet RGB-D: Can 5M Synthetic Images Beat Generic ImageNet Pre-training on Indoor Segmentation?, International Conference on Computer Vision (ICCV), pp.2697-2706, 2017.

I. Misra, A. Shrivastava, A. Gupta, and M. Hebert, Cross-Stitch Networks for Multi-task Learning, Conference on Computer Vision and Pattern Recognition (CVPR), pp.3994-4003, 2016.

D. Novotný, S. Albanie, D. Larlus, and A. Vedaldi, Semi-convolutional Operators for Instance Segmentation, European Conference on Computer Vision (ECCV) Part I, vol.11205, pp.89-105, 2018.

J. Pont-Tuset, P. Arbeláez, J. T. Barron, F. Marqués, and J. Malik, Multiscale Combinatorial Grouping for Image Segmentation and Object Proposal Generation, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol.39, issue.1, pp.128-140, 2017.

L. Qi, L. Jiang, S. Liu, X. Shen, and J. Jia, Amodal Instance Segmentation With KINS Dataset, Conference on Computer Vision and Pattern Recognition (CVPR), pp.3014-3023, 2019.

M. Ren and R. S. Zemel, End-to-End Instance Segmentation with Recurrent Attention, Conference on Computer Vision and Pattern Recognition (CVPR), pp.293-301, 2017.

X. Ren, C. C. Fowlkes, and J. Malik, Figure/Ground Assignment in Natural Images, European Conference on Computer Vision (ECCV) Part II, vol.3952, pp.614-627, 2006.

B. Romera-Paredes and P. Torr, Recurrent Instance Segmentation, European Conference on Computer Vision (ECCV) Part VI, vol.9910, pp.312-329, 2016.

O. Ronneberger, P. Fischer, and T. Brox, U-Net: Convolutional Networks for Biomedical Image Segmentation, Medical Image Computing and Computer-Assisted Intervention (MICCAI) Part III, vol.9351, pp.234-241, 2015.

G. Ros, L. Sellart, J. Materzynska, D. Vázquez, and A. M. López, The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes, Conference on Computer Vision and Pattern Recognition (CVPR), pp.3234-3243, 2016.

O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh et al., ImageNet Large Scale Visual Recognition Challenge, International Journal of Computer Vision (IJCV), vol.115, issue.3, pp.211-252, 2015.

W. Shi, J. Caballero, F. Huszar, J. Totz, A. P. Aitken et al., Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network, Conference on Computer Vision and Pattern Recognition (CVPR), pp.1874-1883, 2016.

K. Simonyan and A. Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition, International Conference on Learning Representations (ICLR), 2015.

D. Sun, C. Liu, and H. Pfister, Local Layering for Joint Motion Estimation and Occlusion Detection, Conference on Computer Vision and Pattern Recognition (CVPR), pp.1098-1105, 2014.

Z. Tang, X. Peng, S. Geng, L. Wu, S. Zhang et al., Quantized Densely Connected U-Nets for Efficient Landmark Localization, European Conference on Computer Vision (ECCV) Part III, vol.11207, pp.348-364, 2018.

G. Wang, X. Wang, F. Li, and X. Liang, DOOBNet: Deep Object Occlusion Boundary Detection from an Image, Asian Conference on Computer Vision (ACCV) Part VI, vol.11366, pp.686-702, 2018.

P. Wang and A. L. Yuille, DOC: Deep OCclusion Estimation from a Single Image, European Conference on Computer Vision (ECCV) Part I, vol.9905, pp.545-561, 2016.

P. Wang, P. Chen, Y. Yuan, D. Liu, Z. Huang et al., Understanding Convolution for Semantic Segmentation, Winter Conference on Applications of Computer Vision (WACV), pp.1451-1460, 2018.

Y. Wang, X. Zhao, and K. Huang, Deep Crisp Boundaries, Conference on Computer Vision and Pattern Recognition (CVPR), pp.1724-1732, 2017.

O. Williams, M. Isard, and J. MacCormick, Estimating Disparity and Occlusions in Stereo Video Sequences, Conference on Computer Vision and Pattern Recognition (CVPR), pp.250-257, 2005.

S. Xie and Z. Tu, Holistically-Nested Edge Detection, International Conference on Computer Vision (ICCV), pp.1395-1403, 2015.

J. Yang, B. L. Price, S. Cohen, H. Lee, and M. H. Yang, Object Contour Detection with a Fully Convolutional Encoder-Decoder Network, Conference on Computer Vision and Pattern Recognition (CVPR), pp.193-202, 2016.

J. Yosinski, J. Clune, Y. Bengio, and H. Lipson, How transferable are features in deep neural networks?, Advances in Neural Information Processing Systems (NIPS), pp.3320-3328, 2014.

F. Yu and V. Koltun, Multi-Scale Context Aggregation by Dilated Convolutions, International Conference on Learning Representations (ICLR), 2016.

J. Yu, L. Yang, N. Xu, J. Yang, and T. Huang, Slimmable Neural Networks, International Conference on Learning Representations (ICLR), 2019.

Z. Yu, W. Liu, Y. Zou, C. Feng, S. Ramalingam et al., Simultaneous Edge Alignment and Learning, European Conference on Computer Vision (ECCV) Part III, vol.11207, pp.400-417, 2018.

L. Zhang, X. Li, A. Arnab, K. Yang, Y. Tong et al., Dual Graph Convolutional Network for Semantic Segmentation, British Machine Vision Conference (BMVC), 2019.

Y. Zhu, Y. Tian, D. N. Metaxas, and P. Dollár, Semantic Amodal Segmentation, Conference on Computer Vision and Pattern Recognition (CVPR), pp.3001-3009, 2017.

C. L. Zitnick and T. Kanade, A Cooperative Algorithm for Stereo Matching and Occlusion Detection, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol.22, issue.7, pp.675-684, 2000.