J. Carreira, P. Agrawal, K. Fragkiadaki, and J. Malik, Human pose estimation with iterative error feedback, 2015.

L. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille, Semantic image segmentation with deep convolutional nets and fully connected crfs, 2014.

L. Chen, A. G. Schwing, A. L. Yuille, and R. Urtasun, Learning deep structured models, Proc. ICML, 2015.

R. Collobert, K. Kavukcuoglu, and C. Farabet, Torch7: A matlab-like environment for machine learning, BigLearn, NIPS Workshop, number EPFL-CONF-192376, 2011.

M. Cordts, M. Omran, S. Ramos, T. Rehfeld, M. Enzweiler et al., The cityscapes dataset for semantic urban scene understanding, Proceedings of the IEEE conference on computer vision and pattern recognition, pp.3213-3223, 2016.

D. Eigen and R. Fergus, Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture, Proceedings of the IEEE International Conference on Computer Vision, pp.2650-2658, 2015.

N. Einecke and J. Eggert, A multi-block-matching approach for stereo, 2015 IEEE Intelligent Vehicles Symposium (IV), p.11, 2015.

P. Fischer, A. Dosovitskiy, E. Ilg, P. Häusser, C. Hazırbaş et al., Flownet: Learning optical flow with convolutional networks, 2015.

F. Guney and A. Geiger, Displets: Resolving stereo ambiguities using object knowledge, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.11, 2015.

M. Havaei, A. Davy, D. Warde-Farley, A. Biard, A. Courville et al., Brain tumor segmentation with deep neural networks, Medical Image Analysis, issue.2, 2016.

K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition, vol.7, p.13, 2015.

K. He, X. Zhang, S. Ren, and J. Sun, Delving deep into rectifiers: Surpassing human-level performance on imagenet classification, Proceedings of the IEEE International Conference on Computer Vision, pp.1026-1034, 2015.

B. K. Horn and B. G. Schunck, Determining optical flow, Artificial intelligence, vol.17, issue.1-3, pp.185-203, 1981.

S. Ioffe and C. Szegedy, Batch normalization: Accelerating deep network training by reducing internal covariate shift, 2015.

D. Kingma and J. Ba, Adam: A method for stochastic optimization, 2014.

P. Krähenbühl and V. Koltun, Efficient inference in fully connected crfs with gaussian edge potentials, Adv. Neural Inf. Process. Syst, issue.2, 2011.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, Advances in neural information processing systems, pp.1097-1105, 2012.
DOI : 10.1145/3065386
URL : http://dl.acm.org/ft_gateway.cfm?id=3065386&type=pdf

J. Lafferty, A. McCallum, and F. Pereira, Conditional random fields: Probabilistic models for segmenting and labeling sequence data, Proceedings of the eighteenth international conference on machine learning, ICML, vol.1, pp.282-289, 2001.

Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, Gradient-based learning applied to document recognition, Proceedings of the IEEE, vol.86, pp.2278-2324, 1998.

K. Li, B. Hariharan, and J. Malik, Iterative instance segmentation, 2015.

J. Long, E. Shelhamer, and T. Darrell, Fully convolutional networks for semantic segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol.1, p.13, 2015.

W. Luo, A. G. Schwing, and R. Urtasun, Efficient deep learning for stereo matching, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol.1, p.7, 2016.

A. L. Maas, A. Y. Hannun, and A. Y. Ng, Rectifier nonlinearities improve neural network acoustic models, Proc. ICML, vol.30, 2013.

N. Mayer, E. Ilg, P. Häusser, P. Fischer, D. Cremers et al., A large dataset to train convolutional networks for disparity, optical flow, and scene flow estimation, vol.8, p.11, 2015.

M. Menze and A. Geiger, Object scene flow for autonomous vehicles, Conference on Computer Vision and Pattern Recognition (CVPR), 2015.

M. Menze, C. Heipke, and A. Geiger, Joint 3d estimation of vehicles and scene flow, ISPRS Workshop on Image Sequence Analysis (ISA), 2015.

A. Newell, K. Yang, and J. Deng, Stacked hourglass networks for human pose estimation, 2016.

H. Noh, S. Hong, and B. Han, Learning deconvolution network for semantic segmentation, Proceedings of the IEEE International Conference on Computer Vision, pp.1520-1528, 2015.

C. Russell, P. Kohli, and P. H. Torr, Associative hierarchical crfs for object class image segmentation, IEEE 12th International Conference on Computer Vision, pp.739-746, 2009.

D. Scharstein, H. Hirschmüller, Y. Kitajima, G. Krathwohl, N. Nešić et al., High-resolution stereo datasets with subpixel-accurate ground truth, German Conference on Pattern Recognition, pp.31-42, 2014.

A. G. Schwing and R. Urtasun, Fully connected deep structured networks, 2015.

A. Seki and M. Pollefeys, Patch based confidence prediction for dense disparity map, British Machine Vision Conference (BMVC), p.11, 2016.

J. Shotton, M. Johnson, and R. Cipolla, Semantic texton forests for image categorization and segmentation, Computer vision and pattern recognition, pp.1-8, 2008.

J. Shotton, T. Sharp, A. Kipman, A. Fitzgibbon, M. Finocchio et al., Real-time human pose recognition in parts from single depth images, Communications of the ACM, vol.56, issue.1, pp.116-124, 2013.

J. Shotton, J. Winn, C. Rother, and A. Criminisi, Textonboost for image understanding: Multi-class object recognition and segmentation by jointly modeling texture, layout, and context, International Journal of Computer Vision, vol.81, issue.1, pp.2-23, 2009.

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, 2014.

R. K. Srivastava, K. Greff, and J. Schmidhuber, , 2015.

O. Teboul, I. Kokkinos, L. Simon, P. Koutsourakis, and N. Paragios, Shape grammar parsing via reinforcement learning, Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, pp.2273-2280, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00856135

K. Yamaguchi, D. McAllester, and R. Urtasun, Efficient joint segmentation, occlusion labeling, stereo and flow estimation, European Conference on Computer Vision, p.11, 2014.

F. Yu and V. Koltun, Multi-scale context aggregation by dilated convolutions, 2015.

S. Zagoruyko and N. Komodakis, Learning to compare image patches via convolutional neural networks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.4353-4361, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01246261

J. Zbontar and Y. LeCun, Computing the stereo matching cost with a convolutional neural network, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.1592-1599, 2015.

J. Zbontar and Y. LeCun, Stereo matching by training a convolutional neural network to compare image patches, The Journal of Machine Learning Research, vol.17, issue.1, p.11, 2016.

S. Zheng, S. Jayasumana, B. Romera-Paredes, V. Vineet, Z. Su et al., Conditional random fields as recurrent neural networks, Proceedings of the IEEE International Conference on Computer Vision, pp.1529-1537, 2015.