J. Long, E. Shelhamer, and T. Darrell, Fully convolutional networks for semantic segmentation, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.3431-3440, 2015.
DOI : 10.1109/CVPR.2015.7298965

M. Everingham, S. M. Eslami, L. V. Gool, C. K. Williams, J. Winn et al., The Pascal Visual Object Classes Challenge: A Retrospective, International Journal of Computer Vision, vol.34, issue.11, pp.98-136, 2014.
DOI : 10.1007/s11263-014-0733-5

T. Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona et al., Microsoft COCO: Common Objects in Context, Computer Vision ? ECCV 2014. Number 8693 in Lecture Notes in Computer Science, pp.740-755, 2014.
DOI : 10.1007/978-3-319-10602-1_48

A. Lagrange, L. Saux, B. Beaupere, A. Boulch, A. Chan-hon-tong et al., Benchmarking classification of earth-observation data: From learning explicit features to convolutional networks, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), pp.4173-4176, 2015.
DOI : 10.1109/IGARSS.2015.7326745

S. Paisitkriangkrai, J. Sherrah, P. Janney, and A. Van-den-hengel, Effective semantic pixel labelling with convolutional networks and Conditional Random Fields, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp.36-43, 2015.
DOI : 10.1109/CVPRW.2015.7301381

F. Rottensteiner, G. Sohn, J. Jung, M. Gerke, C. Baillard et al., THE ISPRS BENCHMARK ON URBAN OBJECT CLASSIFICATION AND 3D BUILDING RECONSTRUCTION, ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences, vol.3, issue.3, 2012.
DOI : 10.5194/isprsannals-I-3-293-2012

L. C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. Yuille, Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs, Proceedings of the International Conference on Learning Representations, 2015.

F. Yu and V. Koltun, Multi-Scale Context Aggregation by Dilated Convolutions, Proceedings of the International Conference on Learning Representations, 2015.

A. Arnab, S. Jayasumana, S. Zheng, and P. Torr, Higher Order Conditional Random Fields in Deep Neural Networks
DOI : 10.1007/978-3-319-46475-6_33

K. He, X. Zhang, S. Ren, and J. Sun, Deep Residual Learning for Image Recognition, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
DOI : 10.1109/CVPR.2016.90

Z. Wu, C. Shen, and A. Van-den-hengel, High-performance Semantic Segmentation Using Very Deep Fully Convolutional Networks

Z. Yan, H. Zhang, Y. Jia, T. Breuel, and Y. Yu, Combining the Best of Convolutional Layers and Recurrent Layers: A Hybrid Network for Semantic Segmentation, 2016.

J. Zhao, M. Mathieu, R. Goroshin, and Y. Lecun, Stacked What-Where Autoencoders, Proceedings of the International Conference on Learning Representations, 2015.

H. Noh, S. Hong, and B. Han, Learning Deconvolution Network for Semantic Segmentation, 2015 IEEE International Conference on Computer Vision (ICCV), pp.1520-1528, 2015.
DOI : 10.1109/ICCV.2015.178

V. Badrinarayanan, A. Kendall, and R. Cipolla, SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation. arXiv preprint arXiv:1511, p.561, 2015.

V. Mnih and G. E. Hinton, Learning to Detect Roads in High-Resolution Aerial Images, Computer Vision ? ECCV 2010. Number 6316 in Lecture Notes in Computer Science, pp.210-223, 2010.
DOI : 10.1007/978-3-642-15567-3_16

O. Penatti, K. Nogueira, D. Santos, and J. , Do deep features generalize from everyday objects to remote sensing and aerial scenes domains?, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp.44-51, 2015.
DOI : 10.1109/CVPRW.2015.7301382

K. Nogueira, O. A. Penatti, D. Santos, and J. A. , Towards better exploiting convolutional neural networks for remote sensing scene classification, Pattern Recognition, vol.61
DOI : 10.1016/j.patcog.2016.07.001

W. Zhao and S. Du, Learning multiscale and deep representations for classifying remotely sensed imagery, ISPRS Journal of Photogrammetry and Remote Sensing, vol.113, pp.155-165, 2016.
DOI : 10.1016/j.isprsjprs.2016.01.004

D. Marmanis, J. D. Wegner, S. Galliani, K. Schindler, M. Datcu et al., Semantic Segmentation of Aerial Images with an Ensemble of CNNs, ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences, vol.3, pp.473-480, 2016.

M. Gerke, Use of the Stair Vision Library within the ISPRS 2d Semantic Labeling Benchmark (Vaihingen) Technical report, International Institute for Geo- Information Science and Earth Observation, 2015.

K. Chatfield, K. Simonyan, A. Vedaldi, and A. Zisserman, Return of the Devil in the Details: Delving Deep into Convolutional Nets, Proceedings of the British Machine Vision Conference 2014, pp.6-7, 2014.
DOI : 10.5244/C.28.6

K. Simonyan and A. Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition, 2014.

S. Ioffe and C. Szegedy, Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift, Proceedings of the 32nd International Conference on Machine Learning, pp.448-456, 2015.

K. He, X. Zhang, S. Ren, and J. Sun, Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification, 2015 IEEE International Conference on Computer Vision (ICCV), pp.1026-1034, 2015.
DOI : 10.1109/ICCV.2015.123

D. A. Clevert, T. Unterthiner, and S. Hochreiter, Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs), Proceedings of the International Conference on Learning Representations, 2015.

D. Marmanis, M. Datcu, T. Esch, and U. Stilla, Deep Learning Earth Observation Classification Using ImageNet Pretrained Networks, IEEE Geoscience and Remote Sensing Letters, vol.13, issue.1, pp.105-109, 2016.
DOI : 10.1109/LGRS.2015.2499239

C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed et al., Going deeper with convolutions, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.1-9, 2015.
DOI : 10.1109/CVPR.2015.7298594

R. Liao, X. Tao, R. Li, Z. Ma, and J. Jia, Video Super-Resolution via Deep Draft-Ensemble Learning, 2015 IEEE International Conference on Computer Vision (ICCV), pp.531-539, 2015.
DOI : 10.1109/ICCV.2015.68

Z. Liao and G. Carneiro, Competitive Multi-scale Convolution

A. Eitel, J. T. Springenberg, L. Spinello, M. Riedmiller, and W. Burgard, Multimodal deep learning for robust RGB-D object recognition, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp.681-687, 2015.
DOI : 10.1109/IROS.2015.7353446

N. T. Quang, N. T. Thuy, D. V. Sang, and H. T. Binh, An Efficient Framework for Pixel-wise Building Segmentation from Aerial Images, Proceedings of the Sixth International Symposium on Information and Communication Technology, SoICT 2015, p.43, 2015.
DOI : 10.1145/2833258.2833272

A. Boulch, DAG of convolutional networks for semantic labeling, Office national d'´ etudes et de recherches aérospatiales, 2015.

G. Lin, C. Shen, A. Van-den-hengel, and I. Reid, Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
DOI : 10.1109/CVPR.2016.348

M. Cramer, The DGPF test on digital aerial camera evaluation ? overview and test design, Photogrammetrie ? Fernerkundung ? Geoinformation, vol.2, pp.73-82, 2010.