J. Sivic, B. C. Russell, A. A. Efros, A. Zisserman, and W. T. Freeman, Discovering objects and their location in images, Tenth IEEE International Conference on Computer Vision (ICCV'05, vol.1, pp.370-377, 2005.

R. Arandjelovi? and A. Zisserman, All about VLAD, IEEE CVPR, 2013.

F. Perronnin and C. Dance, Fisher kernels on visual vocabularies for image categorization, IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2007.

M. Douze, A. Ramisa, and C. Schmid, Combining attributes and Fisher vectors for efficient image retrieval, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.745-752, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00566293

J. Sánchez, F. Perronnin, T. Mensink, and J. Verbeek, Image classification with the Fisher vector: Theory and practice, International Journal of Computer Vision, vol.105, issue.3, pp.222-245, 2013.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks, Proceedings of the 25th International Conference on Neural Information Processing Systems, vol.1, pp.1097-1105, 2012.

K. Simonyan, A. Vedaldi, and A. Zisserman, Deep Fisher networks for large-scale image classification, Proceedings of the 26th International Conference on Neural Information Processing Systems, vol.1, pp.163-171, 2013.

R. Arandjelovic, P. Gronát, A. Torii, T. Pajdla, and J. Sivic, NetVLAD: CNN architecture for weakly supervised place recognition, CoRR, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01242052

E. Li, J. Xia, P. Du, C. Lin, and A. Samat, Integrating multilayer features of convolutional neural networks for remote sensing scene classification, IEEE Transactions on Geoscience and Remote Sensing, vol.55, issue.10, pp.5653-5665, 2017.

B. Julesz, E. N. Gilbert, and J. D. Victor, Visual discrimination of textures with identical third-order statistics, Biological Cybernetics, vol.31, issue.3, pp.137-140, 1978.

M. Faraki, M. T. Harandi, A. Wiliem, and B. C. Lovell, Fisher tensors for classifying human epithelial cells, Pattern Recognition, vol.47, issue.7, pp.2348-2359, 2014.

M. Faraki, M. T. Harandi, and F. Porikli, More about VLAD: A leap from Euclidean to Riemannian manifolds, IEEE Conference on Computer Vision and Pattern Recognition, pp.4951-4960, 2015.

S. Akodad, L. Bombrun, C. Yaacoub, Y. Berthoumieu, and C. Germain, Image classification based on log-Euclidean Fisher vectors for covariance matrix descriptors, International Conference on Image Processing Theory, Tools and Applications (IPTA), 2018.
URL : https://hal.archives-ouvertes.fr/hal-01930156

I. Ilea, L. Bombrun, S. Said, and Y. Berthoumieu, Covariance matrices encoding based on the log-Euclidean and affine invariant Riemannian metrics, IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, ser. CVPRW'18, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01930136

C. Ionescu, O. Vantzos, and C. Sminchisescu, Matrix backpropagation for deep networks with structured layers, IEEE International Conference on Computer Vision (ICCV), pp.2965-2973, 2015.

Z. Huang and L. V. Gool, A Riemannian network for SPD matrix learning, AAAI Conference on Artificial Intelligence, pp.2036-2042, 2017.

K. Yu and M. Salzmann, Second-order convolutional neural networks, CoRR, 2017.

D. Acharya, Z. Huang, D. P. Paudel, and L. V. Gool, Covariance pooling for facial expression recognition, CoRR, 2018.

L. I. Kuncheva and C. J. Whitaker, Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy, Machine Learning, vol.51, pp.181-207, 2003.

S. Akodad, L. Bombrun, J. Xia, Y. Berthoumieu, and C. Germain, Hybrid deep neural network based on the log-Euclidean Fisher vectors encoding of region covariance matrices, IEEE Transactions on Geoscience and Remote Sensing, 2019.

N. He, L. Fang, S. Li, A. Plaza, and J. Plaza, Remote sensing scene classification using multilayer stacked covariance pooling, IEEE Transactions on Geoscience and Remote Sensing, vol.56, issue.12, pp.6899-6910, 2018.

V. Arsigny, P. Fillard, X. Pennec, and N. Ayache, Log-Euclidean metrics for fast and simple calculus on diffusion tensors, Magnetic Resonance in Medicine, vol.56, issue.2, pp.411-421, 2006.
URL : https://hal.archives-ouvertes.fr/inria-00502678

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, CoRR, 2014.