J. Sivic, B. C. Russell, A. A. Efros, A. Zisserman, and W. T. Freeman, Discovering objects and their location in images, vol.1, pp.370-377, 2005.

F. Perronnin and C. Dance, Fisher kernels on visual vocabularies for image categorization, CVPR, pp.1-8, 2007.

R. Arandjelović and A. Zisserman, All about VLAD, IEEE CVPR, 2013.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks, NIPS'12, vol.1, pp.1097-1105, 2012.

K. Simonyan, A. Vedaldi, and A. Zisserman, Deep Fisher networks for large-scale image classification, NIPS'13, pp.163-171, 2013.

R. Arandjelović, P. Gronát, A. Torii, T. Pajdla, and J. Sivic, NetVLAD: CNN architecture for weakly supervised place recognition, CoRR, 2015.

A. Barachant, S. Bonnet, M. Congedo, and C. Jutten, Classification of covariance matrices using a Riemannian-based kernel for BCI applications, NeuroComputing, vol.112, pp.172-178, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00820475

S. Akodad, L. Bombrun, J. Xia, Y. Berthoumieu, and C. Germain, Hybrid deep neural network based on the log-Euclidean Fisher vectors encoding of region covariance matrices, IEEE Trans. Geosci. Remote Sens, 2019.

S. Akodad, L. Bombrun, C. Yaacoub, Y. Berthoumieu, and C. Germain, Image classification based on log-Euclidean Fisher vectors for covariance matrix descriptors, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01930156

Z. Huang and L. V. Gool, A Riemannian network for SPD matrix learning, AAAI Conference on Artificial Intelligence, pp.2036-2042, 2017.

K. Yu and M. Salzmann, Second-order convolutional neural networks, CoRR, 2017.

D. Acharya, Z. Huang, D. P. Paudel, and L. V. Gool, Covariance pooling for facial expression recognition, CoRR, 2018.

E. Li, J. Xia, P. Du, C. Lin, and A. Samat, Integrating multilayer features of convolutional neural networks for remote sensing scene classification, IEEE Trans. Geosci. Remote Sens, vol.55, issue.10, pp.5653-5665, 2017.

V. Arsigny, P. Fillard, X. Pennec, and N. Ayache, Log-Euclidean metrics for fast and simple calculus on diffusion tensors, Magnetic Resonance in Medicine, vol.56, pp.411-421, 2006.
URL : https://hal.archives-ouvertes.fr/inria-00502678

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, CoRR, 2014.

N. He, L. Fang, S. Li, A. Plaza, and J. Plaza, Remote sensing scene classification using multilayer stacked covariance pooling, IEEE Trans. Geosci. Remote Sens, vol.56, issue.12, pp.6899-6910, 2018.