G. Williams, N. Wagener, B. Goldfain, P. Drews, J. M. Rehg et al., Information theoretic mpc for model-based reinforcement learning

U. Muller, J. Ben, E. Cosatto, B. Flepp, and Y. L. Cun, Offroad obstacle avoidance through end-to-end learning, Advances in neural information processing systems, pp.739-746, 2006.

A. Mahé, C. Pradalier, and M. Geist, Trajectory-control using deep system identication and model predictive control for drone control under uncertain load, 2018 22nd International Conference on System Theory, Control and Computing (ICSTCC), pp.753-758, 2018.

T. Schaul, J. Quan, I. Antonoglou, and D. Silver, Prioritized experience replay, 2015.

A. Katharopoulos and F. Fleuret, Biased importance sampling for deep neural network training, CoRR, 2017.

W. Rawat and Z. Wang, Deep convolutional neural networks for image classification: A comprehensive review, Neural computation, vol.29, issue.9, pp.2352-2449, 2017.

K. S. Narendra and K. Parthasarathy, Neural networks and dynamical systems, International Journal of Approximate Reasoning, vol.6, issue.2, pp.109-131, 1992.

K. S. Narendra and S. Mukhopadhyay, Intelligent control using neural networks, IEEE Control systems magazine, vol.12, issue.2, pp.11-18, 1992.

O. Ogunmolu, X. Gu, S. Jiang, and N. Gans, Nonlinear systems identification using deep dynamic neural networks, 2016.

B. De-moor, P. De-gersem, B. D. Schutter, and W. Favoreel, Daisy: A database for identification of systems, JOURNAL A, vol.38, pp.4-5, 1997.

Y. Lecun and Y. Bengio, Convolutional networks for images, speech, and time series, The handbook of brain theory and neural networks, vol.3361, p.1995, 1995.

Z. C. Lipton, J. Berkowitz, and C. Elkan, A critical review of recurrent neural networks for sequence learning, 2015.

K. Cho, B. Van-merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares et al., Learning phrase representations using rnn encoder-decoder for statistical machine translation, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01433235

S. Hochreiter and J. Schmidhuber, Long short-term memory, Neural computation, vol.9, issue.8, pp.1735-1780, 1997.

Z. Che, S. Purushotham, K. Cho, D. Sontag, and Y. Liu, Recurrent neural networks for multivariate time series with missing values, Scientific reports, vol.8, issue.1, p.6085, 2018.

V. Nair and G. E. Hinton, Rectified linear units improve restricted boltzmann machines, Proceedings of the 27th international conference on machine learning (ICML-10), pp.807-814, 2010.

A. L. Maas, A. Y. Hannun, and A. Y. Ng, Rectifier nonlinearities improve neural network acoustic models, Proc. icml, vol.30, p.3, 2013.

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, CoRR, 2014.

D. P. Kingma and J. Ba, Adam: A method for stochastic optimization, 2014.

N. Koenig and A. Howard, Design and use paradigms for gazebo, an open-source multi-robot simulator, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)(IEEE Cat. No. 04CH37566), vol.3, pp.2149-2154

M. Quigley, K. Conley, B. P. Gerkey, J. Faust, T. Foote et al., Ros: an open-source robot operating system, ICRA Workshop on Open Source Software, 2009.

M. M. Manhães, S. A. Scherer, M. Voss, L. R. Douat, and T. Rauschenbach, UUV simulator: A gazebo-based package for underwater intervention and multi-robot simulation, OCEANS 2016 MTS/IEEE Monterey, 2016.

M. Burri, J. Nikolic, P. Gohl, T. Schneider, J. Rehder et al., The euroc micro aerial vehicle datasets, The International Journal of Robotics Research, 2016.

M. A. , TensorFlow: Large-scale machine learning on heterogeneous systems, 2015, software available from tensorflow.org