A. Achille and S. Soatto, A Separation Principle for Control in the Age of Deep Learning, Annual Review of Control, Robotics, and Autonomous Systems, vol.1, issue.1, 2017.
DOI : 10.1146/annurev-control-060117-105140

Agrawal et al., Learning to poke by poking: Experiential learning of intuitive physics, 2016.

S. Alvernaz and J. Togelius, Autoencoder-augmented neuroevolution for visual Doom playing, 2017 IEEE Conference on Computational Intelligence and Games (CIG), 2017.
DOI : 10.1109/CIG.2017.8080408

Assael et al., Data-efficient learning of feedback policies from image pixels using deep dynamical models, NIPS Deep Reinforcement Learning Workshop, 2015.

Beattie et al., DeepMind Lab, 2016.

Bellemare et al., The Arcade Learning Environment: An Evaluation Platform for General Agents, Journal of Artificial Intelligence Research, vol.47, 2013.
DOI : 10.1613/jair.3912

Bengio et al., Unsupervised feature learning and deep learning: A review and new perspectives, 2012.

Bohg et al., Interactive Perception: Leveraging Action in Perception and Perception in Action, IEEE Transactions on Robotics, vol.33, issue.6, pp.1273-1291, 2017.
DOI : 10.1109/TRO.2017.2721939

Bousmalis et al., Using simulation and domain adaptation to improve efficiency of deep robotic grasping, 2017.

Brockman et al., OpenAI Gym, 2016.

Böhmer et al., Autonomous Learning of State Representations for Control: An Emerging Field Aims to Autonomously Learn State Representations for Reinforcement Learning Agents from Their Real-World Sensor Observations, KI - Künstliche Intelligenz, vol.29, issue.4, pp.353-362, 2015.

Chen et al., InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets, Advances in Neural Information Processing Systems, pp.2172-2180, 2016.

Chopra et al., Learning a Similarity Metric Discriminatively, with Application to Face Verification, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.539-546, 2005.
DOI : 10.1109/CVPR.2005.202

Conti et al., Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents, 2017.

Curran et al., Dimensionality reduced reinforcement learning for assistive robots, 2016.

Curran et al., Using PCA to Efficiently Represent State Spaces, 2015.

M. P. Deisenroth and C. E. Rasmussen, PILCO: A model-based and data-efficient approach to policy search, Proceedings of the 28th International Conference on Machine Learning, pp.465-472, 2011.

Donahue et al., Adversarial feature learning, 2016.

W. Duan, Learning state representations for robotic control, 2017.

Dumoulin et al., Adversarially learned inference, arXiv preprint, 2016.

Engel et al., Learning to control an octopus arm with Gaussian process temporal difference methods, Advances in Neural Information Processing Systems, pp.347-354, 2006.

Finn et al., Deep spatial autoencoders for visuomotor learning, 2016 IEEE International Conference on Robotics and Automation (ICRA), 2015.
DOI : 10.1109/ICRA.2016.7487173

I. K. Fodor, A survey of dimension reduction techniques, 2002.
DOI : 10.2172/15002155

Goodfellow et al., Generative adversarial nets, Advances in Neural Information Processing Systems, pp.2672-2680, 2014.

B. Goodman and S. Flaxman, European Union regulations on algorithmic decision-making and a "right to explanation", arXiv preprint, 2016.

Goroshin et al., Learning to linearize under uncertainty, 2015.

D. Ha and J. Schmidhuber, World Models, 2018.

Henderson et al., Deep reinforcement learning that matters, 2017.

Higgins et al., beta-VAE: Learning basic visual concepts with a constrained variational framework, 2016.

Higgins et al., DARLA: Improving Zero-Shot Transfer in Reinforcement Learning, 2017.

Hinton et al., A Fast Learning Algorithm for Deep Belief Nets, Neural Computation, vol.18, issue.7, pp.1527-1554, 2006.
DOI : 10.1162/neco.2006.18.7.1527

P. Indyk, Algorithmic applications of low-distortion geometric embeddings, Proceedings 42nd IEEE Symposium on Foundations of Computer Science, pp.10-33, 2001.
DOI : 10.1109/SFCS.2001.959878

Jimenez Rezende et al., Stochastic Backpropagation and Approximate Inference in Deep Generative Models, ArXiv e-prints, 2014.

R. Jonschkowski and O. Brock, Learning state representations with robotic priors, Autonomous Robots, vol.39, issue.3, pp.407-428, 2015.

Jonschkowski et al., PVEs: Position-Velocity Encoders for Unsupervised Learning of Structured State Representations, 2017.

S. Karakovskiy and J. Togelius, The Mario AI Benchmark and Competitions, IEEE Transactions on Computational Intelligence and AI in Games, vol.4, issue.1, pp.55-67, 2012.
DOI : 10.1109/TCIAIG.2012.2188528

Karl et al., Deep Variational Bayes Filters: Unsupervised Learning of State Space Models from Raw Data, 2016.

D. P. Kingma and M. Welling, Auto-Encoding Variational Bayes, 2013.

Klyubin et al., Empowerment: A Universal Agent-Centric Measure of Control, 2005 IEEE Congress on Evolutionary Computation, pp.128-135, 2005.
DOI : 10.1109/CEC.2005.1554676

Kompella et al., Incremental slow feature analysis: Adaptive and episodic learning from high-dimensional input streams, 2011.

Krishnan et al., Deep Kalman Filters, 2015.

Lake et al., Building machines that learn and think like people, Behavioral and Brain Sciences, 2016.

Lesort et al., Unsupervised state representation learning with robotic priors: a robustness benchmark, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01644423

Lillicrap et al., Continuous control with deep reinforcement learning, 2015.

Z. C. Lipton, The Mythos of Model Interpretability, 2016.

I. Magrans de Abril and R. Kanai, Curiosity-driven reinforcement learning with homeostatic regulation, 2018.

Mattner et al., Learn to Swing Up and Balance a Real Pole Based on Raw Visual Input Data, Neural Information Processing - 19th International Conference, Proceedings, Part V, pp.126-133, 2012.
DOI : 10.1007/978-3-642-34500-5_16

Mnih et al., Strategic attentive writer for learning macro-actions, 2016.

Mnih et al., Human-level control through deep reinforcement learning, Nature, vol.518, issue.7540, pp.529-533, 2015.
DOI : 10.1038/nature14236

Mouret et al., Crossing the reality gap: a short introduction to the transferability approach, 1307.
URL : https://hal.archives-ouvertes.fr/hal-01300706

Munk et al., Learning state representation for deep actor-critic control, 2016 IEEE 55th Conference on Decision and Control (CDC), pp.4667-4673, 2016.
DOI : 10.1109/CDC.2016.7798980

Oh et al., Value Prediction Network, 2017.

Oudeyer et al., Intrinsic Motivation Systems for Autonomous Mental Development, IEEE Transactions on Evolutionary Computation, vol.11, issue.2, pp.265-286, 2007.
DOI : 10.1109/TEVC.2006.890271

Parisi et al., Goal-driven dimensionality reduction for reinforcement learning, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017.
DOI : 10.1109/IROS.2017.8206334

Pathak et al., Curiosity-driven exploration by self-supervised prediction, ICML, 2017.

Pinto et al., The Curious Robot: Learning Visual Representations via Physical Interactions, 2016.

Péré et al., Unsupervised learning of goal spaces for intrinsically motivated goal exploration, 2018.

Rusu et al., Progressive neural networks, CoRR, abs/1606.04671, 2016.

Sermanet et al., Time-Contrastive Networks: Self-Supervised Learning from Multi-view Observation, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2017.
DOI : 10.1109/CVPRW.2017.69

Shelhamer et al., Loss is its own reward: Self-supervision for reinforcement learning, arXiv preprint, 2017.

F. Stulp and O. Sigaud, Robot Skill Learning: From Reinforcement Learning to Evolution Strategies, Paladyn, Journal of Behavioral Robotics, vol.4, issue.1, pp.49-61, 2013.
DOI : 10.2478/pjbr-2013-0003

R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, MIT Press, 1998.

Tassa et al., DeepMind Control Suite, 2018.

Thomas et al., 2017.

H. van Hoof et al., Stable reinforcement learning with autoencoders for tactile and visual data, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp.3928-3934, 2016.

Vincent et al., Extracting and composing robust features with denoising autoencoders, Proceedings of the 25th International Conference on Machine Learning, ICML '08, pp.1096-1103, 2008.
DOI : 10.1145/1390156.1390294

Vincent et al., Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion, Journal of Machine Learning Research, vol.11, pp.3371-3408, 2010.

Wahlström et al., From Pixels to Torques: Policy Learning with Deep Dynamical Models, 2015.

Wang et al., A Folded Neural Network Autoencoder for Dimensionality Reduction, INNS-WC, volume 13 of Procedia Computer Science, pp.120-127, 2012.
DOI : 10.1016/j.procs.2012.09.120

Wang et al., Auto-encoder based dimensionality reduction, Neurocomputing, vol.184, issue.C, pp.232-242, 2016.
DOI : 10.1016/j.neucom.2015.08.104

Watter et al., Embed to control: A locally linear latent dynamics model for control from raw images, Advances in Neural Information Processing Systems 28, pp.2746-2754, 2015.

L. Wiskott and T. J. Sejnowski, Slow Feature Analysis: Unsupervised Learning of Invariances, Neural Computation, vol.14, issue.4, pp.715-770, 2002.
DOI : 10.1162/089976602317318938
URL : http://papers.cnl.salk.edu/PDFs/Slow%20Feature%20Analysis_%20Unsupervised%20Learning%20of%20Invariances%202002-3430.pdf

Yang et al., Deep multimodal representation learning from temporal data, 2017.

Zhang et al., Decoupling dynamics and reward for transfer learning, Proceedings of the 6th International Conference on Learning Representations (ICLR) workshops, 2018.

Zhang et al., A new embedding quality assessment method for manifold learning, Neurocomputing, vol.97, pp.251-266, 2012.
DOI : 10.1016/j.neucom.2012.05.013