Human-level control through deep reinforcement learning, Nature, vol.101, issue.7540, pp.529-533, 2015. ,
DOI : 10.1016/S0004-3702(98)00023-X
End-to-end Training of Deep Visuomotor Policies, J. Mach. Learn. Res, vol.17, issue.1, pp.1334-1373, 2016. ,
Optimal control with learned local models: Application to dexterous manipulation, 2016 IEEE International Conference on Robotics and Automation (ICRA), pp.378-383 ,
DOI : 10.1109/ICRA.2016.7487156
Deep spatial autoencoders for visuomotor learning, 2016 IEEE International Conference on Robotics and Automation (ICRA), pp.512-519, 2016. ,
DOI : 10.1109/ICRA.2016.7487173
URL : http://arxiv.org/pdf/1509.06113
Continuous control with deep reinforcement learning, 1509. ,
Learning Continuous Control Policies by Stochastic Value Gradients, NIPS, pp.2944-2952, 2015. ,
Continuous Deep Q-Learning with Model-based Acceleration, ICML, ser. JMLR Workshop and Conference Proceedings, pp.2829-2838, 2016. ,
Deep visual foresight for planning robot motion, 2017 IEEE International Conference on Robotics and Automation (ICRA), 2016. ,
DOI : 10.1109/ICRA.2017.7989324
URL : http://arxiv.org/pdf/1610.00696
Outlier Detection Using Replicator Neural Networks, DaWaK, ser, pp.170-180, 2002. ,
DOI : 10.1007/3-540-46145-0_17
URL : http://www.act.cmis.csiro.au/rohanb/PAPERS/dawak02.pdf
Support Vector Method for Novelty Detection, NIPS, pp.582-588, 1999. ,
Noisy Reinforcements in reinforcement learning: some case studies based on gridworlds, WSEAS, pp.296-300, 2006. ,
Taming the Noise in Reinforcement Learning via Soft Updates, UAI, 2016. ,
Adaptive Control Processes: A Guided Tour, 1961. ,
DOI : 10.1515/9781400874668
Deep auto-encoder neural networks in reinforcement learning, The 2010 International Joint Conference on Neural Networks (IJCNN), pp.1-8, 2010. ,
DOI : 10.1109/IJCNN.2010.5596468
URL : http://ml.informatik.uni-freiburg.de/_media/publications/langeijcnn2010.pdf
Deterministic Policy Gradient Algorithms, ICML, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00938992
Deep Reinforcement Learning in Parameterized Action Space, CoRR, vol.abs, 1511. ,
Adam: A Method for Stochastic Optimization, ICLR, 2015. ,
Caffe, Proceedings of the ACM International Conference on Multimedia, MM '14, 2014. ,
DOI : 10.1145/2647868.2654889
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift, ICML, pp.448-456, 2015. ,