Dynamic Programming and Optimal Control, Athena Scientific, vol.2, 2012. ,
Stochastic Optimal Control: The Discrete Time Case, 1978. ,
Approximate dynamic programming with a fuzzy parameterization, Automatica, vol.46, issue.5, pp.804-814, 2010. ,
Controlling groups of mobile beamformers, Proceedings 51st IEEE Conference on Decision and Control (CDC), pp.1984-1989, 2012. ,
Robust control for mobility and wireless communication in cyber-physical systems with application to robot teams, Proceedings of the IEEE, vol.100, issue.1, pp.164-178, 2012. ,
Trajectory optimization for mobile access point, 51st Asilomar Conference on Signals, Systems, and Computers, pp.1412-1416, 2017. ,
Trajectory planning for energy-efficient vehicles with communications constraints, Proceedings 2016 International Conference on Wireless Networks and Mobile Communications (WINCOM16), pp.264-270, 2016. ,
Robust trajectory planning for robotic communications under fading channels, Ubiquitous Networking: Third International Symposium, vol.10542, p.450, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01745270
Self-improving reactive agents based on reinforcement learning, planning and teaching, Machine Learning, vol.8, issue.3-4, pp.293-321, 1992. ,
Human-level control through deep reinforcement learning, Nature, vol.518, pp.529-533, 2015. ,
Prioritized sweeping: Reinforcement learning with less data and less time, Machine Learning, vol.13, pp.103-130, 1993. ,
Consensus and cooperation in networked multi-agent systems, Proceedings of the IEEE, vol.95, issue.1, pp.215-233, 2007. ,
Minimal energy path planning for wireless robots, Mobile Networks and Applications, vol.14, issue.3, pp.309-321, 2009. ,
Enhanced cell-edge performance with transmit power-shaping and multipoint, multiflow techniques, ZTE Communications, issue.4, 2011. ,
Multi-robot exploration under the constraints of wireless networking, Control Engineering Practice, vol.15, issue.4, pp.435-445, 2007. ,
Integrated architectures for learning, planning, and reacting based on approximating dynamic programming, Proceedings 7th International Conference on Machine Learning (ICML-90), pp.216-224, 1990. ,
, ser. Adaptive Computation and Machine Learning. A, 2018.
Q-learning, Machine Learning, vol.8, pp.279-292, 1992. ,
Co-optimization of communication and motion planning of a robotic operation under resource constraints and in fading environments, IEEE Transactions on Wireless Communications, vol.12, issue.4, pp.1562-1572, 2013. ,