Online policy iterations for optimal control of input-saturated systems - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

Online policy iterations for optimal control of input-saturated systems

Résumé

This work proposes an online policy iteration procedure for the synthesis of sub-optimal control laws for uncertain Linear Time Invariant (LTI) Asymptotically Null-Controllable with Bounded Inputs (ANCBI) systems. The proposed policy iteration method relies on: a policy evaluation step with a piecewise quadratic Lyapunov function in both the state and the deadzone functions of the input signals; a policy improvement step which guarantees at the same time close to optimality (exploitation) and persistence of excitation (exploration). The proposed approach guarantees convergence of the trajectory to a neighborhood around the origin. Besides, the trajectories can be made arbitrarily close to the optimal one provided that the rate at which the the value function and the control policy are updated is fast enough. The solution to the inequalities required to hold at each policy evaluation step can be efficiently implemented with semidefinite programming (SDP) solvers. A numerical example illustrates the results.
Fichier principal
Vignette du fichier
valmorbidaSat_resub_ACC3.pdf (138.42 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01710299 , version 1 (13-04-2020)

Identifiants

Citer

Simone Baldi, Giorgio Valmorbida, Antonis Papachristodoulou, Elias Kosmatopoulos. Online policy iterations for optimal control of input-saturated systems. 2016 American Control Conference (ACC), Jul 2016, Boston, United States. ⟨10.1109/ACC.2016.7526568⟩. ⟨hal-01710299⟩
49 Consultations
53 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More