Exploiting homogeneity for the optimal control of discrete-time systems: application to value iteration - Centre de Recherche en Automatique de Nancy
Preprint, Working Paper. Year: 2021

Abstract

To investigate solutions of (near-)optimal control problems, we extend and exploit a notion of homogeneity recently proposed in the literature for discrete-time systems. Assuming the plant dynamics are homogeneous, we first derive a scaling property of the system's solutions along rays, provided the sequence of inputs is suitably modified. We then consider homogeneous cost functions and show how the optimal value function scales along rays. This result can be used to construct (near-)optimal inputs on the whole state space by solving the original problem only on a given compact manifold of smaller dimension. Compared to related works in the literature, we impose no conditions on the homogeneity degrees. We demonstrate the strength of this new result by presenting a new approximation scheme for value iteration, which is one of the pillars of dynamic programming. The new algorithm provides guaranteed lower and upper estimates of the true value function at every iteration and has several appealing features in terms of reduced computation. A numerical case study illustrates the proposed algorithm.
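The idea of running value iteration on a compact set and extending the result along rays can be illustrated on a deliberately simple case. The sketch below is not the algorithm of the paper: it assumes a scalar linear system x+ = a x + b u with quadratic stage cost q x^2 + r u^2, for which the value function is known to satisfy V(lambda x) = lambda^2 V(x) for lambda > 0. All names and numerical values (`A`, `B`, `Q`, `R`, `extend_along_rays`, `value_iteration_on_sphere`) are hypothetical choices made for illustration. Value iterates are stored only on the unit sphere {-1, +1} and compared against the classical Riccati recursion.

```python
# Illustrative assumption: scalar linear dynamics and quadratic cost,
# which are homogeneous with respect to the standard dilation x -> lambda * x.
A, B = 1.2, 1.0   # hypothetical plant parameters
Q, R = 1.0, 1.0   # hypothetical cost weights


def dynamics(x, u):
    return A * x + B * u


def stage_cost(x, u):
    return Q * x * x + R * u * u


def extend_along_rays(sphere_values, y):
    """Evaluate V(y) from values stored on {-1, +1} using V(y) = |y|^2 V(y/|y|)."""
    if y == 0.0:
        return 0.0
    return y * y * sphere_values[1.0 if y > 0 else -1.0]


def minimize_convex(g, lo, hi, iters=200):
    """Ternary search for the minimum value of a convex function on [lo, hi]."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if g(m1) <= g(m2):
            hi = m2
        else:
            lo = m1
    return g(0.5 * (lo + hi))


def value_iteration_on_sphere(n_iters, u_max=10.0):
    """Value iteration with V_0 = 0, storing iterates only at x = -1 and x = +1."""
    values = {1.0: 0.0, -1.0: 0.0}
    for _ in range(n_iters):
        # Bellman update at each sphere point; the next state is generally
        # off the sphere, so the previous iterate is evaluated via scaling.
        values = {
            s: minimize_convex(
                lambda u, s=s: stage_cost(s, u)
                + extend_along_rays(values, dynamics(s, u)),
                -u_max,
                u_max,
            )
            for s in values
        }
    return values


def riccati_iteration(n_iters):
    """Reference: V_k(x) = p_k x^2 with p_{k+1} = Q + A^2 R p_k / (R + B^2 p_k)."""
    p = 0.0
    for _ in range(n_iters):
        p = Q + A * A * R * p / (R + B * B * p)
    return p
```

In this special case the two sphere values coincide and match the Riccati coefficient, and `extend_along_rays(values, x)` recovers the iterate at any state `x`. The scheme described in the abstract is more general (arbitrary homogeneity degrees, guaranteed lower and upper estimates), which this sketch does not attempt to reproduce.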

Dates and versions

hal-03353317 , version 1 (24-09-2021)

Identifiers

Cite

Mathieu Granzotto, Romain Postoyan, Lucian Buşoniu, Dragan Nešić, Jamal Daafouz. Exploiting homogeneity for the optimal control of discrete-time systems: application to value iteration. 2021. ⟨hal-03353317⟩