Exploiting homogeneity for the optimal control of discrete-time systems: application to value iteration

Mathieu Granzotto; Romain Postoyan; Lucian Buşoniu; Dragan Nešić; Jamal Daafouz

Preprint, Working Paper. Year: 2021

Mathieu Granzotto

Role: Author
PersonId: 1036621
IdRef: 243009380

Centre de Recherche en Automatique de Nancy

Romain Postoyan

Role: Author
PersonId: 845
IdHAL: romain-postoyan
ORCID: 0000-0002-2454-602X
IdRef: 13970535X

Centre de Recherche en Automatique de Nancy

Lucian Buşoniu

Role: Author

Universitatea Tehnica din Cluj-Napoca

Dragan Nešić

Role: Author

Department of Electrical and Electronic Engineering [Melbourne]

Jamal Daafouz

Role: Author
PersonId: 936663
ORCID: 0000-0001-8313-8790
IdRef: 122546008

Centre de Recherche en Automatique de Nancy

Abstract

To investigate solutions of (near-)optimal control problems, we extend and exploit a notion of homogeneity recently proposed in the literature for discrete-time systems. Assuming the plant dynamics are homogeneous, we first derive a scaling property of its solutions along rays, provided the sequence of inputs is suitably modified. We then consider homogeneous cost functions and reveal how the optimal value function scales along rays. This result can be used to construct (near-)optimal inputs on the whole state space by solving the original problem only on a given compact manifold of smaller dimension. Compared to related work in the literature, we impose no conditions on the homogeneity degrees. We demonstrate the strength of this new result by presenting a new approximation scheme for value iteration, which is one of the pillars of dynamic programming. The new algorithm provides guaranteed lower and upper estimates of the true value function at any iteration and has several appealing features in terms of reduced computation. A numerical case study is provided to illustrate the proposed algorithm.
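The scaling idea in the abstract can be illustrated with a textbook special case. The sketch below is not the paper's algorithm; it assumes scalar linear dynamics `x+ = a*x + b*u` with quadratic stage cost `q*x**2 + r*u**2`, for which every value-iteration iterate is known to have the form `V_k(x) = p_k * x**2`. All function names and parameter values are hypothetical, chosen only for illustration. In this setting the iterates satisfy `V_k(lam * x) = lam**2 * V_k(x)`, so it is enough to run value iteration at `x = 1` and scale the result.

```python
# Illustrative special case only (assumed linear-quadratic setting, not the
# paper's general homogeneous setting): x+ = a*x + b*u, cost q*x^2 + r*u^2.


def riccati_step(p, a, b, q, r):
    """One value-iteration step on the coefficient p of V(x) = p * x**2."""
    return q + a * a * p - (a * b * p) ** 2 / (r + b * b * p)


def value_on_sphere(a, b, q, r, iterations):
    """Run value iteration only at x = 1, starting from V_0 = 0."""
    p = 0.0
    for _ in range(iterations):
        p = riccati_step(p, a, b, q, r)
    return p  # this is V_k(1)


def value_by_scaling(v_sphere, x):
    """Extend a value known at x = 1 to any x via V(lam * x) = lam**2 * V(x)."""
    return x * x * v_sphere


def bellman_backup(x, p, a, b, q, r):
    """Evaluate min over u of q*x^2 + r*u^2 + p*(a*x + b*u)^2 at state x."""
    u = -a * b * p * x / (r + b * b * p)  # closed-form minimizer
    x_next = a * x + b * u
    return q * x * x + r * u * u + p * x_next * x_next


if __name__ == "__main__":
    a, b, q, r = 1.2, 1.0, 1.0, 1.0  # hypothetical parameters
    p = value_on_sphere(a, b, q, r, iterations=60)
    direct = bellman_backup(3.0, p, a, b, q, r)
    scaled = value_by_scaling(bellman_backup(1.0, p, a, b, q, r), 3.0)
    print(direct, scaled)  # the two values agree up to rounding
```

Here the "compact manifold of smaller dimension" reduces to the single point `x = 1` (together with `x = -1` by symmetry); the setting in the abstract is more general and does not restrict the homogeneity degrees.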

Subject areas

Optimization and Control [math.OC]

Contributor: Mathieu Granzotto

https://hal.science/hal-03353317

Submitted on: Friday, September 24, 2021, 07:07:36

Last modified on: Monday, April 15, 2024, 18:10:38

Dates and versions

hal-03353317, version 1 (24-09-2021)

Identifiers

HAL Id: hal-03353317, version 1
arXiv: 2109.11088

Cite

Mathieu Granzotto, Romain Postoyan, Lucian Buşoniu, Dragan Nešić, Jamal Daafouz. Exploiting homogeneity for the optimal control of discrete-time systems: application to value iteration. 2021. ⟨hal-03353317⟩

Collections

CNRS CRAN UNIV-LORRAINE TDS-MACS
