Feature Discovery in Approximate Dynamic Programming

Philippe Preux; Sertan Girgin; Manuel Loth

Conference paper, year: 2009


Philippe Preux

Role: Author
PersonId: 5488
IdHAL: preux-philippe
IdRef: 059896353

Laboratoire d'Informatique Fondamentale de Lille

Sequential Learning

Sertan Girgin

Role: Author

Laboratoire d'Informatique Fondamentale de Lille

Sequential Learning

Manuel Loth

Role: Author

Laboratoire d'Informatique Fondamentale de Lille

Sequential Learning

Abstract

Feature discovery aims to find the best representation of data. It is an important topic in machine learning, and in reinforcement learning in particular. Building on our recent work on feature discovery in reinforcement learning, which seeks a good, if not the best, representation of states, we report here on the use of the same kind of approach in approximate dynamic programming. The striking difference from the usual approach is that we represent the value function with a nonparametric function approximator instead of a parametric one. We also argue that discovering the best state representation and approximating the value function are two sides of the same coin, and that a nonparametric approach provides an elegant solution to both problems at once.

Domains

Machine Learning [stat.ML]; Machine Learning [cs.LG]; Artificial Intelligence [cs.AI]; Neural Networks [cs.NE]; Optimization and Control [math.OC]; Other [stat.ML]; Other [cs.OH]


https://hal.science/hal-00351144

Submitted on: Thursday, January 8, 2009, 15:03:43

Last modified on: Monday, April 22, 2024, 14:20:35

Dates and versions

hal-00351144, version 1 (08-01-2009)

Identifiers

HAL Id: hal-00351144, version 1

Cite

Philippe Preux, Sertan Girgin, Manuel Loth. Feature Discovery in Approximate Dynamic Programming. Approximate Dynamic Programming and Reinforcement Learning, Mar 2009, Nashville, United States. ⟨hal-00351144⟩


Collections

UNIV-LILLE3 CNRS INRIA LIFL LAGIS INRIA2 TDS-MACS

