Conference paper. Year: 2004

Improving MACS thanks to a comparison with 2TBNs

Thierry Gourdin
Olivier Sigaud
Pierre-Henri Wuillemin

Abstract

Factored Markov Decision Processes are the theoretical framework underlying research on multi-step Learning Classifier Systems. This framework is mostly used in the context of Two-stage Bayes Networks, a subset of Bayes Networks. In this paper, we compare the Learning Classifier Systems approach and the Bayes Networks approach to factored Markov Decision Problems. More specifically, we compare MACS, an Anticipatory Learning Classifier System, with Structured Policy Iteration, a general planning algorithm used in the context of Two-stage Bayes Networks. From that comparison, we define a new algorithm that adapts Structured Policy Iteration to the context of MACS. We conclude by calling for closer communication between the two research communities.

Dates and versions

hal-01501406, version 1 (04-04-2017)

Identifiers

HAL Id: hal-01501406
DOI: 10.1007/978-3-540-24855-2_95

Cite

Thierry Gourdin, Olivier Sigaud, Pierre-Henri Wuillemin. Improving MACS thanks to a comparison with 2TBNs. GECCO 2004 - Genetic and Evolutionary Computation Conference, Jun 2004, Seattle, WA, United States. pp.810-823, ⟨10.1007/978-3-540-24855-2_95⟩. ⟨hal-01501406⟩