Building Accurate Strategies in Non Markovian Environments without Memory - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Lecture Notes in Computer Science Année : 2010

Building Accurate Strategies in Non Markovian Environments without Memory

Résumé

This paper focuses on the study of the behavior of a genetic algorithm based classier system, the Adapted Pittsburgh Classier System (A.P.C.S), on maze type environments containing aliasing squares. This type of environment is often used in reinforcement learning literature to assess the performances of learning methods when facing problems containing non markovian situations. Through this study, we discuss on the performance of the APCS upon two mazes (Woods 101 and Maze E2) and also on the eciency of an improvement of the APCS learning method inspired from the XCS: the covering mechanism. We manage to show that, without any memory mechanism, the APCS is able to build and to keep accurate strategies to produce regular sub-optimal solution to these maze problems. This statement is shown through a comparison between the results obtained by the XCS on two specic maze problems and those obtained by the APCS.

Dates et versions

hal-00542922 , version 1 (03-12-2010)

Identifiants

Citer

Enée Gilles, Mathias Peroumalnaïk. Building Accurate Strategies in Non Markovian Environments without Memory. Lecture Notes in Computer Science, 2010, IWLCS (6471), pp.107-126. ⟨10.1007/978-3-642-17508-4_8⟩. ⟨hal-00542922⟩

Collections

UNIV-AG LAMIA
61 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More