Building Accurate Strategies in Non Markovian Environments without Memory

Abstract : This paper focuses on the study of the behavior of a genetic algorithm based classier system, the Adapted Pittsburgh Classier System (A.P.C.S), on maze type environments containing aliasing squares. This type of environment is often used in reinforcement learning literature to assess the performances of learning methods when facing problems containing non markovian situations. Through this study, we discuss on the performance of the APCS upon two mazes (Woods 101 and Maze E2) and also on the eciency of an improvement of the APCS learning method inspired from the XCS: the covering mechanism. We manage to show that, without any memory mechanism, the APCS is able to build and to keep accurate strategies to produce regular sub-optimal solution to these maze problems. This statement is shown through a comparison between the results obtained by the XCS on two specic maze problems and those obtained by the APCS.
Type de document :
Article dans une revue
Lecture notes in computer science, springer, 2010, IWLCS (6471), pp.107-126. 〈10.1007/978-3-642-17508-4_8〉
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-00542922
Contributeur : Mathias Peroumalnaïk <>
Soumis le : vendredi 3 décembre 2010 - 23:20:18
Dernière modification le : vendredi 3 décembre 2010 - 23:20:18

Identifiants

Collections

Citation

Enée Gilles, Mathias Peroumalnaïk. Building Accurate Strategies in Non Markovian Environments without Memory. Lecture notes in computer science, springer, 2010, IWLCS (6471), pp.107-126. 〈10.1007/978-3-642-17508-4_8〉. 〈hal-00542922〉

Partager

Métriques

Consultations de la notice

52