Journal articles

Building Accurate Strategies in Non Markovian Environments without Memory

Abstract : This paper studies the behavior of a genetic-algorithm-based classifier system, the Adapted Pittsburgh Classifier System (APCS), on maze-type environments containing aliasing squares. This type of environment is often used in the reinforcement learning literature to assess the performance of learning methods on problems containing non-Markovian situations. Through this study, we discuss the performance of the APCS on two mazes (Woods 101 and Maze E2) and the efficiency of an improvement to the APCS learning method inspired by the XCS: the covering mechanism. We show that, without any memory mechanism, the APCS is able to build and keep accurate strategies to produce regular sub-optimal solutions to these maze problems. This is shown through a comparison between the results obtained by the XCS on two specific maze problems and those obtained by the APCS.
https://hal.archives-ouvertes.fr/hal-00542922
Contributor : Mathias Peroumalnaïk
Submitted on : Friday, December 3, 2010 - 11:20:18 PM
Last modification on : Wednesday, December 19, 2018 - 1:08:13 PM

Citation

Enée Gilles, Mathias Peroumalnaïk. Building Accurate Strategies in Non Markovian Environments without Memory. Lecture Notes in Computer Science, Springer, 2010, IWLCS (6471), pp.107-126. ⟨10.1007/978-3-642-17508-4_8⟩. ⟨hal-00542922⟩
