Adaptative play in texas hold'em poker - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2008

Adaptative play in texas hold'em poker

Résumé

We present a Texas Hold'em poker player for limit heads-up games. Our bot is designed to adapt automatically to the strategy of the opponent and is not based on Nash equilibrium computation. The main idea is to design a bot that builds beliefs on his opponent's hand. A forest of game trees is generated according to those beliefs and the solutions of the trees are combined to make the best decision. The beliefs are updated during the game according to several methods, each of which corresponding to a basic strategy. We then use an exploration-exploitation bandit algorithm, namely the UCB (Upper Confidence Bound), to select a strategy to follow. This results in a global play that takes into account the opponent's strategy, and which turns out to be rather unpredictable. Indeed, if a given strategy is exploited by an opponent, the UCB algorithm will detect it using change point detection, and will choose another one. The initial resulting program , called Brennus, participated to the AAAI'07 Computer Poker Competition in both online and equilibrium competition and ranked eight out of seventeen competitors.
Fichier principal
Vignette du fichier
poker_ecai08.pdf (268.57 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00830189 , version 1 (04-06-2013)

Identifiants

  • HAL Id : hal-00830189 , version 1

Citer

Raphael Maitrepierre, Jérémie Mary, Rémi Munos. Adaptative play in texas hold'em poker. European Conference on Artificial Intelligence, 2008, France. ⟨hal-00830189⟩
171 Consultations
481 Téléchargements

Partager

Gmail Facebook X LinkedIn More