Adaptative play in texas hold'em poker

Raphael Maitrepierre; Jérémie Mary; Rémi Munos

Communication Dans Un Congrès Année : 2008

Adaptative play in texas hold'em poker

(1) , (1, 2) , (1)

1
2

Raphael Maitrepierre

Fonction : Auteur

Sequential Learning

Jérémie Mary

Fonction : Auteur
PersonId : 740984
IdHAL : jeremie-mary

Sequential Learning

Laboratoire d'Informatique Fondamentale de Lille

Rémi Munos

Fonction : Auteur
PersonId : 836863

Sequential Learning

Résumé

We present a Texas Hold'em poker player for limit heads-up games. Our bot is designed to adapt automatically to the strategy of the opponent and is not based on Nash equilibrium computation. The main idea is to design a bot that builds beliefs on his opponent's hand. A forest of game trees is generated according to those beliefs and the solutions of the trees are combined to make the best decision. The beliefs are updated during the game according to several methods, each of which corresponding to a basic strategy. We then use an exploration-exploitation bandit algorithm, namely the UCB (Upper Confidence Bound), to select a strategy to follow. This results in a global play that takes into account the opponent's strategy, and which turns out to be rather unpredictable. Indeed, if a given strategy is exploited by an opponent, the UCB algorithm will detect it using change point detection, and will choose another one. The initial resulting program , called Brennus, participated to the AAAI'07 Computer Poker Competition in both online and equilibrium competition and ranked eight out of seventeen competitors.

Domaines

Apprentissage [cs.LG]

Fichier principal

poker_ecai08.pdf (268.57 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Rémi Munos : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00830189

Soumis le : mardi 4 juin 2013-15:24:54

Dernière modification le : vendredi 24 mars 2023-14:52:57

Archivage à long terme le : jeudi 5 septembre 2013-04:23:18

Dates et versions

hal-00830189 , version 1 (04-06-2013)

Identifiants

HAL Id : hal-00830189 , version 1

Citer

Raphael Maitrepierre, Jérémie Mary, Rémi Munos. Adaptative play in texas hold'em poker. European Conference on Artificial Intelligence, 2008, France. ⟨hal-00830189⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LILLE3 CNRS INRIA LIFL LAGIS INRIA2

171 Consultations

486 Téléchargements

Adaptative play in texas hold'em poker

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager