Exploration exploitation in Go: UCT for Monte-Carlo Go - Archive ouverte HAL Access content directly
Conference Papers Year : 2006

Exploration exploitation in Go: UCT for Monte-Carlo Go

Abstract

Algorithm UCB1 for multi-armed bandit problem has already been extended to Algorithm UCT which works for minimax tree search. We have developed a Monte-Carlo program, MoGo, which is the first computer Go program using UCT. We explain our modifications of UCT for Go application, among which efficient memory management, parametrization, ordering of non-visited nodes and parallelization. MoGo is now a top-level Computer-Go program on 9 x 9 Go board.
Fichier principal
Vignette du fichier
nips_exploration_exploitation.pdf (100.05 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-00115330 , version 1 (21-11-2006)

Identifiers

  • HAL Id : hal-00115330 , version 1

Cite

Sylvain Gelly, Yizao Wang. Exploration exploitation in Go: UCT for Monte-Carlo Go. NIPS: Neural Information Processing Systems Conference On-line trading of Exploration and Exploitation Workshop, Dec 2006, Canada. ⟨hal-00115330⟩
1770 View
2793 Download

Share

Gmail Facebook X LinkedIn More