Win Rates and Visits in ELF

hyperpape
Tengen
Posts: 4382
Joined: Thu May 06, 2010 3:24 pm
Rank: AGA 3k
GD Posts: 65
OGS: Hyperpape 4k
Location: Caldas da Rainha, Portugal
Has thanked: 499 times
Been thanked: 727 times

Post by hyperpape »

In MCTS, you expect that given adequate time, the move with the highest win percentage will also have the most visits. Is this also true for ELF? If so, how long would you expect it to take for the two to converge? I have failed to see it after a few thousand playouts, but I suppose that’s not that many.
Uberdude
Judan
Posts: 6727
Joined: Thu Nov 24, 2011 11:35 am
Rank: UK 4 dan
GD Posts: 0
KGS: Uberdude 4d
OGS: Uberdude 7d
Location: Cambridge, UK
Has thanked: 436 times
Been thanked: 3718 times

Re: Win Rates and Visits in ELF

Post by Uberdude »

A few thousand is tiny. If I want to do some serious analysis of a position I'll give Elf 50-100k playouts.
moha
Lives in gote
Posts: 311
Joined: Wed May 31, 2017 6:49 am
Rank: 2d
GD Posts: 0
Been thanked: 45 times

Re: Win Rates and Visits in ELF

Post by moha »

hyperpape wrote:In MCTS, you expect that given adequate time, the move with the highest win percentage will also have the most visits.
I don't think this is necessarily true. It holds only if the search has not found new moves at the upper levels (or important follow-ups below them) for a while, and there are no close candidates.

But generally, both visits and win rates will lag behind the latest knowledge, only in different ways: visits can only accumulate slowly, and a win rate can only change as new results are averaged into it.
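
To put a rough number on the averaging lag, here is a minimal sketch. It assumes a node's win rate is stored as a plain running mean of the evaluations backed up through it, which is a simplification and not a claim about ELF's internals. The function name and the figures (1000 old visits at 50%, new results all worth 70%) are made up for illustration.

```python
from fractions import Fraction


def visits_until_mean_reaches(n_old, mean_old, new_value, target):
    """Count the extra visits needed for a running mean to reach `target`.

    The node starts with `n_old` visits averaging `mean_old`; every later
    visit backs up `new_value`. Fractions keep the arithmetic exact.
    """
    if mean_old >= target:
        return 0
    if new_value <= target:
        raise ValueError("the mean can never reach the target")
    total = n_old * mean_old
    n = n_old
    extra = 0
    while total / n < target:
        total += new_value
        n += 1
        extra += 1
    return extra


# Hypothetical numbers: 1000 visits at 50%, then the search finds a
# follow-up that makes every new evaluation 70%.
needed = visits_until_mean_reaches(
    1000, Fraction("0.5"), Fraction("0.7"), Fraction("0.6")
)
print(needed)  # 1000
```

So under these assumptions the displayed win rate is only halfway to the new value after the node's visit count has doubled.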
hyperpape

Re: Win Rates and Visits in ELF

Post by hyperpape »

Good point, moha. In this case, however, it's a relatively quiet opening position, with no major changes in the win percentages for each move as the simulations proceed. I'll try to post examples soon.

Uberdude: I'm aware it's a smallish simulation, but this is a "low level" property. By that, I mean I'm not asking "why isn't the engine seeing a particular move or evaluating things in the right way?" What I'm asking is "why isn't it adequately exploring the move it gives the highest percentage chance?" I would think it would not have required that many simulations. Maybe I need to work some examples with the formula from Remi's paper to get a better intuition for how fast a move will be explored.
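
A minimal sketch of that kind of worked example follows. It assumes an AlphaZero-style PUCT selection rule, score = Q + c_puct * P * sqrt(N_total + 1) / (1 + N), which may differ from the formula in the paper mentioned above and from what ELF actually does. The priors, values, and c_puct constant are hypothetical, each move's win rate is held fixed instead of coming from real playouts, and the two priors do not sum to 1 because the other candidate moves are left out.

```python
import math


def simulate_puct(priors, values, simulations, c_puct=1.5):
    """Return root visit counts after `simulations` greedy PUCT selections.

    Assumption: each move's value (win rate) is constant, so only the
    exploration term changes as visits accumulate.
    """
    visits = [0] * len(priors)
    for total in range(simulations):
        explore = c_puct * math.sqrt(total + 1)
        scores = [
            values[i] + explore * priors[i] / (1 + visits[i])
            for i in range(len(priors))
        ]
        best = scores.index(max(scores))
        visits[best] += 1
    return visits


# Move 0: the network's favourite (high prior), win rate 50%.
# Move 1: low prior, but a slightly higher win rate of 52%.
priors = [0.60, 0.05]
values = [0.50, 0.52]

early = simulate_puct(priors, values, 2_000)
late = simulate_puct(priors, values, 20_000)
print("after  2,000 simulations:", early)
print("after 20,000 simulations:", late)
```

In this toy setup the move with the higher win rate is still behind in visits after 2,000 simulations and only ahead after 20,000. With a small win-rate gap and a low prior, the prior-weighted exploration term keeps favouring the other move for a long time, so a few thousand playouts not being enough is at least plausible under this rule.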