Elf v2 @ 800 playouts

For discussing go computing, software announcements, etc.
Post Reply
Limeztone
Dies in gote
Posts: 63
Joined: Sun Jan 12, 2020 9:28 pm
GD Posts: 0
Has thanked: 8 times
Been thanked: 4 times

Elf v2 @ 800 playouts

Post by Limeztone »

What would the strength of the Elf-v2 weights in Leela-Zero-0.17 with 800 playouts be?

If I would estimate it to be about a 10 dan amateur would that be to high or to low?
Yakago
Dies in gote
Posts: 53
Joined: Tue Jan 16, 2018 10:39 am
GD Posts: 0
Has thanked: 2 times
Been thanked: 12 times

Re: Elf v2 @ 800 playouts

Post by Yakago »

Perhaps you could look at CGOS and find a bot/rating that you found to be equivalent to a 7d/8d/9d amateur and then compare to the ELFv2 p800 bot
http://www.yss-aya.com/cgos/19x19/cross ... _p800.html
Limeztone
Dies in gote
Posts: 63
Joined: Sun Jan 12, 2020 9:28 pm
GD Posts: 0
Has thanked: 8 times
Been thanked: 4 times

Re: Elf v2 @ 800 playouts

Post by Limeztone »

Yakago wrote:Perhaps you could look at CGOS and find a bot/rating that you found to be equivalent to a 7d/8d/9d amateur and then compare to the ELFv2 p800 bot
http://www.yss-aya.com/cgos/19x19/cross ... _p800.html
Thanks, but the problem is to estimate the strength of any bot :-)
As the ELFv2 p800 bot is fairly well known I found that to be a good starting point.
I will be happy to hear what every one think of the ELFv2 p800 strength.
splee99
Dies with sente
Posts: 101
Joined: Thu Nov 15, 2012 9:46 pm
Rank: KGS 2 D
GD Posts: 0
Has thanked: 2 times
Been thanked: 16 times

Re: Elf v2 @ 800 playouts

Post by splee99 »

Well I never heard any amateur 10 dan rank. We think we can safely put it into the professional rank range. However, it may be hard to invite any professionals to do any match with this bot. Just a wild guess, 1600p maybe around 9p.
Limeztone
Dies in gote
Posts: 63
Joined: Sun Jan 12, 2020 9:28 pm
GD Posts: 0
Has thanked: 8 times
Been thanked: 4 times

Re: Elf v2 @ 800 playouts

Post by Limeztone »

splee99 wrote:Well I never heard any amateur 10 dan rank. We think we can safely put it into the professional rank range. However, it may be hard to invite any professionals to do any match with this bot. Just a wild guess, 1600p maybe around 9p.
Thanks very much for your answer!
And how much weaker would you place the 800p instance?

And no, amateur usually turn pro before they reach 10 dan :-)
But the pro scale is a whole different thing.
Yakago
Dies in gote
Posts: 53
Joined: Tue Jan 16, 2018 10:39 am
GD Posts: 0
Has thanked: 2 times
Been thanked: 12 times

Re: Elf v2 @ 800 playouts

Post by Yakago »

I've had the opportunity to play KataGo 1.2 20 block with around 1600playouts (I manually placed stones on the board) against 2 strong amateurs at the level of, or slightly above european pros. The games were played without time constraints on the humans, although it was a casual setting so they didn't sit and think for hours.

Those games were very one-sided and both players lost by +30 pts.

Elfv2 is about the same as KataGo 1.2 at equal playouts, so Elf v2 at 800 playouts is probably a bit weaker.

Some tests suggests that doubling/halving the playouts is about 100 elo difference.

I don't know on which scale your '10d amateur' is to be placed. Fox/Tygem 10d? European 10d? Aga 10d? Korean 10d? There are many possibilities
Limeztone
Dies in gote
Posts: 63
Joined: Sun Jan 12, 2020 9:28 pm
GD Posts: 0
Has thanked: 8 times
Been thanked: 4 times

Re: Elf v2 @ 800 playouts

Post by Limeztone »

Yakago wrote: I don't know on which scale your '10d amateur' is to be placed. Fox/Tygem 10d? European 10d? Aga 10d? Korean 10d? There are many possibilities
Lets say European 10 dan.
Yakago
Dies in gote
Posts: 53
Joined: Tue Jan 16, 2018 10:39 am
GD Posts: 0
Has thanked: 2 times
Been thanked: 12 times

Re: Elf v2 @ 800 playouts

Post by Yakago »

10d European would imply a player that wins 90%+ against the players I mentioned (I think)

Yeah that's probably not too far off for Elfv2 p800. maybe stronger than 10d.
iopq
Dies with sente
Posts: 113
Joined: Wed Feb 27, 2019 11:19 am
Rank: 1d
GD Posts: 0
Universal go server handle: iopq
Has thanked: 11 times
Been thanked: 27 times

Re: Elf v2 @ 800 playouts

Post by iopq »

Yakago wrote:10d European would imply a player that wins 90%+ against the players I mentioned (I think)

Yeah that's probably not too far off for Elfv2 p800. maybe stronger than 10d.
It means wins vs. 8d at 2 stones handicap.

Which ELF easily does even on p800. But it would probably deteriorate at 3+ handicap
Yakago
Dies in gote
Posts: 53
Joined: Tue Jan 16, 2018 10:39 am
GD Posts: 0
Has thanked: 2 times
Been thanked: 12 times

Re: Elf v2 @ 800 playouts

Post by Yakago »

Yes - and because ELF is not particularly good at handicap compared to its strength, I wanted to avoid that comparison here.

I would like to compare in terms of ELO rather than amount of handicap stones I suppose, even though that becomes weird if someone wants to compare to perfect play ;)
jann
Lives in gote
Posts: 445
Joined: Tue May 14, 2019 8:00 pm
GD Posts: 0
Been thanked: 37 times

Re: Elf v2 @ 800 playouts

Post by jann »

iopq wrote:It means wins vs. 8d at 2 stones handicap.
Wins half at 2 EXTRA stones handicap (H3 giving komi).
Which ELF easily does even on p800.
I doubt these older bots can do meaningful H3 games. And you cannot reliably measure stone scale without handi games (Elo is not enough for this).
Yakago
Dies in gote
Posts: 53
Joined: Tue Jan 16, 2018 10:39 am
GD Posts: 0
Has thanked: 2 times
Been thanked: 12 times

Re: Elf v2 @ 800 playouts

Post by Yakago »

And you cannot reliably measure stone scale without handi games (Elo is not enough for this).
Agreed. But it is not a given that we are measuring 'dan levels' in that way.

Might as well be Elo, since that's for instance how EGF rating works, and European amateur dan level was specifically mentioned
jann
Lives in gote
Posts: 445
Joined: Tue May 14, 2019 8:00 pm
GD Posts: 0
Been thanked: 37 times

Re: Elf v2 @ 800 playouts

Post by jann »

I think EGF Elo based ranks just try to approximate the actual stone differences, also anchored by handi games. Those magic constants would not be determinable otherwise (at least I cannot imagine "dan levels" without stone scale).
Vargo
Lives in gote
Posts: 337
Joined: Sat Aug 17, 2013 5:28 am
GD Posts: 0
Has thanked: 22 times
Been thanked: 97 times

Re: Elf v2 @ 800 playouts

Post by Vargo »

40 game test at time parity : Elfv2 @p800 v. Katago 1.3.1 (20b s1420_d3509)
Katago wins 34-6 (85% , no error, no duplicate game, both program always agree on the score, Katago always appear as B, because of the command -alternate)
According to this site, winning 15% would put Elfv2@p800 ~300Elo behind Katago (or not ? :) )
k1.jpg
k1.jpg (147.08 KiB) Viewed 8229 times
k2.jpg
k2.jpg (349.28 KiB) Viewed 8229 times
PS. I had forgotten to change back the very high resign threshold for Katago, that's why its lost games are 300+ moves.
Post Reply