Elf v2 @ 800 playouts
-
Limeztone
- Dies in gote
- Posts: 63
- Joined: Sun Jan 12, 2020 9:28 pm
- GD Posts: 0
- Has thanked: 8 times
- Been thanked: 4 times
Elf v2 @ 800 playouts
What would the strength of the Elf-v2 weights in Leela-Zero-0.17 with 800 playouts be?
If I would estimate it to be about a 10 dan amateur would that be to high or to low?
If I would estimate it to be about a 10 dan amateur would that be to high or to low?
-
Yakago
- Dies in gote
- Posts: 53
- Joined: Tue Jan 16, 2018 10:39 am
- GD Posts: 0
- Has thanked: 2 times
- Been thanked: 12 times
Re: Elf v2 @ 800 playouts
Perhaps you could look at CGOS and find a bot/rating that you found to be equivalent to a 7d/8d/9d amateur and then compare to the ELFv2 p800 bot
http://www.yss-aya.com/cgos/19x19/cross ... _p800.html
http://www.yss-aya.com/cgos/19x19/cross ... _p800.html
-
Limeztone
- Dies in gote
- Posts: 63
- Joined: Sun Jan 12, 2020 9:28 pm
- GD Posts: 0
- Has thanked: 8 times
- Been thanked: 4 times
Re: Elf v2 @ 800 playouts
Thanks, but the problem is to estimate the strength of any botYakago wrote:Perhaps you could look at CGOS and find a bot/rating that you found to be equivalent to a 7d/8d/9d amateur and then compare to the ELFv2 p800 bot
http://www.yss-aya.com/cgos/19x19/cross ... _p800.html
As the ELFv2 p800 bot is fairly well known I found that to be a good starting point.
I will be happy to hear what every one think of the ELFv2 p800 strength.
-
splee99
- Dies with sente
- Posts: 101
- Joined: Thu Nov 15, 2012 9:46 pm
- Rank: KGS 2 D
- GD Posts: 0
- Has thanked: 2 times
- Been thanked: 16 times
Re: Elf v2 @ 800 playouts
Well I never heard any amateur 10 dan rank. We think we can safely put it into the professional rank range. However, it may be hard to invite any professionals to do any match with this bot. Just a wild guess, 1600p maybe around 9p.
-
Limeztone
- Dies in gote
- Posts: 63
- Joined: Sun Jan 12, 2020 9:28 pm
- GD Posts: 0
- Has thanked: 8 times
- Been thanked: 4 times
Re: Elf v2 @ 800 playouts
Thanks very much for your answer!splee99 wrote:Well I never heard any amateur 10 dan rank. We think we can safely put it into the professional rank range. However, it may be hard to invite any professionals to do any match with this bot. Just a wild guess, 1600p maybe around 9p.
And how much weaker would you place the 800p instance?
And no, amateur usually turn pro before they reach 10 dan
But the pro scale is a whole different thing.
-
Yakago
- Dies in gote
- Posts: 53
- Joined: Tue Jan 16, 2018 10:39 am
- GD Posts: 0
- Has thanked: 2 times
- Been thanked: 12 times
Re: Elf v2 @ 800 playouts
I've had the opportunity to play KataGo 1.2 20 block with around 1600playouts (I manually placed stones on the board) against 2 strong amateurs at the level of, or slightly above european pros. The games were played without time constraints on the humans, although it was a casual setting so they didn't sit and think for hours.
Those games were very one-sided and both players lost by +30 pts.
Elfv2 is about the same as KataGo 1.2 at equal playouts, so Elf v2 at 800 playouts is probably a bit weaker.
Some tests suggests that doubling/halving the playouts is about 100 elo difference.
I don't know on which scale your '10d amateur' is to be placed. Fox/Tygem 10d? European 10d? Aga 10d? Korean 10d? There are many possibilities
Those games were very one-sided and both players lost by +30 pts.
Elfv2 is about the same as KataGo 1.2 at equal playouts, so Elf v2 at 800 playouts is probably a bit weaker.
Some tests suggests that doubling/halving the playouts is about 100 elo difference.
I don't know on which scale your '10d amateur' is to be placed. Fox/Tygem 10d? European 10d? Aga 10d? Korean 10d? There are many possibilities
-
Limeztone
- Dies in gote
- Posts: 63
- Joined: Sun Jan 12, 2020 9:28 pm
- GD Posts: 0
- Has thanked: 8 times
- Been thanked: 4 times
Re: Elf v2 @ 800 playouts
Lets say European 10 dan.Yakago wrote: I don't know on which scale your '10d amateur' is to be placed. Fox/Tygem 10d? European 10d? Aga 10d? Korean 10d? There are many possibilities
-
Yakago
- Dies in gote
- Posts: 53
- Joined: Tue Jan 16, 2018 10:39 am
- GD Posts: 0
- Has thanked: 2 times
- Been thanked: 12 times
Re: Elf v2 @ 800 playouts
10d European would imply a player that wins 90%+ against the players I mentioned (I think)
Yeah that's probably not too far off for Elfv2 p800. maybe stronger than 10d.
Yeah that's probably not too far off for Elfv2 p800. maybe stronger than 10d.
-
iopq
- Dies with sente
- Posts: 113
- Joined: Wed Feb 27, 2019 11:19 am
- Rank: 1d
- GD Posts: 0
- Universal go server handle: iopq
- Has thanked: 11 times
- Been thanked: 27 times
Re: Elf v2 @ 800 playouts
It means wins vs. 8d at 2 stones handicap.Yakago wrote:10d European would imply a player that wins 90%+ against the players I mentioned (I think)
Yeah that's probably not too far off for Elfv2 p800. maybe stronger than 10d.
Which ELF easily does even on p800. But it would probably deteriorate at 3+ handicap
-
Yakago
- Dies in gote
- Posts: 53
- Joined: Tue Jan 16, 2018 10:39 am
- GD Posts: 0
- Has thanked: 2 times
- Been thanked: 12 times
Re: Elf v2 @ 800 playouts
Yes - and because ELF is not particularly good at handicap compared to its strength, I wanted to avoid that comparison here.
I would like to compare in terms of ELO rather than amount of handicap stones I suppose, even though that becomes weird if someone wants to compare to perfect play
I would like to compare in terms of ELO rather than amount of handicap stones I suppose, even though that becomes weird if someone wants to compare to perfect play
Re: Elf v2 @ 800 playouts
Wins half at 2 EXTRA stones handicap (H3 giving komi).iopq wrote:It means wins vs. 8d at 2 stones handicap.
I doubt these older bots can do meaningful H3 games. And you cannot reliably measure stone scale without handi games (Elo is not enough for this).Which ELF easily does even on p800.
-
Yakago
- Dies in gote
- Posts: 53
- Joined: Tue Jan 16, 2018 10:39 am
- GD Posts: 0
- Has thanked: 2 times
- Been thanked: 12 times
Re: Elf v2 @ 800 playouts
Agreed. But it is not a given that we are measuring 'dan levels' in that way.And you cannot reliably measure stone scale without handi games (Elo is not enough for this).
Might as well be Elo, since that's for instance how EGF rating works, and European amateur dan level was specifically mentioned
Re: Elf v2 @ 800 playouts
I think EGF Elo based ranks just try to approximate the actual stone differences, also anchored by handi games. Those magic constants would not be determinable otherwise (at least I cannot imagine "dan levels" without stone scale).
-
Vargo
- Lives in gote
- Posts: 337
- Joined: Sat Aug 17, 2013 5:28 am
- GD Posts: 0
- Has thanked: 22 times
- Been thanked: 97 times
Re: Elf v2 @ 800 playouts
40 game test at time parity : Elfv2 @p800 v. Katago 1.3.1 (20b s1420_d3509)
Katago wins 34-6 (85% , no error, no duplicate game, both program always agree on the score, Katago always appear as B, because of the command -alternate)
According to this site, winning 15% would put Elfv2@p800 ~300Elo behind Katago (or not ?
)
PS. I had forgotten to change back the very high resign threshold for Katago, that's why its lost games are 300+ moves.
Katago wins 34-6 (85% , no error, no duplicate game, both program always agree on the score, Katago always appear as B, because of the command -alternate)
According to this site, winning 15% would put Elfv2@p800 ~300Elo behind Katago (or not ?