Engine Tournament
-
q30
- Lives with ko
- Posts: 145
- Joined: Sat Aug 13, 2016 8:23 am
- Rank: 30 kyu
- GD Posts: 0
- Has thanked: 1 time
- Been thanked: 1 time
Re: Engine Tournament
The rating of Phoenix LeelaZero versions (details):
1) 0.16
2) lz_v0.33(0.15)
3) lz_v0.32(0.15)
4) lz_orig(0.15)
5) lizzie(0.15)
6) 0.14
But I don't understand, what is the reason of all these versions creating without the neuronet updating...
1) 0.16
2) lz_v0.33(0.15)
3) lz_v0.32(0.15)
4) lz_orig(0.15)
5) lizzie(0.15)
6) 0.14
But I don't understand, what is the reason of all these versions creating without the neuronet updating...
- spook
- Lives with ko
- Posts: 151
- Joined: Thu Jul 24, 2014 1:34 pm
- Rank: 2d
- GD Posts: 0
- KGS: LordVader
- Location: Belgium
- Has thanked: 11 times
- Been thanked: 48 times
- Contact:
Re: Engine Tournament
I never have problems, only challenges.q30 wrote:Yes, the description is the same and the lists are different (the list was updated: was added Zenith and was removed Ray(RLO) because of about sense). What is the problem
It wasn't obvious, but now it is. Thanks.
Enjoy LeeLaZero and KataGo from your webbrowser, without installing anything !
https://www.zbaduk.com
https://www.zbaduk.com
-
as0770
- Lives with ko
- Posts: 180
- Joined: Sun Jun 26, 2016 8:07 am
- Rank: Beginner
- GD Posts: 0
- Has thanked: 15 times
- Been thanked: 23 times
Re: Engine Tournament
We did not talk about Phoenix but about the significance of the results you posted. You pervert the facts once again. I just don't know if you do it on purpose or ignorantly.q30 wrote:This is really ridiculous, how You read posts...
Where You had found in the post by this link word "Phoenix" (or "Феникс")?!
-
Uberdude
- Judan
- Posts: 6727
- Joined: Thu Nov 24, 2011 11:35 am
- Rank: UK 4 dan
- GD Posts: 0
- KGS: Uberdude 4d
- OGS: Uberdude 7d
- Location: Cambridge, UK
- Has thanked: 436 times
- Been thanked: 3718 times
Re: Engine Tournament
as0770, why not ignore this thread instead of engaging in these pointless arguments?
-
as0770
- Lives with ko
- Posts: 180
- Joined: Sun Jun 26, 2016 8:07 am
- Rank: Beginner
- GD Posts: 0
- Has thanked: 15 times
- Been thanked: 23 times
Re: Engine Tournament
Oh well, I tried to delete my account here but I failed. So I will go on answering if someone tells nonsense. But feel free to delete my account.Uberdude wrote:as0770, why not ignore this thread instead of engaging in these pointless arguments?
-
AloneAgainstAll
- Lives with ko
- Posts: 127
- Joined: Thu May 16, 2019 10:16 am
- Rank: KGS 1d
- GD Posts: 0
- Has thanked: 2 times
- Been thanked: 21 times
Re: Engine Tournament
3 nov 2018
25 jul 2019
as0770 wrote:
I promise this is my last post in this thread...
25 jul 2019
I see strong contradiction here.as0770 wrote:
We did not talk about Phoenix but about the significance of the results you posted. You pervert the facts once again. I just don't know if you do it on purpose or ignorantly.
-
as0770
- Lives with ko
- Posts: 180
- Joined: Sun Jun 26, 2016 8:07 am
- Rank: Beginner
- GD Posts: 0
- Has thanked: 15 times
- Been thanked: 23 times
Re: Engine Tournament
Congrats, you found contradiction in a public forum.AloneAgainstAll wrote:3 nov 2018as0770 wrote:
I promise this is my last post in this thread...
25 jul 2019I see strong contradiction here.as0770 wrote:
We did not talk about Phoenix but about the significance of the results you posted. You pervert the facts once again. I just don't know if you do it on purpose or ignorantly.
I know I shouldn't read posts by blacklisted people, but now and then I forget my deliberate intention and read them because the forum software don't allow to blacklist someone completely. And then I feel this inner constraint and have to answer to the nonsense.
BTW I ignored it for a long time, but since I started this thread this guy wrote deprecative comments to me. I could live with it if there would be at least some basic expertise. But neither there is some know how, nor he understands english.
So after all I think it is OK to answer him once in a half year. Judge yourself if it is OK to blame me for that.
-
q30
- Lives with ko
- Posts: 145
- Joined: Sat Aug 13, 2016 8:23 am
- Rank: 30 kyu
- GD Posts: 0
- Has thanked: 1 time
- Been thanked: 1 time
Re: Engine Tournament
I had thought, that my post has problem...spook wrote:I never have problems, only challenges. :)q30 wrote:Yes, the description is the same and the lists are different (the list was updated: was added Zenith and was removed Ray(RLO) because of about sense). What is the problem
It wasn't obvious, but now it is. Thanks.
-
q30
- Lives with ko
- Posts: 145
- Joined: Sat Aug 13, 2016 8:23 am
- Rank: 30 kyu
- GD Posts: 0
- Has thanked: 1 time
- Been thanked: 1 time
Re: Engine Tournament
I don't know, what abstract significance You had told about, but I had told specifically about the significance in first case of original LeelaZero, when there was small overweight in the account in a match with small number of games, but big number of playouts,that was confirmed in a match with big number of games, but small number of playouts, and about the significance in second case of LeelaZero Phoenix (that post was about what), when there was big overweight in the account in a match with small number of games, but big number of playouts (and there wasn't any additional matches).as0770 wrote:We did not talk about Phoenix but about the significance of the results you posted. You pervert the facts once again. I just don't know if you do it on purpose or ignorantly.q30 wrote:This is really ridiculous, how You read posts...
Where You had found in the post by this link word "Phoenix" (or "Феникс")?!
So there wasn't any fact perversion, but was Your inattentive reading of posts...
-
q30
- Lives with ko
- Posts: 145
- Joined: Sat Aug 13, 2016 8:23 am
- Rank: 30 kyu
- GD Posts: 0
- Has thanked: 1 time
- Been thanked: 1 time
Re: Engine Tournament
The rating of original (weights file format) LeelaZero versions (details):
1) 0.17
2) lz_next_190222(0.16)
3&4) 0.16&0.14
5) 0.15(all)
The only 0.17 version is significantly stronger than previous.
One "trick": for getting not only the LeelaZero version, but and used weights file, replace in GTP.cpp the stroke by the one before engine compiling.
1) 0.17
2) lz_next_190222(0.16)
3&4) 0.16&0.14
5) 0.15(all)
The only 0.17 version is significantly stronger than previous.
One "trick": for getting not only the LeelaZero version, but and used weights file, replace in GTP.cpp the stroke
Code: Select all
gtp_printf(id, PROGRAM_VERSION);Code: Select all
gtp_printf(id, ' %s + %s', PROGRAM_VERSION, cfg_weightsfile.c_str());-
as0770
- Lives with ko
- Posts: 180
- Joined: Sun Jun 26, 2016 8:07 am
- Rank: Beginner
- GD Posts: 0
- Has thanked: 15 times
- Been thanked: 23 times
Re: Engine Tournament
Your main error in reasoning is that the statistical significance of a result will not increase with a higher number of playouts but only with a higher number of games.q30 wrote:in a match with small number of games, but big number of playouts,that was confirmed in a match with big number of games, but small number of playouts,
Re: Engine Tournament
This is not entirely correct. Higher playouts reduce the random factor in individual matches somewhat, making the result more representative. OC this is a weaker effect than the statistical validity coming from the number of samples (only increasing the weight of samples towards 1, whereas a match on low playouts may only worth 0.7, for example).as0770 wrote:the statistical significance of a result will not increase with a higher number of playouts but only with a higher number of games.
-
as0770
- Lives with ko
- Posts: 180
- Joined: Sun Jun 26, 2016 8:07 am
- Rank: Beginner
- GD Posts: 0
- Has thanked: 15 times
- Been thanked: 23 times
Re: Engine Tournament
In a match with x playouts the winning chance for an engine is y %. There is nothing like a random factor. What you mean is: Results with a higher number of playouts are more representative for the engines strength. Of course that's true. But that don't mean you get a statistical significant result with less games.jann wrote:This is not entirely correct. Higher playouts reduce the random factor in individual matches somewhat, making the result more representative. OC this is a weaker effect than the statistical validity coming from the number of samples (only increasing the weight of samples towards 1, whereas a match on low playouts may only worth 0.7, for example).as0770 wrote:the statistical significance of a result will not increase with a higher number of playouts but only with a higher number of games.
-
Bill Spight
- Honinbo
- Posts: 10905
- Joined: Wed Apr 21, 2010 1:24 pm
- Has thanked: 3651 times
- Been thanked: 3373 times
Re: Engine Tournament
The number of playouts is one parameter of an engine's strength.as0770 wrote:What you mean is: Results with a higher number of playouts are more representative for the engines strength.
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
Re: Engine Tournament
Without random factor the stronger net would always win (and the games may even be identical).as0770 wrote:There is nothing like a random factor.
A winrate of eg. 54% may go up to 58% with quadruple playouts. This 58% makes slightly more statistical mass from the same number of games (because each sample weights nearly 1, while at very low playouts game results are more random, thus weight less than 1 - carry less information).
The same number of more representative samples weights more than the same number of less representative samples. Maybe you understand better from a specific example: 102 games with 200 playouts are statistically less significant than 101 games with 2000 playouts (a weaker effect as mentioned).as0770 wrote:Results with a higher number of playouts are more representative for the engines strength. Of course that's true. But that don't mean you get a statistical significant result with less games.