Engine Tournament

For discussing go computing, software announcements, etc.
q30
Lives with ko
Posts: 145
Joined: Sat Aug 13, 2016 8:23 am
Rank: 30 kyu
GD Posts: 0
Has thanked: 1 time
Been thanked: 1 time

Re: Engine Tournament

Post by q30 »

The rating of Phoenix LeelaZero versions (details):
1) 0.16
2) lz_v0.33(0.15)
3) lz_v0.32(0.15)
4) lz_orig(0.15)
5) lizzie(0.15)
6) 0.14

But I don't understand the reason for creating all these versions without updating the neural network...
spook
Lives with ko
Posts: 151
Joined: Thu Jul 24, 2014 1:34 pm
Rank: 2d
GD Posts: 0
KGS: LordVader
Location: Belgium
Has thanked: 11 times
Been thanked: 48 times

Re: Engine Tournament

Post by spook »

q30 wrote:Yes, the description is the same and the lists are different (the list was updated: was added Zenith and was removed Ray(RLO) because of about sense). What is the problem
I never have problems, only challenges. :)
It wasn't obvious, but now it is. Thanks.
Enjoy LeeLaZero and KataGo from your webbrowser, without installing anything !
https://www.zbaduk.com
as0770
Lives with ko
Posts: 180
Joined: Sun Jun 26, 2016 8:07 am
Rank: Beginner
GD Posts: 0
Has thanked: 15 times
Been thanked: 23 times

Re: Engine Tournament

Post by as0770 »

q30 wrote:This is really ridiculous, how You read posts...
Where did You find the word "Phoenix" (or "Феникс") in the post at this link?!
We did not talk about Phoenix but about the significance of the results you posted. You pervert the facts once again. I just don't know if you do it on purpose or ignorantly.
Uberdude
Judan
Posts: 6727
Joined: Thu Nov 24, 2011 11:35 am
Rank: UK 4 dan
GD Posts: 0
KGS: Uberdude 4d
OGS: Uberdude 7d
Location: Cambridge, UK
Has thanked: 436 times
Been thanked: 3718 times

Re: Engine Tournament

Post by Uberdude »

as0770, why not ignore this thread instead of engaging in these pointless arguments?
as0770
Lives with ko
Posts: 180
Joined: Sun Jun 26, 2016 8:07 am
Rank: Beginner
GD Posts: 0
Has thanked: 15 times
Been thanked: 23 times

Re: Engine Tournament

Post by as0770 »

Uberdude wrote:as0770, why not ignore this thread instead of engaging in these pointless arguments?
Oh well, I tried to delete my account here but I failed. So I will go on answering if someone tells nonsense. But feel free to delete my account.
AloneAgainstAll
Lives with ko
Posts: 127
Joined: Thu May 16, 2019 10:16 am
Rank: KGS 1d
GD Posts: 0
Has thanked: 2 times
Been thanked: 21 times

Re: Engine Tournament

Post by AloneAgainstAll »

3 nov 2018
as0770 wrote:
I promise this is my last post in this thread...

25 jul 2019
as0770 wrote:
We did not talk about Phoenix but about the significance of the results you posted. You pervert the facts once again. I just don't know if you do it on purpose or ignorantly.
I see a strong contradiction here.
as0770
Lives with ko
Posts: 180
Joined: Sun Jun 26, 2016 8:07 am
Rank: Beginner
GD Posts: 0
Has thanked: 15 times
Been thanked: 23 times

Re: Engine Tournament

Post by as0770 »

AloneAgainstAll wrote:3 nov 2018
as0770 wrote:
I promise this is my last post in this thread...

25 jul 2019
as0770 wrote:
We did not talk about Phoenix but about the significance of the results you posted. You pervert the facts once again. I just don't know if you do it on purpose or ignorantly.
I see a strong contradiction here.
Congrats, you found a contradiction in a public forum. :clap:

I know I shouldn't read posts by blacklisted people, but now and then I forget my resolution and read them, because the forum software doesn't allow blacklisting someone completely. And then I feel this inner compulsion and have to answer the nonsense.

BTW I ignored it for a long time, but ever since I started this thread this guy has been writing disparaging comments about me. I could live with it if there were at least some basic expertise. But there is neither any know-how, nor does he understand English.

So after all I think it is OK to answer him once every half a year. Judge for yourself if it is OK to blame me for that.
q30
Lives with ko
Posts: 145
Joined: Sat Aug 13, 2016 8:23 am
Rank: 30 kyu
GD Posts: 0
Has thanked: 1 time
Been thanked: 1 time

Re: Engine Tournament

Post by q30 »

spook wrote:
q30 wrote:Yes, the description is the same and the lists are different (the list was updated: was added Zenith and was removed Ray(RLO) because of about sense). What is the problem
I never have problems, only challenges. :)
It wasn't obvious, but now it is. Thanks.
I had thought that my post had a problem...
q30
Lives with ko
Posts: 145
Joined: Sat Aug 13, 2016 8:23 am
Rank: 30 kyu
GD Posts: 0
Has thanked: 1 time
Been thanked: 1 time

Re: Engine Tournament

Post by q30 »

as0770 wrote:
q30 wrote:This is really ridiculous, how You read posts...
Where did You find the word "Phoenix" (or "Феникс") in the post at this link?!
We did not talk about Phoenix but about the significance of the results you posted. You pervert the facts once again. I just don't know if you do it on purpose or ignorantly.
I don't know what abstract significance You were talking about, but I was talking specifically about two cases. In the first case, with original LeelaZero, there was a small lead in the score in a match with a small number of games but a large number of playouts, which was then confirmed in a match with a large number of games but a small number of playouts. In the second case, with LeelaZero Phoenix (which is what that post was about), there was a large lead in the score in a match with a small number of games but a large number of playouts (and there weren't any additional matches).
So there wasn't any perversion of facts, only Your inattentive reading of posts...
q30
Lives with ko
Posts: 145
Joined: Sat Aug 13, 2016 8:23 am
Rank: 30 kyu
GD Posts: 0
Has thanked: 1 time
Been thanked: 1 time

Re: Engine Tournament

Post by q30 »

The rating of original (weights file format) LeelaZero versions (details):
1) 0.17
2) lz_next_190222(0.16)
3&4) 0.16&0.14
5) 0.15(all)

Only version 0.17 is significantly stronger than the previous ones.

One "trick": to report not only the LeelaZero version but also the weights file used, replace in GTP.cpp the line

gtp_printf(id, PROGRAM_VERSION);
with this one

gtp_printf(id, " %s + %s", PROGRAM_VERSION, cfg_weightsfile.c_str());
before compiling the engine.
as0770
Lives with ko
Posts: 180
Joined: Sun Jun 26, 2016 8:07 am
Rank: Beginner
GD Posts: 0
Has thanked: 15 times
Been thanked: 23 times

Re: Engine Tournament

Post by as0770 »

q30 wrote:in a match with a small number of games but a large number of playouts, which was then confirmed in a match with a large number of games but a small number of playouts,
Your main error in reasoning is overlooking that the statistical significance of a result will not increase with a higher number of playouts but only with a higher number of games.
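
To put a number on "statistical significance" here, one option is an exact two-sided binomial test against the hypothesis that both engines are equally strong. This is only a minimal sketch, not anything used in the matches above: the function names are made up for illustration, draws are ignored, and games are assumed to be independent.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>

// Probability of exactly k wins in n games if every game is a 50/50 coin flip.
// lgamma keeps the binomial coefficient from overflowing for large n.
double binom_pmf_fair(int n, int k) {
    double log_choose = std::lgamma(n + 1.0) - std::lgamma(k + 1.0)
                      - std::lgamma(n - k + 1.0);
    return std::exp(log_choose - n * std::log(2.0));
}

// Two-sided p-value: chance, under equal strength, of a score at least as
// lopsided as the one observed (hypothetical helper, no draws).
double two_sided_p_value(int games, int wins) {
    assert(games > 0 && wins >= 0 && wins <= games);
    int high = std::max(wins, games - wins);  // the larger side of the score
    double tail = 0.0;
    for (int k = high; k <= games; ++k) {
        tail += binom_pmf_fair(games, k);
    }
    // The fair-coin distribution is symmetric, so double one tail, capped at 1.
    return std::min(1.0, 2.0 * tail);
}
```

For instance, `two_sided_p_value(10, 6)` comes out at roughly 0.75, while `two_sided_p_value(100, 60)` comes out at roughly 0.06: the same 60% score says far more over 100 games than over 10. The number of playouts does not appear anywhere in the calculation.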
jann
Lives in gote
Posts: 445
Joined: Tue May 14, 2019 8:00 pm
GD Posts: 0
Been thanked: 37 times

Re: Engine Tournament

Post by jann »

as0770 wrote:the statistical significance of a result will not increase with a higher number of playouts but only with a higher number of games.
This is not entirely correct. Higher playouts reduce the random factor in individual matches somewhat, making the result more representative. Of course, this is a weaker effect than the statistical validity coming from the number of samples (only increasing the weight of samples towards 1, whereas a match on low playouts may only be worth 0.7, for example).
as0770
Lives with ko
Posts: 180
Joined: Sun Jun 26, 2016 8:07 am
Rank: Beginner
GD Posts: 0
Has thanked: 15 times
Been thanked: 23 times

Re: Engine Tournament

Post by as0770 »

jann wrote:
as0770 wrote:the statistical significance of a result will not increase with a higher number of playouts but only with a higher number of games.
This is not entirely correct. Higher playouts reduce the random factor in individual matches somewhat, making the result more representative. Of course, this is a weaker effect than the statistical validity coming from the number of samples (only increasing the weight of samples towards 1, whereas a match on low playouts may only be worth 0.7, for example).
In a match with x playouts the winning chance for an engine is y %. There is nothing like a random factor. What you mean is: Results with a higher number of playouts are more representative of the engine's strength. Of course that's true. But that doesn't mean you get a statistically significant result with fewer games.
Bill Spight
Honinbo
Posts: 10905
Joined: Wed Apr 21, 2010 1:24 pm
Has thanked: 3651 times
Been thanked: 3373 times

Re: Engine Tournament

Post by Bill Spight »

as0770 wrote:What you mean is: Results with a higher number of playouts are more representative of the engine's strength.
The number of playouts is one parameter of an engine's strength.
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.
jann
Lives in gote
Posts: 445
Joined: Tue May 14, 2019 8:00 pm
GD Posts: 0
Been thanked: 37 times

Re: Engine Tournament

Post by jann »

as0770 wrote:There is nothing like a random factor.
Without a random factor the stronger net would always win (and the games may even be identical).

A winrate of e.g. 54% may go up to 58% with quadruple playouts. This 58% makes slightly more statistical mass from the same number of games (because each sample weighs nearly 1, while at very low playouts game results are more random, thus weigh less than 1 and carry less information).
as0770 wrote:Results with a higher number of playouts are more representative of the engine's strength. Of course that's true. But that doesn't mean you get a statistically significant result with fewer games.
The same number of more representative samples weighs more than the same number of less representative samples. Maybe a specific example makes it clearer: 102 games with 200 playouts are statistically less significant than 101 games with 2000 playouts (a weaker effect as mentioned).
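
The effect can be illustrated with a rough normal approximation. This is only a sketch under loud assumptions: the helper names are made up, draws are ignored, and the 54% and 58% win rates are the hypothetical figures from this post. Pairing them with the 200 and 2000 playout matches is an extra assumption made purely for illustration. The sketch also captures only the part of the argument that runs through a wider true win rate.

```cpp
#include <cassert>
#include <cmath>

// Expected z-score against the "equal strength" hypothesis when the true
// win rate is true_win_rate (normal approximation, null standard deviation 0.5).
double expected_z(int games, double true_win_rate) {
    assert(games > 0);
    return (true_win_rate - 0.5) * std::sqrt(static_cast<double>(games)) / 0.5;
}

// Approximate number of games at which the expected z-score reaches z_target
// (1.96 corresponds to the usual two-sided 5% level).
int games_needed(double true_win_rate, double z_target = 1.96) {
    assert(true_win_rate > 0.5 && true_win_rate < 1.0);
    double root = z_target * 0.5 / (true_win_rate - 0.5);
    return static_cast<int>(std::ceil(root * root));
}
```

Under these assumptions `expected_z(102, 0.54)` is about 0.8 and `expected_z(101, 0.58)` is about 1.6, so the match with one game fewer carries more evidence. Neither reaches 1.96, though: `games_needed(0.54)` gives 601 and `games_needed(0.58)` gives 151, so a wider gap lowers the number of games required without removing the need for many games.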