It is currently Thu Mar 28, 2024 4:02 am

All times are UTC - 8 hours [ DST ]




Post new topic Reply to topic  [ 390 posts ]  Go to page Previous  1 ... 10, 11, 12, 13, 14, 15, 16 ... 20  Next
Author Message
Offline
 Post subject: Re: Engine Tournament
Post #241 Posted: Sat Sep 15, 2018 5:59 am 
Lives in gote

Posts: 337
Liked others: 22
Was liked: 97
as0770 wrote:
If one engine scores 55% in a 400 game match, there is still a chance of 1% that it is _not_ stronger.
It's even more, around 2.7%


This post by Vargo was liked by: as0770
Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #242 Posted: Sat Sep 15, 2018 9:20 am 
Lives with ko

Posts: 135
Liked others: 1
Was liked: 1
Rank: 30 kyu
as0770 wrote:

Or a weight file of similar strength. And once again: If one engine scores 55% in a 400 game match, there is still a chance of 1% that it is _not_ stronger.

I didn't argue, that v. 0.14 absolutely stronger than v. 0.15, but most likely it's not weaker. And IMHO if the next version not stronger, than previous, that it's weakness of first one...
as0770 wrote:
Your link points to LeelaZero_PhoenixGo v1 and v2 but you were talking about Leela Zero v0.14 and 0.15. LeelaZero 0.14 don't support Phoenix networks. Results of LeelaZero_PhoenixGo v1 vs. LeelaZero_PhoenixGo v2 with Phoeniox Networks don't say anything about the strength of LeelaZero v0.14 and 0.15. I wouldn't even trust results with Leela networks.

If You'll open really page by the link, You'll find the next text (content of Readme.md file):
Quote:
So 'v1' & 'v2' are only the folders (i.e. a parts of link paths) and the engines are: "leela-zero-0.14-phoenix-go" and "leela-zero-0.15-phoenix-go"...

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #243 Posted: Sat Sep 15, 2018 9:27 am 
Lives with ko

Posts: 135
Liked others: 1
Was liked: 1
Rank: 30 kyu
Vargo wrote:
as0770 wrote:
If one engine scores 55% in a 400 game match, there is still a chance of 1% that it is _not_ stronger.
It's even more, around 2.7%

And these quantitative calculations are true in used conditions (time control, number of playouts and so on) only...

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #244 Posted: Sat Sep 15, 2018 1:51 pm 
Lives with ko

Posts: 180
Liked others: 15
Was liked: 23
Rank: Beginner
q30 wrote:
I didn't argue, that v. 0.14 absolutely stronger than v. 0.15,


You did. Thats what all the discussion is about:

q30 wrote:
LeelaZero 0.14 is a bit stronger (not as much, as in case of Phoenix), than 0.15 (details)...


q30 wrote:
If You'll open really page by the link, You'll find the next text


Nevertheless it is not the same as the official Leela branch, did I already mention that Leela v0.14 doesn't support Phoenix networks?

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #245 Posted: Sat Sep 22, 2018 8:13 am 
Lives with ko

Posts: 135
Liked others: 1
Was liked: 1
Rank: 30 kyu
as0770 wrote:
q30 wrote:
I didn't argue, that v. 0.14 absolutely stronger than v. 0.15,


You did. Thats what all the discussion is about:

q30 wrote:
LeelaZero 0.14 is a bit stronger (not as much, as in case of Phoenix), than 0.15 (details)...


q30 wrote:
If You'll open really page by the link, You'll find the next text


Nevertheless it is not the same as the official Leela branch, did I already mention that Leela v0.14 doesn't support Phoenix networks?

I think, You agree, that 0.15 version is not stronger, than 0.14 one. For most usual users it's the same, that new version is weaker: it's no a reason to download it. Have You any result of sparring, where 0.15 version was at most a bit stronger,than 0.14 one?
You can do yourself 2 matches of v.014 vs v.0.15 with enough for You number of games with a very big time control (10 '/move, for example) and with GPU enabled: 1) with -p 1; 2) with -p 10000. If in second match will more % win of one engine, than it's stronger.

Version 0.14 of Phoenix LeelaZero supports Phoenix neuronets: see results

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #246 Posted: Sat Sep 22, 2018 11:03 am 
Lives with ko

Posts: 180
Liked others: 15
Was liked: 23
Rank: Beginner
q30 wrote:
as0770 wrote:
q30 wrote:
I didn't argue, that v. 0.14 absolutely stronger than v. 0.15,


You did. Thats what all the discussion is about:

q30 wrote:
LeelaZero 0.14 is a bit stronger (not as much, as in case of Phoenix), than 0.15 (details)...


q30 wrote:
If You'll open really page by the link, You'll find the next text


Nevertheless it is not the same as the official Leela branch, did I already mention that Leela v0.14 doesn't support Phoenix networks?

I think, You agree, that 0.15 version is not stronger, than 0.14 one. For most usual users it's the same, that new version is weaker: it's no a reason to download it. Have You any result of sparring, where 0.15 version was at most a bit stronger,than 0.14 one?
You can do yourself 2 matches of v.014 vs v.0.15 with enough for You number of games with a very big time control (10 '/move, for example) and with GPU enabled: 1) with -p 1; 2) with -p 10000. If in second match will more % win of one engine, than it's stronger.

Version 0.14 of Phoenix LeelaZero supports Phoenix neuronets: see results


Don't you read the answers to your posts, do you just ignore them or don't you understand them? That is really ridiculous...


This post by as0770 was liked by: And
Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #247 Posted: Sun Sep 23, 2018 4:44 am 
Dies with sente
User avatar

Posts: 77
Location: Warsaw, PL
Liked others: 165
Was liked: 18
Rank: EGF 3k
Universal go server handle: yakcyll
I'm not a moderator here, but I'm pretty sure I'm not the only one to think this exchange has already gone far enough. If you want to keep throwing ad hominems around, it's best to keep it to private messages; pretty sure this thread was never meant to serve this purpose.


This post by yakcyll was liked by: Uberdude
Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #248 Posted: Sun Sep 23, 2018 5:51 am 
Lives with ko

Posts: 180
Liked others: 15
Was liked: 23
Rank: Beginner
yakcyll wrote:
I'm not a moderator here, but I'm pretty sure I'm not the only one to think this exchange has already gone far enough. If you want to keep throwing ad hominems around, it's best to keep it to private messages; pretty sure this thread was never meant to serve this purpose.


I simply try to answer questions here. It is somehow frustrating when you take time and effort to explain simple little facts, and the respondent consequently ignores them. I don't know if he is just trolling or if he don't understands anything. The purpose of a public forum is to discuss in public. And btw your post is way more off topic than ours. If you don't like the thread, just ignore it.

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #249 Posted: Sat Oct 06, 2018 1:02 am 
Lives with ko

Posts: 135
Liked others: 1
Was liked: 1
Rank: 30 kyu
as0770 wrote:
...

Don't you read the answers to your posts, do you just ignore them or don't you understand them? That is really ridiculous...


I didn't understand English well. But the same question is for You...
If You yourself read and understand answers to Your posts, can You give answer to the question: You aren't agree, that Phoenix LeelaZero 0.14 is stronger, than 0.15 one (and if don't agree, than were is return proof in facts/results)?

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #250 Posted: Sat Oct 06, 2018 2:25 am 
Lives in gote

Posts: 337
Liked others: 22
Was liked: 97
Someone call the referee, plz.... ;-)

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #251 Posted: Sat Oct 06, 2018 8:36 am 
Lives with ko

Posts: 135
Liked others: 1
Was liked: 1
Rank: 30 kyu
The referee can be anyone with modern Nvidia video card, that will do the next:
Quote:
2 matches of v.014 vs v.0.15 with statistically enough number of games with a very big time control (10 '/move, for example) and with GPU enabled: 1) with -p 1; 2) with -p 10000. If in second match will more % win of one engine, than it's stronger (for more reliability one can add the third match with -p 1000 to receive intermediate value of % win).
(especially in case of Phoenix)...

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #252 Posted: Sat Oct 06, 2018 10:53 am 
Lives with ko

Posts: 180
Liked others: 15
Was liked: 23
Rank: Beginner
q30 wrote:
You aren't agree, that Phoenix LeelaZero 0.14 is stronger, than 0.15 one


I didn't say it is not stronger. I said 50,8% winrate is far away from proofing anything, and you should simply accept this fact.

Especially when testing different versions of one engine, the small differences in strength will result in a higher winrate than when testing different engines. So for me 50,8% mean that both versions are equal in strength.

q30 wrote:
(and if don't agree, than were is return proof in facts/results)?


I don't want to proof it because I would have to play many thousands of games,

But why don't you take a look at Leela's changelog:

Code:
This is a bugfix release for training game generation.

    Bugfix to Dirichlet noise being more uniform than intended (bug was introduced in v0.13).
    Tweaks to randomized move selection in training games to reduce needless blunders. Added extra options.


https://github.com/gcp/leela-zero/releases

There is NO difference in gameplay for the official Leela and I don't care if there is a tiny difference in some Leela hack.

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #253 Posted: Sat Oct 13, 2018 10:09 am 
Lives with ko

Posts: 135
Liked others: 1
Was liked: 1
Rank: 30 kyu
How I had mentioned above, the difference in strength may be results from difference in performance, maden by changes in Phoenix LeelaZero code.
So, the only proof is "LeelaZero_phoenix_0.15 - LeelaZero_phoenix_0.14: 0 - 4", i.e. Phoenix LeelaZero v.0.14 is stronger, that v.0.15...

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #254 Posted: Sun Oct 14, 2018 12:14 am 
Lives with ko

Posts: 180
Liked others: 15
Was liked: 23
Rank: Beginner
q30 wrote:
How I had mentioned above, the difference in strength may be results from difference in performance, maden by changes in Phoenix LeelaZero code.
So, the only proof is "LeelaZero_phoenix_0.15 - LeelaZero_phoenix_0.14: 0 - 4", i.e. Phoenix LeelaZero v.0.14 is stronger, that v.0.15...


If you roll a dice the chance of a 4-0 is 12,5%. If you play matches between two engines the chance of a 4-0 is >= 12,5% because possible duplicated moves, differences in compilation and of course differences in strength.

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #255 Posted: Sat Oct 20, 2018 4:46 am 
Lives with ko

Posts: 135
Liked others: 1
Was liked: 1
Rank: 30 kyu
Compilation was the same (PC, system soft and compilation parameters).
Because other proofs (with bigger %, i.e. in big plays number matches with GPU enabled) are absent, on this moment the approval "version 14 is stronger, that 15 one" is true...

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #256 Posted: Sat Oct 20, 2018 12:26 pm 
Lives with ko

Posts: 180
Liked others: 15
Was liked: 23
Rank: Beginner
q30 wrote:
Compilation was the same (PC, system soft and compilation parameters).
Because other proofs (with bigger %, i.e. in big plays number matches with GPU enabled) are absent, on this moment the approval "version 14 is stronger, that 15 one" is true...


You can't claim it to be true as long as it is not proven.

Talking about the result of 208:216... If both opponents have the same strength, the chance that one will score at least 216 points in a 424 games match is 73%.

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #257 Posted: Sat Oct 27, 2018 8:06 am 
Lives with ko

Posts: 135
Liked others: 1
Was liked: 1
Rank: 30 kyu
You have got the result 216:208 in favour of Phoenix LZ v.14?
And how many playouts there was in games beginning?

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #258 Posted: Sat Oct 27, 2018 8:19 am 
Lives with ko

Posts: 180
Liked others: 15
Was liked: 23
Rank: Beginner
q30 wrote:
You have got the result 216:208 in favour of Phoenix LZ v.14?
And how many playouts there was in games beginning?


It's from the link you posted.

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #259 Posted: Sat Nov 03, 2018 4:43 am 
Lives with ko

Posts: 135
Liked others: 1
Was liked: 1
Rank: 30 kyu
I had not posted any link to any game results between Phoenix v.14 and v.15, except mine.
So, on this moment the only proof is, that Phoenix LZ v.14 is stronger, than v.15, with (from basics of probability theory) the probability of their equality (1/2)^4 and the probability of stronger v.15 <(1/2)^4, i.e. v.14 is stronger with >87,5%.

Top
 Profile  
 
Offline
 Post subject: Re: Engine Tournament
Post #260 Posted: Sat Nov 03, 2018 11:10 am 
Gosei

Posts: 1590
Liked others: 886
Was liked: 527
Rank: AGA 3k Fox 3d
GD Posts: 61
KGS: dfan
If A and B are equally strong, A will beat B 4-0 with probability 1/16. This doesn't mean that if A beats B 4-0, the probability they are equally strong is 1/16.

Top
 Profile  
 
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 390 posts ]  Go to page Previous  1 ... 10, 11, 12, 13, 14, 15, 16 ... 20  Next

All times are UTC - 8 hours [ DST ]


Who is online

Users browsing this forum: No registered users and 1 guest


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Jump to:  
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group