Re: Engine Tournament
Posted: Sat Sep 15, 2018 5:59 am
It's even more, around 2.7%as0770 wrote:If one engine scores 55% in a 400 game match, there is still a chance of 1% that it is _not_ stronger.
Life in 19x19. Go, Weiqi, Baduk... Thats the life.
https://lifein19x19.com/
It's even more, around 2.7%as0770 wrote:If one engine scores 55% in a 400 game match, there is still a chance of 1% that it is _not_ stronger.
I didn't argue, that v. 0.14 absolutely stronger than v. 0.15, but most likely it's not weaker. And IMHO if the next version not stronger, than previous, that it's weakness of first one...as0770 wrote:
Or a weight file of similar strength. And once again: If one engine scores 55% in a 400 game match, there is still a chance of 1% that it is _not_ stronger.
If You'll open really page by the link, You'll find the next text (content of Readme.md file):as0770 wrote: Your link points to LeelaZero_PhoenixGo v1 and v2 but you were talking about Leela Zero v0.14 and 0.15. LeelaZero 0.14 don't support Phoenix networks. Results of LeelaZero_PhoenixGo v1 vs. LeelaZero_PhoenixGo v2 with Phoeniox Networks don't say anything about the strength of LeelaZero v0.14 and 0.15. I wouldn't even trust results with Leela networks.
So 'v1' & 'v2' are only the folders (i.e. a parts of link paths) and the engines are: "leela-zero-0.14-phoenix-go" and "leela-zero-0.15-phoenix-go"...LeelaZero_PhoenixGo
LeelaZero + PhoenixGo's weights.
0.15 + lizzie
Source Code
https://github.com/yenw/LeelaZero_Phoen ... lizzie.zip
Weights
V1: https://github.com/yenw/LeelaZero_Phoen ... _v1.txt.gz
Leela Zero
Windows
CPU: https://github.com/yenw/LeelaZero_Phoen ... ie.exe.zip
GPU: https://github.com/yenw/LeelaZero_Phoen ... ie.exe.zip
0.15
Source Code
https://github.com/yenw/LeelaZero_Phoen ... nix-go.zip
Weights
V1: https://github.com/yenw/LeelaZero_Phoen ... _v1.txt.gz
Leela Zero
Windows
CPU: https://github.com/yenw/LeelaZero_Phoen ... pu.exe.zip
GPU: https://github.com/yenw/LeelaZero_Phoen ... cl.exe.zip
0.14
Source Code
https://github.com/yenw/LeelaZero_Phoen ... nix-go.zip
Weights
V1: https://github.com/yenw/LeelaZero_Phoen ... _v1.txt.gz
Leela Zero
Windows
CPU: https://github.com/yenw/LeelaZero_Phoen ... pu.exe.zip
GPU: https://github.com/yenw/LeelaZero_Phoen ... cl.exe.zip
Linux
CPU: https://github.com/yenw/LeelaZero_Phoen ... 64_cpu.zip
GPU: https://github.com/yenw/LeelaZero_Phoen ... opencl.zip
MacOS
CPU: https://github.com/yenw/LeelaZero_Phoen ... 64_cpu.zip
GPU: https://github.com/yenw/LeelaZero_Phoen ... opencl.zip
And these quantitative calculations are true in used conditions (time control, number of playouts and so on) only...Vargo wrote:It's even more, around 2.7%as0770 wrote:If one engine scores 55% in a 400 game match, there is still a chance of 1% that it is _not_ stronger.
You did. Thats what all the discussion is about:q30 wrote:I didn't argue, that v. 0.14 absolutely stronger than v. 0.15,
q30 wrote:LeelaZero 0.14 is a bit stronger (not as much, as in case of Phoenix), than 0.15 (details)...
Nevertheless it is not the same as the official Leela branch, did I already mention that Leela v0.14 doesn't support Phoenix networks?q30 wrote: If You'll open really page by the link, You'll find the next text
I think, You agree, that 0.15 version is not stronger, than 0.14 one. For most usual users it's the same, that new version is weaker: it's no a reason to download it. Have You any result of sparring, where 0.15 version was at most a bit stronger,than 0.14 one?as0770 wrote:You did. Thats what all the discussion is about:q30 wrote:I didn't argue, that v. 0.14 absolutely stronger than v. 0.15,
q30 wrote:LeelaZero 0.14 is a bit stronger (not as much, as in case of Phoenix), than 0.15 (details)...Nevertheless it is not the same as the official Leela branch, did I already mention that Leela v0.14 doesn't support Phoenix networks?q30 wrote: If You'll open really page by the link, You'll find the next text
Don't you read the answers to your posts, do you just ignore them or don't you understand them? That is really ridiculous...q30 wrote:I think, You agree, that 0.15 version is not stronger, than 0.14 one. For most usual users it's the same, that new version is weaker: it's no a reason to download it. Have You any result of sparring, where 0.15 version was at most a bit stronger,than 0.14 one?as0770 wrote:You did. Thats what all the discussion is about:q30 wrote:I didn't argue, that v. 0.14 absolutely stronger than v. 0.15,
q30 wrote:LeelaZero 0.14 is a bit stronger (not as much, as in case of Phoenix), than 0.15 (details)...Nevertheless it is not the same as the official Leela branch, did I already mention that Leela v0.14 doesn't support Phoenix networks?q30 wrote: If You'll open really page by the link, You'll find the next text
You can do yourself 2 matches of v.014 vs v.0.15 with enough for You number of games with a very big time control (10 '/move, for example) and with GPU enabled: 1) with -p 1; 2) with -p 10000. If in second match will more % win of one engine, than it's stronger.
Version 0.14 of Phoenix LeelaZero supports Phoenix neuronets: see results
I simply try to answer questions here. It is somehow frustrating when you take time and effort to explain simple little facts, and the respondent consequently ignores them. I don't know if he is just trolling or if he don't understands anything. The purpose of a public forum is to discuss in public. And btw your post is way more off topic than ours. If you don't like the thread, just ignore it.yakcyll wrote:I'm not a moderator here, but I'm pretty sure I'm not the only one to think this exchange has already gone far enough. If you want to keep throwing ad hominems around, it's best to keep it to private messages; pretty sure this thread was never meant to serve this purpose.
I didn't understand English well. But the same question is for You...as0770 wrote:...
Don't you read the answers to your posts, do you just ignore them or don't you understand them? That is really ridiculous...
(especially in case of Phoenix)...2 matches of v.014 vs v.0.15 with statistically enough number of games with a very big time control (10 '/move, for example) and with GPU enabled: 1) with -p 1; 2) with -p 10000. If in second match will more % win of one engine, than it's stronger (for more reliability one can add the third match with -p 1000 to receive intermediate value of % win).
I didn't say it is not stronger. I said 50,8% winrate is far away from proofing anything, and you should simply accept this fact.q30 wrote:You aren't agree, that Phoenix LeelaZero 0.14 is stronger, than 0.15 one
I don't want to proof it because I would have to play many thousands of games,q30 wrote:(and if don't agree, than were is return proof in facts/results)?
Code: Select all
This is a bugfix release for training game generation.
Bugfix to Dirichlet noise being more uniform than intended (bug was introduced in v0.13).
Tweaks to randomized move selection in training games to reduce needless blunders. Added extra options.If you roll a dice the chance of a 4-0 is 12,5%. If you play matches between two engines the chance of a 4-0 is >= 12,5% because possible duplicated moves, differences in compilation and of course differences in strength.q30 wrote:How I had mentioned above, the difference in strength may be results from difference in performance, maden by changes in Phoenix LeelaZero code.
So, the only proof is "LeelaZero_phoenix_0.15 - LeelaZero_phoenix_0.14: 0 - 4", i.e. Phoenix LeelaZero v.0.14 is stronger, that v.0.15...