KataGo Distributed Training and new networks
-
And
- Gosei
- Posts: 1464
- Joined: Tue Sep 25, 2018 10:28 am
- GD Posts: 0
- Has thanked: 212 times
- Been thanked: 215 times
Re: KataGo Distributed Training and new networks
BadukAI s555 1p - Sabaki b40c384 1p 1:0
- Attachments
-
- s555 - 40x384.sgf
- (1.08 KiB) Downloaded 8192 times
-
go4thewin
- Lives with ko
- Posts: 150
- Joined: Thu Jan 23, 2020 6:09 am
- Rank: 25 kyu
- GD Posts: 0
- Has thanked: 200 times
- Been thanked: 30 times
Re: KataGo Distributed Training and new networks
katago 40b s558 1po vs golaxy 9d : 1-0
b+r
http://go.ba.net/playgo/go-embed.html?s ... f&move=151
katago 40b s5675 1po vs KGS Hirabot 6d (10,000 po?)
b+r
https://kifu.io/view/t3ArnpUDdJDzfWjdU4x2
b+r
http://go.ba.net/playgo/go-embed.html?s ... f&move=151
katago 40b s5675 1po vs KGS Hirabot 6d (10,000 po?)
b+r
https://kifu.io/view/t3ArnpUDdJDzfWjdU4x2
- Attachments
-
- Golaxy.sgf
- (1.01 KiB) Downloaded 3038 times
Last edited by go4thewin on Fri Jan 15, 2021 3:24 pm, edited 1 time in total.
-
And
- Gosei
- Posts: 1464
- Joined: Tue Sep 25, 2018 10:28 am
- GD Posts: 0
- Has thanked: 212 times
- Been thanked: 215 times
Re: KataGo Distributed Training and new networks
KataGo started winning with black. BadukAI s560 1p - CS Zero 9d 1:0
after 231 moves, CS Zero showed Black's winrate 12.5
after 231 moves, CS Zero showed Black's winrate 12.5
- Attachments
-
- BadukAI - CS Zero.sgf
- (1.77 KiB) Downloaded 8102 times
-
And
- Gosei
- Posts: 1464
- Joined: Tue Sep 25, 2018 10:28 am
- GD Posts: 0
- Has thanked: 212 times
- Been thanked: 215 times
Re: KataGo Distributed Training and new networks
CS Zero has no chance against the new s567 network! KataGo s567 1p - CS Zero 9d:
- Attachments
-
- KataGo - CS Zero.sgf
- (1.55 KiB) Downloaded 7738 times
-
And
- Gosei
- Posts: 1464
- Joined: Tue Sep 25, 2018 10:28 am
- GD Posts: 0
- Has thanked: 212 times
- Been thanked: 215 times
Re: KataGo Distributed Training and new networks
KataGo s572 playouts 1, komi 0.5 - CS Zero 9d:
- Attachments
-
- KataGo - CS Zero.sgf
- (1.01 KiB) Downloaded 7920 times
-
go4thewin
- Lives with ko
- Posts: 150
- Joined: Thu Jan 23, 2020 6:09 am
- Rank: 25 kyu
- GD Posts: 0
- Has thanked: 200 times
- Been thanked: 30 times
Re: KataGo Distributed Training and new networks
I hope s572 or s570 is the net that gets optimized. It's strong enough now that no/few users will have to play against more than one playout. Great for a mobile app!
Just like against cs zero 9d, It also beat the strongest 7d rank on zen 6 (gui version, I have not tried gtp4zen). Using 1 playout, playing as white with no komi, zen resigned after 200 moves; s572 was leading by >70. zen 6 is a strong program that many dan players love to play against.
http://go.ba.net/playgo/go.html?sgf=3mDscV5Cq.sgf
https://ibb.co/GFTLG7j
Just like against cs zero 9d, It also beat the strongest 7d rank on zen 6 (gui version, I have not tried gtp4zen). Using 1 playout, playing as white with no komi, zen resigned after 200 moves; s572 was leading by >70. zen 6 is a strong program that many dan players love to play against.
http://go.ba.net/playgo/go.html?sgf=3mDscV5Cq.sgf
https://ibb.co/GFTLG7j
-
And
- Gosei
- Posts: 1464
- Joined: Tue Sep 25, 2018 10:28 am
- GD Posts: 0
- Has thanked: 212 times
- Been thanked: 215 times
Re: KataGo Distributed Training and new networks
KataGo s572 playouts 1 - Zenith 7 9d:
- Attachments
-
- KataGo - Zenith 7.sgf
- (1.59 KiB) Downloaded 2173 times
-
And
- Gosei
- Posts: 1464
- Joined: Tue Sep 25, 2018 10:28 am
- GD Posts: 0
- Has thanked: 212 times
- Been thanked: 215 times
Re: KataGo Distributed Training and new networks
with the same time settings, CS Zero plays stronger
CS Zero 10 sec - Zenith 7 10 sec 14:6
CS Zero 60 sec - Zenith 7 60 sec 12:8
but when playing 9d it seemed to me that Zenith 7 is stronger. KataGo s572 playouts 1 without Komi could not win against Zenith 7
CS Zero 10 sec - Zenith 7 10 sec 14:6
CS Zero 60 sec - Zenith 7 60 sec 12:8
but when playing 9d it seemed to me that Zenith 7 is stronger. KataGo s572 playouts 1 without Komi could not win against Zenith 7
-
lightvector
- Lives in sente
- Posts: 759
- Joined: Sat Jun 19, 2010 10:11 pm
- Rank: maybe 2d
- GD Posts: 0
- Has thanked: 114 times
- Been thanked: 916 times
Re: KataGo Distributed Training and new networks
New distributed training run should be now open for anyone to connect!
viewtopic.php?f=9&t=18019
https://www.reddit.com/r/baduk/comments ... ributions/
viewtopic.php?f=9&t=18019
https://www.reddit.com/r/baduk/comments ... ributions/
- wineandgolover
- Lives in sente
- Posts: 866
- Joined: Sun Jul 25, 2010 6:05 am
- GD Posts: 0
- Has thanked: 318 times
- Been thanked: 345 times
Re: KataGo Distributed Training and new networks
Hi lightvector,
Are any of your contributors using Mac? I’m happy to help, but I'm probably a bad choice for a pioneer. My GPU's aren’t bad though.
Are any of your contributors using Mac? I’m happy to help, but I'm probably a bad choice for a pioneer. My GPU's aren’t bad though.
- Brady
Want to see videos of low-dan mistakes and what to learn from them? Brady's Blunders
Want to see videos of low-dan mistakes and what to learn from them? Brady's Blunders
-
hakuseki
- Dies with sente
- Posts: 100
- Joined: Thu Oct 29, 2020 6:18 am
- Rank: KGS 2 dan
- GD Posts: 0
- KGS: hakuseki
- Been thanked: 15 times
Re: KataGo Distributed Training and new networks
This is just an idle thought, but would it make sense to increase the number of visits in self-play games in the later stages of a training run (i.e. now)?
My thought is that reinforcement learning will converge to a plateau where the approximation loss (e.g. the trained policy is 400 elo points weaker than the player it is approximating) equals the gain from tree search (e.g. 600 visits is 400 elo points stronger than 1 visit).
To move past the plateau it's necessary to either improve the approximator (e.g. by increasing network size) or approximate a stronger player (e.g. by increasing visits in search). I know KataGo already uses the former, but what about the latter?
My thought is that reinforcement learning will converge to a plateau where the approximation loss (e.g. the trained policy is 400 elo points weaker than the player it is approximating) equals the gain from tree search (e.g. 600 visits is 400 elo points stronger than 1 visit).
To move past the plateau it's necessary to either improve the approximator (e.g. by increasing network size) or approximate a stronger player (e.g. by increasing visits in search). I know KataGo already uses the former, but what about the latter?
-
lightvector
- Lives in sente
- Posts: 759
- Joined: Sat Jun 19, 2010 10:11 pm
- Rank: maybe 2d
- GD Posts: 0
- Has thanked: 114 times
- Been thanked: 916 times
Re: KataGo Distributed Training and new networks
Unfortunately I don't own a Mac, and don't have a great way to compile and test myself. I expect probably some people are using Mac for this, but if so, it's mainly going to be through building from source on their own. Which probably is not *too* hard, since (unlike Windows) I think package management and installation of libraries on Mac is pretty easy, similar to Linux? Things are a bit wordy here: https://github.com/lightvector/KataGo/b ... mpiling.md but the actual process is not hard (except on Windows).wineandgolover wrote:Hi lightvector,
Are any of your contributors using Mac? I’m happy to help, but I'm probably a bad choice for a pioneer. My GPU's aren’t bad though.
Right now the release process is also a bit time-consuming for me, to compile everything on my own personal computers (windows and linux) and package stuff up. If someone were willing to put in the effort to help set up a more automated build process for releases, which could maybe include Mac as well, that would be fantastic.
-
And
- Gosei
- Posts: 1464
- Joined: Tue Sep 25, 2018 10:28 am
- GD Posts: 0
- Has thanked: 212 times
- Been thanked: 215 times
Re: KataGo Distributed Training and new networks
KataGo s581 playouts 1, komi 0 - Zenith 7 9d:
- Attachments
-
- KataGo - Zenith 7.sgf
- (1.4 KiB) Downloaded 1980 times