It is currently Tue Apr 30, 2024 2:19 am

All times are UTC - 8 hours [ DST ]




Post new topic Reply to topic  [ 216 posts ]  Go to page Previous  1, 2, 3, 4, 5, 6, 7 ... 11  Next
Author Message
Offline
 Post subject: Re: KataGo V1.3
Post #61 Posted: Sun Feb 09, 2020 3:55 am 
Dies in gote

Posts: 62
Liked others: 8
Was liked: 4
lightvector,

When I shift between the 20 block and the 15 block nets is it recommended to run the benchmark and tuner command again?

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #62 Posted: Sun Feb 09, 2020 7:36 am 
Lives in sente

Posts: 757
Liked others: 114
Was liked: 916
Rank: maybe 2d
By default, it automatically should, right? So yes, I would say it's recommended.

Although, I don't know how much of a difference it makes, truthfully, compared to if you were to manually rename one tuner file so that the other one would use the same tuning it as well.

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #63 Posted: Sun Feb 09, 2020 10:48 am 
Dies in gote

Posts: 62
Liked others: 8
Was liked: 4
I think I misunderstood the tuning thing...

But on my GT650M KataGo is twice as fast after the automatic tuning than if I run the tuner explicit.

On my better GPUs that is not the case.

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #64 Posted: Mon Feb 10, 2020 4:15 pm 
Lives with ko

Posts: 150
Liked others: 200
Was liked: 30
Rank: 25 kyu
With Leela Zero maxing out, I wonder what about katago? Is the 20b still improving near the incredible pace it has been? Will there be many more months or even a year more of this? This go bot has been an amazing thing

_________________
set katago to play at your level https://docdro.id/sHZU1ti or experiment with gtp4zen ( https://rb.gy/kx2ilb )

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #65 Posted: Mon Feb 10, 2020 5:32 pm 
Lives in sente

Posts: 757
Liked others: 114
Was liked: 916
Rank: maybe 2d
KataGo's run will continue! We're moving to 30 and 40 blocks and will continue training the 20 block to be stronger as well.

Sadly, the run has been temporarily paused for the last week. For various reasons, I need to transfer the ongoing training to a different cloud account instead, but in the new account Google Cloud support has refused to allocate and approve the necessary limits to resume the run. I'm attempting to get this sorted out. Ugh bureaucracy. :study: :)


This post by lightvector was liked by 6 people: Codexus, dfan, ez4u, go4thewin, Sneegurd, Vargo
Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #66 Posted: Wed Feb 12, 2020 4:42 am 
Lives in gote

Posts: 337
Liked others: 22
Was liked: 97
100-game test at visits parity (1600 visits for both)
gogui-twogtp 1.5.1, no error, no duplicate game , average time : 204s/game for LZ, 213 s/game for KG.

KataGo 1.3.2 (b20-s19-d43) v. LZ017(#263)
KataGo wins 72-28 (72%) all games by resignation.

stats :
Attachment:
k263.jpg
k263.jpg [ 462.82 KiB | Viewed 8334 times ]
The games :
Attachment:
KG132_v1600_LZ263_v1600.rar [85.99 KiB]
Downloaded 326 times


10-game test :
KataGo 1.3.2 (b20-s19-d43) MaxPlayouts=1, numSearchThreads=1
v.
gtp4zen7 -t1 -r20 -s4000 -n3 -o1.4 -p0.7 (8D, according to this site )

gtp4zen 8D wins 8-2 against KataGo at ... 1 playout

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #67 Posted: Wed Feb 12, 2020 5:46 am 
Dies in gote

Posts: 25
Liked others: 0
Was liked: 7
KGS: inbae
Here is the benchmark result for short time control:

Code:
KataGo vs Leela Zero STC benchmark - fixed playouts

    KataGo: g170-20x256-1.91G, 60 threads, 36000 playouts
Leela Zero: #263, 20 threads, 20000 playouts
   Configs: Resign threshold = 10%, pondering off, temperature = 0
  Hardware: 2 x RTX2080Ti
  Openings: 4-move opening book (3-3 included)

              Black          White          Total
KataGo       40 (61.5%)     51 (78.5%)     91 (70.0%)
Leela Zero   14 (21.5%)     25 (38.5%)     39 (30.0%)
Total        54 (41.5%)     65 (58.5%)     130

Elo rating difference: +147.2 +- 55.0


I halted this benchmark due to the clear statistical significance, and now a new benchmark with 1m per move (~200k playouts for LZ and ~350k playouts for KataGo) is running, with the current status being KG 7-1 LZ.

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #68 Posted: Wed Feb 12, 2020 5:56 am 
Lives in gote

Posts: 591
Location: Adelaide, South Australia
Liked others: 211
Was liked: 266
Rank: Australian 2 dan
GD Posts: 200
Vargo wrote:
10-game test :
KataGo 1.3.2 (b20-s19-d43) MaxPlayouts=1...

Why???

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #69 Posted: Wed Feb 12, 2020 6:24 am 
Lives in gote

Posts: 337
Liked others: 22
Was liked: 97
xela wrote:
Why???
To see how the "raw network" performs.

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #70 Posted: Thu Feb 13, 2020 9:34 am 
Lives in gote

Posts: 337
Liked others: 22
Was liked: 97
20-game test :
KataGo 1.3.2 (b20-s19-d43) MaxPlayouts=1, numSearchThreads=1
v.
gtp4zen7 -t1 -s2400 -n2 -o1.5 -p0.5 (strength : 4d)


KataGo wins 19-1 (all games by resignation, average time : ~0.5s/game for Katago !)

twogtp stats (KG always appears as W, because of the command -alternate)
Attachment:
k1.jpg
k1.jpg [ 402.93 KiB | Viewed 8232 times ]
The raw network is maybe somewhere between 4d and 8d ?


This post by Vargo was liked by: go4thewin
Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #71 Posted: Fri Feb 14, 2020 9:35 am 
Lives with ko

Posts: 150
Liked others: 200
Was liked: 30
Rank: 25 kyu
4 games between katago 1.3.2 s191 at max 400 playouts 1 thread against the weakest bot I could find that beat a pro over a real board, leela zero 125 at 4000 playouts (https://lifein19x19.com/viewtopic.php?f=18&t=15631). I used 1 thread for leelaz as well. Katago's games as white were very close, but katago won all 4, 2 white and 2 black. So it is maybe 9 dan amateur at 400 playouts.

katago 1.3.2 s191 1 thread 400 po 4 wins
lz (9dan) 125 1 thread 4000 po 0 wins
Katago 1.3.2 s191 1 thread 16 playouts vs gtp4zen 9d : 4-0
katago 1.3.2 15b s167 1 thread 70 playouts vs s191 16po : 4-0
katago 1.3.2 10b s114 1 thread 512 playouts vs s167 70po :2-2
Katago 1.3.2 s191 1 thread 1 playouts vs zen6 gtp4zen 7d 3:1
Katago's strength starts at the same level zen6 gtp4zen ends, so combining the two in a gui can give you a wide range of playing strengths to choose from for any board size and either ruleset. If you want lower playing strength (high ddk) than zen6, you can use zen7 withe less playouts. Sample 17k-9d (https://bit.ly/2TbYGeY)

_________________
set katago to play at your level https://docdro.id/sHZU1ti or experiment with gtp4zen ( https://rb.gy/kx2ilb )


Last edited by go4thewin on Sun Feb 23, 2020 5:31 am, edited 13 times in total.
Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #72 Posted: Sat Feb 15, 2020 9:00 am 
Lives in gote

Posts: 337
Liked others: 22
Was liked: 97
Not statistically significant, either : 100 game tests

KG-100po v KG-1po : 100-0
KG-200po v KG 100po : 91-9
KG-400po v KG-200po : 87-13
KG-800po v KG-400po : 81-19
(and KG-400po v KG-100po : 100-0)

KGv1.3.2, network g170-b20c256x2-s1913382912-d435450331, all games by resignation, no error, no duplicate game, gogui-twogtp 1.5.1
Doubling the playouts seems to give a 80-90% winrate, maybe less than that for high numbers of playouts ?

Some stats (highest po always appears as B, because of -alternate)
Attachment:
2.jpg
2.jpg [ 235.5 KiB | Viewed 8067 times ]
Attachment:
4.jpg
4.jpg [ 245.59 KiB | Viewed 8067 times ]
Attachment:
5.jpg
5.jpg [ 232.7 KiB | Viewed 8067 times ]


This post by Vargo was liked by: go4thewin
Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #73 Posted: Sun Feb 16, 2020 6:46 am 
Lives in sente

Posts: 757
Liked others: 114
Was liked: 916
Rank: maybe 2d
Some bureaucracy has been navigated finally now, and after a pause of about 2 weeks, KataGo's training run has resumed!


This post by lightvector was liked by 7 people: Bill Spight, Codexus, ez4u, go4thewin, Limeztone, Sneegurd, xela
Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #74 Posted: Fri Feb 21, 2020 7:57 pm 
Dies in gote

Posts: 25
Liked others: 0
Was liked: 7
KGS: inbae
KataGo vs Leela Zero VLTC benchmark is completed.

Code:
Engine 1: KataGo v1.3.2, g170-20x256-1.91G
Engine 2: Leela Zero v0.17, #263
Hardware: 2xRTX2080Ti
Time control: 1 minute per move

              Black          White          Total
KataGo       25 (67.6%)     30 (81.1%)     55 (74.3%)
Leela Zero    7 (18.9%)     12 (32.4%)     19 (25.7%)
Total        32 (43.2%)     42 (56.8%)     74

Elo rating difference: +184.6 +- 69.6

Diagonal openings: KG 22-4 LZ, Elo: +296.1 +- 96.4
Parallel openings: KG 33-15 LZ, Elo: +137.0 +- 91.1


Game records can be downloaded here. These include SGF files commented with winrates and scores, with PV's expanded as branches, and image files plotting winrates and scores. I hope these can also help for studying purposes.

The major difference between diagonal and parallel openings lies in that the "flying dagger" joseki, which KataGo does not understand very well, is only playable with parallel openings due to ladders.


Attachments:
flying_dagger.png
flying_dagger.png [ 214.62 KiB | Viewed 7804 times ]

This post by inbae was liked by 2 people: go4thewin, xela
Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #75 Posted: Sat Feb 22, 2020 9:51 am 
Lives with ko

Posts: 150
Liked others: 200
Was liked: 30
Rank: 25 kyu
When specifying max 1 search thread in the config file, is the 32 bit version the same strength at equal (low) playouts as the 64 bit version? Thanks!

_________________
set katago to play at your level https://docdro.id/sHZU1ti or experiment with gtp4zen ( https://rb.gy/kx2ilb )

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #76 Posted: Sat Feb 22, 2020 7:34 pm 
Lives in gote

Posts: 591
Location: Adelaide, South Australia
Liked others: 211
Was liked: 266
Rank: Australian 2 dan
GD Posts: 200
inbae wrote:
KataGo vs Leela Zero VLTC benchmark is completed.

Thanks, this is very cool! They're now on my (very long) list of game records I want to study. I probably won't get through all 74 though. Are there any games that stand out as highlights?

Including the winrate/score graphs in the download is a nice touch.

So 1 minute per move gives usually a few hundred thousand playouts. I guess the numbers vary a lot depending on the amount of branching in the tree and the amount of tree reuse between moves.

I notice that the first four moves of each game have the word "book" as a comment. You assigned specific opening positions then got the AIs to play from move 5 onwards? How did you choose the openings?

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #77 Posted: Sun Feb 23, 2020 2:49 am 
Dies in gote

Posts: 25
Liked others: 0
Was liked: 7
KGS: inbae
xela wrote:
Are there any games that stand out as highlights?

I spectated games when the engine were playing each other, and most of them were very interesting to me. Of course there are some silly games - you can detect them by skimming through winrate graphs and checking if there is a sudden change of winrate. Otherwise I suppose all games are worth studying.

xela wrote:
So 1 minute per move gives usually a few hundred thousand playouts. I guess the numbers vary a lot depending on the amount of branching in the tree and the amount of tree reuse between moves.

Of course they do, though tree reuse affects visits, not playouts.

xela wrote:
I notice that the first four moves of each game have the word "book" as a comment. You assigned specific opening positions then got the AIs to play from move 5 onwards? How did you choose the openings?

I'm using an opening book to avoid semi-duplicate games while not weakening engines with randomness. I attach three sets opening books I made:

openings.zip: All combinations of 4-move openings with 4-4, 4-3 and 3-3, unique under symmetry operations.
openings_small.zip: Same as above, but without 3-3.
openings_verysmall.zip: A handcrafted subset of openings_small.zip.

The smallest opening book was used for this benchmark.


Attachments:
File comment: A handcrafted subset of openings_small.zip.
openings_verysmall.zip [6.86 KiB]
Downloaded 343 times
File comment: Same as openings.zip, but without 3-3.
openings_small.zip [12.77 KiB]
Downloaded 332 times
File comment: All combinations of 4-move openings with 4-4, 4-3 and 3-3, unique under symmetry operations.
openings.zip [39.22 KiB]
Downloaded 352 times

This post by inbae was liked by: xela
Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #78 Posted: Sun Feb 23, 2020 4:18 am 
Gosei

Posts: 1733
Location: Earth
Liked others: 621
Was liked: 310
What is "the" flying dagger joseki.

It should be "a" flying dagger joseki in my opinion. One of many, that is.

Nevertheless I have fun with your variant of a flying dagger joseki, although trick moves are not my thing usually. But with this one, if the flying dagger misses its target, it is at least not catastrophic.

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #79 Posted: Sun Feb 23, 2020 4:23 am 
Gosei

Posts: 1733
Location: Earth
Liked others: 621
Was liked: 310
And a pro game, the variation seems to be difficult even for 9d professionals:


Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #80 Posted: Sun Feb 23, 2020 4:30 am 
Gosei

Posts: 1733
Location: Earth
Liked others: 621
Was liked: 310
example variation
Click Here To Show Diagram Code
[go]$$W
$$ ---------------------------------------
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . X . . . . |
$$ | . . . . . X . . . . . . . . . X O . . |
$$ | . . . O . . . . . , . . X O O X O . . |
$$ | . . . . . . . . . . . . O X X O O 8 . |
$$ | . . . . . . . . . . . . . 2 1 X X O . |
$$ | . . . . . . . . . . . . . . 3 4 6 X 9 |
$$ | . . . . . . . . . . . . . . . 5 . 7 . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . , . . . . . , . . . . . , . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . O . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . O . . . . . , . . . . . X . . . |
$$ | . . . . . X . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ ---------------------------------------[/go]

Top
 Profile  
 
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 216 posts ]  Go to page Previous  1, 2, 3, 4, 5, 6, 7 ... 11  Next

All times are UTC - 8 hours [ DST ]


Who is online

Users browsing this forum: Google [Bot] and 1 guest


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Jump to:  
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group