AI RYUSEI Tournament 2017, December 9-10

For discussing go computing, software announcements, etc.

And the winner is...

Poll ended at Sat Dec 09, 2017 2:42 am

DeepZenGo
1
33%
FineArt
2
67%
Others
0
No votes
 
Total votes: 3

pookpooi
Lives in sente
Posts: 727
Joined: Sat Aug 21, 2010 12:26 pm
GD Posts: 10
Has thanked: 44 times
Been thanked: 218 times

Re: AI RYUSEI Tournament 2017, December 9-10

Post by pookpooi »

johnsmith wrote:Do you know what are time settings
http://www.igoshogi.net/ai_ryusei/01/en/rules.html
pookpooi
Lives in sente
Posts: 727
Joined: Sat Aug 21, 2010 12:26 pm
GD Posts: 10
Has thanked: 44 times
Been thanked: 218 times

Re: AI RYUSEI Tournament 2017, December 9-10

Post by pookpooi »

As expected (no pun intended), DeepZen meet FineArt in the final. This is basically Alien vs. Predator scenario because they demolish other programs with ease. Good thing FineArt blundered yesterday (Tianrang also) so the programmer can fix, today there's no room for any error anymore. FineArt is the favorite to win but can DeepZen pull an upset just like it won at World AI Go Championship in China?

Update: FineArt has very high chance to win, according to DeepZenGo. But no matter who wins, we lose
Image
pookpooi
Lives in sente
Posts: 727
Joined: Sat Aug 21, 2010 12:26 pm
GD Posts: 10
Has thanked: 44 times
Been thanked: 218 times

Re: AI RYUSEI Tournament 2017, December 9-10

Post by pookpooi »

All second game records are here http://52.198.104.180/secondday.html
User avatar
EdLee
Honinbo
Posts: 8859
Joined: Sat Apr 24, 2010 6:49 pm
GD Posts: 312
Location: Santa Barbara, CA
Has thanked: 349 times
Been thanked: 2070 times

Post by EdLee »

Does anyone happen to know how the developers in Japan (e.g. DeepZen) and outside of Japan (e.g. AlphaZero, FineArt, etc.) feel about implementing area scoring (e.g. AGA's) versus territory scoring ?

( How does Leela do scoring? )
User avatar
Waylon
Dies in gote
Posts: 24
Joined: Sat May 14, 2016 1:30 am
GD Posts: 0
Location: Vienna, Austria
Has thanked: 686 times
Been thanked: 11 times

Re:

Post by Waylon »

EdLee wrote:Does anyone happen to know how the developers in Japan (e.g. DeepZen) and outside of Japan (e.g. AlphaZero, FineArt, etc.) feel about implementing area scoring (e.g. AGA's) versus territory scoring ?

( How does Leela do scoring? )
It seems that Leela uses area scoring.

I used SmartGo to create the four attached game files. One uses Japanese rules and komi 6.5, the other three are with 7.5 komi and Chinese, AGA territory and AGA area rules.

SmartGo's scoring function gives the expected results:
Jap = B+0.5, Chi = B+1.5, AGAter = B+1.5, AGAarea = B+1.5

Leela scores differently only with Japanese rules:
Jap = B+2.5, Chi = B+1.5, AGAter = B+1.5, AGAarea = B+1.5

Zenith Go 7 seems to use always territory scoring, regardless of the RU parameter in the sgf file:
Jap = B+0.5, Chi = W+0.5, AGAter = W+0.5, AGAarea = W+0.5

Crazy Stone Deep Learning gives the correct score for Japanese and Chinese rules, but seems to be wrong with AGA rules:
Jap = B+0.5, Chi = B+1.5, AGAter = W+0.5, AGAarea = W+0.5

(It seems I can only attach three files. You can edit the RU parameter in the sgf file with a text editor)
Attachments
ScoreTestAGAter.sgf
(242 Bytes) Downloaded 655 times
ScoreTestChin.sgf
(228 Bytes) Downloaded 668 times
ScoreTestJap.sgf
(231 Bytes) Downloaded 647 times
moha
Lives in gote
Posts: 311
Joined: Wed May 31, 2017 6:49 am
Rank: 2d
GD Posts: 0
Been thanked: 45 times

Re: Re:

Post by moha »

EdLee wrote:how the developers in Japan (e.g. DeepZen) and outside of Japan (e.g. AlphaZero, FineArt, etc.) feel about implementing area scoring (e.g. AGA's) versus territory scoring ?
AFAIK most programs use area scoring internally, and I recall Zen authors mentioning that for Japanese rules they set an internal komi to ensure that Zen targets a board score that wins in all cases. Still there were some problems with this IIRC, cases where Zen played the usual nonsense endgame, throwing away points thinking it could still win by 0.5, where in fact it lost. Don't remember what went wrong though (OC this approach won't work with one-sided dame for example, but that wasn't the problem in practice).
Waylon wrote:I used SmartGo to create the four attached game files. One uses Japanese rules and komi 6.5, the other three are with 7.5 komi and Chinese, AGA territory and AGA area rules.
The number of B and W stones differs here without indicating captures, this further complicates things.
Uberdude
Judan
Posts: 6727
Joined: Thu Nov 24, 2011 11:35 am
Rank: UK 4 dan
GD Posts: 0
KGS: Uberdude 4d
OGS: Uberdude 7d
Location: Cambridge, UK
Has thanked: 436 times
Been thanked: 3718 times

Re: AI RYUSEI Tournament 2017, December 9-10

Post by Uberdude »

I presumed Ed's question was about how do the various go playing engines score the board internally as part of their algorithms for deciding what move to play, and I believe the answer for pretty much every bot is area scoring as that's easier and allows them to fill in their own territory at no cost. The user interfaces associated with various bots implement territory scoring because that's what their human users like.

I seem to recall when DeepZen messed up in the WGC that was blamed on its value network being trained on Chinese/area counting/komi but the tournament being Japanese so it gave away too much. Also didn't the AlphaGi Zero paper say they used tromp taylor for counting.

Edit. Yup. From p22.
2. AlphaGo Zero uses Tromp-Taylor scoring during MCTS simulations and self-play training. This is because human scores (Chinese, Japanese or Korean rules) are not well-defined if the game terminates before territorial boundaries are resolved. However, all tournament and evaluation games were scored using Chinese rules.
Bill Spight
Honinbo
Posts: 10905
Joined: Wed Apr 21, 2010 1:24 pm
Has thanked: 3651 times
Been thanked: 3373 times

Re: AI RYUSEI Tournament 2017, December 9-10

Post by Bill Spight »

Uberdude wrote:I presumed Ed's question was about how do the various go playing engines score the board internally as part of their algorithms for deciding what move to play, and I believe the answer for pretty much every bot is area scoring as that's easier and allows them to fill in their own territory at no cost. The user interfaces associated with various bots implement territory scoring because that's what their human users like.

I seem to recall when DeepZen messed up in the WGC that was blamed on its value network being trained on Chinese/area counting/komi but the tournament being Japanese so it gave away too much. Also didn't the AlphaGi Zero paper say they used tromp taylor for counting.

Edit. Yup. From p22.
2. AlphaGo Zero uses Tromp-Taylor scoring during MCTS simulations and self-play training. This is because human scores (Chinese, Japanese or Korean rules) are not well-defined if the game terminates before territorial boundaries are resolved. However, all tournament and evaluation games were scored using Chinese rules.
Using Tromp-Taylor scoring, which counts only empty points surrounded by one color as territory, might explain a persistent bias towards taking territory fairly early. If the game ends before territorial boundaries are resolved, which will often happen in the early stages of the development of the program, empty points in areas that are not completely surrounded count for naught. Proximity scoring, OTOH, counts empty points according to the color of the closest stone. The score for games that end prematurely then rewards influence over territory. Using proximity scoring might produce a persistent bias towards making influence early. :)
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.
User avatar
EdLee
Honinbo
Posts: 8859
Joined: Sat Apr 24, 2010 6:49 pm
GD Posts: 312
Location: Santa Barbara, CA
Has thanked: 349 times
Been thanked: 2070 times

Post by EdLee »

Actually, the operative word is feel:

How do the developers in and out of Japan feel about the different rules sets ?

The question is about the humans, not the algorithms. :)
pookpooi
Lives in sente
Posts: 727
Joined: Sat Aug 21, 2010 12:26 pm
GD Posts: 10
Has thanked: 44 times
Been thanked: 218 times

Re: AI RYUSEI Tournament 2017, December 9-10

Post by pookpooi »

Uberdude wrote:So for which bot was 'define' a test account? :)
Define is Tianrang?????

https://twitter.com/ohashihirofumi/stat ... 0502737920
Uberdude
Judan
Posts: 6727
Joined: Thu Nov 24, 2011 11:35 am
Rank: UK 4 dan
GD Posts: 0
KGS: Uberdude 4d
OGS: Uberdude 7d
Location: Cambridge, UK
Has thanked: 436 times
Been thanked: 3718 times

Re: AI RYUSEI Tournament 2017, December 9-10

Post by Uberdude »

pookpooi wrote:Define is Tianrang?????
Well, it certainly liked the 3-3!

Post Reply