It is currently Thu Apr 25, 2024 6:25 pm

All times are UTC - 8 hours [ DST ]




Post new topic Reply to topic  [ 9 posts ] 
Author Message
Offline
 Post subject: AlphaGo Zero Patterns
Post #1 Posted: Mon Oct 23, 2017 10:44 pm 
Lives in gote

Posts: 388
Liked others: 416
Was liked: 198
I wrote an article about a few patterns I noticed in AlphaGo Zero's published games: http://361points.com/articles/alphago_zero_patterns/


This post by sorin was liked by 2 people: Calvin Clark, Kirby
Top
 Profile  
 
Offline
 Post subject: Re: AlphaGo Zero Patterns
Post #2 Posted: Mon Oct 23, 2017 11:01 pm 
Judan

Posts: 6725
Location: Cambridge, UK
Liked others: 436
Was liked: 3719
Rank: UK 4 dan
KGS: Uberdude 4d
OGS: Uberdude 7d
Nice article, but I don't think not playing the wedge in double approach is a blind spot for humans: I've had a general preference for corner hane for some years now and that's based on my understanding that plenty of human pros (particularly Chinese?) have preferred it since quite some time before AlphaGo existed.

Also 7 in pattern 1 rather than direct capture surprised me. Does it fear Black tenuki if capture so loses a little extra points to be sure of sente? I suppose p14 is also a possible shape followup and it creates more ko threats.


This post by Uberdude was liked by: sorin
Top
 Profile  
 
Offline
 Post subject: Re: AlphaGo Zero Patterns
Post #3 Posted: Tue Oct 24, 2017 12:40 am 
Judan

Posts: 6162
Liked others: 0
Was liked: 789
I am only surprised by one move:

Click Here To Show Diagram Code
[go]$$B
$$ ------------------
$$ . . . . . . . O . |
$$ . . . . . . O X . |
$$ . . . O O O X O 1 |
$$ . . O X O X X X . |
$$ . . . X . X . . . |
$$ . . . . . X O . . |
$$ . . . . . . . . . |
$$ . . . . . . . . . |
$$ . . . . . . . . . |
$$ . . . . . . O . . |
$$ . . . . . . . . . |[/go]


At first, I thought that the following provides better eye shape and better endgame:

Click Here To Show Diagram Code
[go]$$B
$$ ------------------
$$ . . . . . . . O . |
$$ . . . . . . O X 1 |
$$ . . . O O O X O . |
$$ . . O X O X X X . |
$$ . . . X . X . . . |
$$ . . . . . X O . . |
$$ . . . . . . . . . |
$$ . . . . . . . . . |
$$ . . . . . . . . . |
$$ . . . . . . O . . |
$$ . . . . . . . . . |[/go]


But maybe AlphaGo Zero jugdes that this later attack is more relevant:

Click Here To Show Diagram Code
[go]$$B Tenuki 3
$$ ------------------
$$ . . . . . . . O . |
$$ . . . . . . O X 1 |
$$ . . . O O O X O . |
$$ . . O X O X X X 4 |
$$ . . . X . X . 2 . |
$$ . . . . . X O . . |
$$ . . . . . . . . . |
$$ . . . . . . . . . |
$$ . . . . . . . . . |
$$ . . . . . . O . . |
$$ . . . . . . . . . |[/go]


EDITS


This post by RobertJasiek was liked by: sorin
Top
 Profile  
 
Offline
 Post subject: Re: AlphaGo Zero Patterns
Post #4 Posted: Tue Oct 24, 2017 2:29 am 
Judan

Posts: 6725
Location: Cambridge, UK
Liked others: 436
Was liked: 3719
Rank: UK 4 dan
KGS: Uberdude 4d
OGS: Uberdude 7d
Robert, for interest, there was a somewhat similar shape in AG Master self-play #38 (see viewtopic.php?p=221574#p221574) and in that case it connected rather than capture on move 33, I think because it didn't like the 2-1 throw-in ko.

Top
 Profile  
 
Offline
 Post subject: Re: AlphaGo Zero Patterns
Post #5 Posted: Tue Oct 24, 2017 7:11 pm 
Lives in gote

Posts: 388
Liked others: 416
Was liked: 198
Uberdude wrote:
Nice article, but I don't think not playing the wedge in double approach is a blind spot for humans: I've had a general preference for corner hane for some years now and that's based on my understanding that plenty of human pros (particularly Chinese?) have preferred it since quite some time before AlphaGo existed.


Indeed, the popularity of the wedge vs hane declined in time, but it still appears in pro games even in 2016. AlphaGo seems to a have much stronger opinion about it :-)


Uberdude wrote:
Also 7 in pattern 1 rather than direct capture surprised me. Does it fear Black tenuki if capture so loses a little extra points to be sure of sente? I suppose p14 is also a possible shape followup and it creates more ko threats.


Maybe it likes that it gets some forcing moves from the outside - like P15 - or just in general the fact that black ends up with one extra group (in theory)?

(By the way, moves 5 and 7 are interchangeable - in one of the games it plays at 7 first).

I looked this up in my SGF collection and found only one human game with a similar starting pattern, except that white's approach move at R12 is missing (there is one two spaces down instead); the one "in the corner" is some Inoue Ichiro, while "outside" is Go Seigen, who simply captured the stone like you suggested.

Top
 Profile  
 
Offline
 Post subject: Re: AlphaGo Zero Patterns
Post #6 Posted: Tue Oct 24, 2017 7:13 pm 
Lives in gote

Posts: 388
Liked others: 416
Was liked: 198
RobertJasiek wrote:
But maybe AlphaGo Zero jugdes that this later attack is more relevant:

Click Here To Show Diagram Code
[go]$$B Tenuki 3
$$ ------------------
$$ . . . . . . . O . |
$$ . . . . . . O X 1 |
$$ . . . O O O X O . |
$$ . . O X O X X X 4 |
$$ . . . X . X . 2 . |
$$ . . . . . X O . . |
$$ . . . . . . . . . |
$$ . . . . . . . . . |
$$ . . . . . . . . . |
$$ . . . . . . O . . |
$$ . . . . . . . . . |[/go]




This last diagram looks very convincing for why the capture is better!

Top
 Profile  
 
Offline
 Post subject: Re: AlphaGo Zero Patterns
Post #7 Posted: Tue Oct 24, 2017 7:35 pm 
Honinbo

Posts: 10905
Liked others: 3651
Was liked: 3374
sorin wrote:
Uberdude wrote:
Nice article, but I don't think not playing the wedge in double approach is a blind spot for humans: I've had a general preference for corner hane for some years now and that's based on my understanding that plenty of human pros (particularly Chinese?) have preferred it since quite some time before AlphaGo existed.


Indeed, the popularity of the wedge vs hane declined in time, but it still appears in pro games even in 2016. AlphaGo seems to a have much stronger opinion about it :-)


I suspect that AlphaGo Zero has a path dependency. That is, even though it trains against a previous version, and ought to correct errors eventually, one version and the next share many assumptions, not all of which are correct.

The main focus of the AlphaGo team is machine learning, not go learning. That is, their aim was to produce a strong go program, not to answer questions about go. It would be quite interesting to see AlphaGo Zero.Two, an independent program which follows its own path. They might well play different styles and favor different plays. My guess is that they would. :)

_________________
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.


This post by Bill Spight was liked by: sorin
Top
 Profile  
 
Offline
 Post subject: Re: AlphaGo Zero Patterns
Post #8 Posted: Tue Oct 24, 2017 9:25 pm 
Lives in gote

Posts: 388
Liked others: 416
Was liked: 198
Bill Spight wrote:
I suspect that AlphaGo Zero has a path dependency. That is, even though it trains against a previous version, and ought to correct errors eventually, one version and the next share many assumptions, not all of which are correct.

The main focus of the AlphaGo team is machine learning, not go learning. That is, their aim was to produce a strong go program, not to answer questions about go. It would be quite interesting to see AlphaGo Zero.Two, an independent program which follows its own path. They might well play different styles and favor different plays. My guess is that they would. :)


That is a fascinating topic, it occurred to me too - the question "would the learning from scratch always converge to the same point/style".

Given that AlphaGo Zero ended up playing pretty similarly to the human style though (favoring after all 3rd and 4th lines mostly, in the beginning of the game) in both of 20-block and 40-block versions, seems to suggest that there is "style convergence" (to some extent).

I would like very much to see the "trajectory" it took during learning; the published paper showed the frequency of a few patterns, but I would like to see in more detail how does the style change between amateur 1-dan and pro 9-dan stages...

Top
 Profile  
 
Offline
 Post subject: Re: AlphaGo Zero Patterns
Post #9 Posted: Wed Oct 25, 2017 6:10 am 
Lives in gote

Posts: 311
Liked others: 0
Was liked: 45
Rank: 2d
sorin wrote:
Bill Spight wrote:
It would be quite interesting to see AlphaGo Zero.Two, an independent program which follows its own path. They might well play different styles and favor different plays. My guess is that they would. :)
That is a fascinating topic, it occurred to me too - the question "would the learning from scratch always converge to the same point/style".

Given that AlphaGo Zero ended up playing pretty similarly to the human style though (favoring after all 3rd and 4th lines mostly, in the beginning of the game) in both of 20-block and 40-block versions, seems to suggest that there is "style convergence" (to some extent).
I think it's not just coincidence that AG0 ended up at very similar levels than AGM, only a little bit stronger. (And there may be other reasons besides the trivial one: DM only developed it until it was able to reach this goal.)

Top
 Profile  
 
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 9 posts ] 

All times are UTC - 8 hours [ DST ]


Who is online

Users browsing this forum: No registered users and 1 guest


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Jump to:  
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group