AlphaGo Zero Patterns

sorin · #1

I wrote an article about a few patterns I noticed in AlphaGo Zero's published games: http://361points.com/articles/alphago_zero_patterns/

Uberdude · #2

Nice article, but I don't think not playing the wedge in double approach is a blind spot for humans: I've had a general preference for corner hane for some years now and that's based on my understanding that plenty of human pros (particularly Chinese?) have preferred it since quite some time before AlphaGo existed.

Also 7 in pattern 1 rather than direct capture surprised me. Does it fear Black tenuki if capture so loses a little extra points to be sure of sente? I suppose p14 is also a possible shape followup and it creates more ko threats.

RobertJasiek · #3

I am only surprised by one move:

Click Here To Show Diagram Code: [go]$$B $$ ------------------ $$ . . . . . . . O . | $$ . . . . . . O X . | $$ . . . O O O X O 1 | $$ . . O X O X X X . | $$ . . . X . X . . . | $$ . . . . . X O . . | $$ . . . . . . . . . | $$ . . . . . . . . . | $$ . . . . . . . . . | $$ . . . . . . O . . | $$ . . . . . . . . . |[/go]

At first, I thought that the following provides better eye shape and better endgame:

Click Here To Show Diagram Code: [go]$$B $$ ------------------ $$ . . . . . . . O . | $$ . . . . . . O X 1 | $$ . . . O O O X O . | $$ . . O X O X X X . | $$ . . . X . X . . . | $$ . . . . . X O . . | $$ . . . . . . . . . | $$ . . . . . . . . . | $$ . . . . . . . . . | $$ . . . . . . O . . | $$ . . . . . . . . . |[/go]

But maybe AlphaGo Zero jugdes that this later attack is more relevant:

Click Here To Show Diagram Code: [go]$$B Tenuki 3 $$ ------------------ $$ . . . . . . . O . | $$ . . . . . . O X 1 | $$ . . . O O O X O . | $$ . . O X O X X X 4 | $$ . . . X . X . 2 . | $$ . . . . . X O . . | $$ . . . . . . . . . | $$ . . . . . . . . . | $$ . . . . . . . . . | $$ . . . . . . O . . | $$ . . . . . . . . . |[/go]

EDITS

Uberdude · #4

Robert, for interest, there was a somewhat similar shape in AG Master self-play #38 (see viewtopic.php?p=221574#p221574) and in that case it connected rather than capture on move 33, I think because it didn't like the 2-1 throw-in ko.

sorin · #5

Uberdude wrote:

Nice article, but I don't think not playing the wedge in double approach is a blind spot for humans: I've had a general preference for corner hane for some years now and that's based on my understanding that plenty of human pros (particularly Chinese?) have preferred it since quite some time before AlphaGo existed.

Indeed, the popularity of the wedge vs hane declined in time, but it still appears in pro games even in 2016. AlphaGo seems to a have much stronger opinion about it :-)

Uberdude wrote:

Also 7 in pattern 1 rather than direct capture surprised me. Does it fear Black tenuki if capture so loses a little extra points to be sure of sente? I suppose p14 is also a possible shape followup and it creates more ko threats.

Maybe it likes that it gets some forcing moves from the outside - like P15 - or just in general the fact that black ends up with one extra group (in theory)?

(By the way, moves 5 and 7 are interchangeable - in one of the games it plays at 7 first).

I looked this up in my SGF collection and found only one human game with a similar starting pattern, except that white's approach move at R12 is missing (there is one two spaces down instead); the one "in the corner" is some Inoue Ichiro, while "outside" is Go Seigen, who simply captured the stone like you suggested.

sorin · #6

RobertJasiek wrote:

But maybe AlphaGo Zero jugdes that this later attack is more relevant:

Click Here To Show Diagram Code: [go]$$B Tenuki 3 $$ ------------------ $$ . . . . . . . O . | $$ . . . . . . O X 1 | $$ . . . O O O X O . | $$ . . O X O X X X 4 | $$ . . . X . X . 2 . | $$ . . . . . X O . . | $$ . . . . . . . . . | $$ . . . . . . . . . | $$ . . . . . . . . . | $$ . . . . . . O . . | $$ . . . . . . . . . |[/go]

This last diagram looks very convincing for why the capture is better!

Bill Spight · #7

sorin wrote:

Uberdude wrote:

Nice article, but I don't think not playing the wedge in double approach is a blind spot for humans: I've had a general preference for corner hane for some years now and that's based on my understanding that plenty of human pros (particularly Chinese?) have preferred it since quite some time before AlphaGo existed.

Indeed, the popularity of the wedge vs hane declined in time, but it still appears in pro games even in 2016. AlphaGo seems to a have much stronger opinion about it :-)

I suspect that AlphaGo Zero has a path dependency. That is, even though it trains against a previous version, and ought to correct errors eventually, one version and the next share many assumptions, not all of which are correct.

The main focus of the AlphaGo team is machine learning, not go learning. That is, their aim was to produce a strong go program, not to answer questions about go. It would be quite interesting to see AlphaGo Zero.Two, an independent program which follows its own path. They might well play different styles and favor different plays. My guess is that they would.

sorin · #8

Bill Spight wrote:

I suspect that AlphaGo Zero has a path dependency. That is, even though it trains against a previous version, and ought to correct errors eventually, one version and the next share many assumptions, not all of which are correct.

The main focus of the AlphaGo team is machine learning, not go learning. That is, their aim was to produce a strong go program, not to answer questions about go. It would be quite interesting to see AlphaGo Zero.Two, an independent program which follows its own path. They might well play different styles and favor different plays. My guess is that they would.

That is a fascinating topic, it occurred to me too - the question "would the learning from scratch always converge to the same point/style".

Given that AlphaGo Zero ended up playing pretty similarly to the human style though (favoring after all 3rd and 4th lines mostly, in the beginning of the game) in both of 20-block and 40-block versions, seems to suggest that there is "style convergence" (to some extent).

I would like very much to see the "trajectory" it took during learning; the published paper showed the frequency of a few patterns, but I would like to see in more detail how does the style change between amateur 1-dan and pro 9-dan stages...

moha · #9

sorin wrote:

Bill Spight wrote:

It would be quite interesting to see AlphaGo Zero.Two, an independent program which follows its own path. They might well play different styles and favor different plays. My guess is that they would.

That is a fascinating topic, it occurred to me too - the question "would the learning from scratch always converge to the same point/style".

Given that AlphaGo Zero ended up playing pretty similarly to the human style though (favoring after all 3rd and 4th lines mostly, in the beginning of the game) in both of 20-block and 40-block versions, seems to suggest that there is "style convergence" (to some extent).

I think it's not just coincidence that AG0 ended up at very similar levels than AGM, only a little bit stronger. (And there may be other reasons besides the trivial one: DM only developed it until it was able to reach this goal.)

AlphaGo Zero Patterns

Who is online