Elf teaches us how to play the black hole vs LeelaZero

For discussing go computing, software announcements, etc.
Uberdude
Judan
Posts: 6727
Joined: Thu Nov 24, 2011 11:35 am
Rank: UK 4 dan
GD Posts: 0
KGS: Uberdude 4d
OGS: Uberdude 7d
Location: Cambridge, UK
Has thanked: 436 times
Been thanked: 3718 times

Elf teaches us how to play the black hole vs LeelaZero

Post by Uberdude »

baduk1 continues his interesting bot vs bot experiments with forcing Elf to play the black hole opening vs LeelaZero, Elf gave itself 4% winrate to start:



User avatar
oren
Oza
Posts: 2777
Joined: Sun Apr 18, 2010 5:54 pm
GD Posts: 0
KGS: oren
Tygem: oren740, orenl
IGS: oren
Wbaduk: oren
Location: Seattle, WA
Has thanked: 251 times
Been thanked: 549 times

Re: Elf teaches us how to play the black hole vs LeelaZero

Post by oren »

What was the end result?
Tryss
Lives in gote
Posts: 502
Joined: Tue May 24, 2011 1:07 pm
Rank: KGS 2k
GD Posts: 100
KGS: Tryss
Has thanked: 1 time
Been thanked: 153 times

Re: Elf teaches us how to play the black hole vs LeelaZero

Post by Tryss »

If I'm right, black win by 1.5 points
Baywa
Dies in gote
Posts: 39
Joined: Wed Feb 22, 2017 6:37 am
GD Posts: 0
Has thanked: 40 times
Been thanked: 10 times

Re: Elf teaches us how to play the black hole vs LeelaZero

Post by Baywa »

Black wins by 1.5 points if I count correctly.

Edit: Tryss was quicker-
Last edited by Baywa on Fri Jun 15, 2018 4:27 pm, edited 1 time in total.
Couch Potato - I'm just watchin'!
Bill Spight
Honinbo
Posts: 10905
Joined: Wed Apr 21, 2010 1:24 pm
Has thanked: 3651 times
Been thanked: 3373 times

Re: Elf teaches us how to play the black hole vs LeelaZero

Post by Bill Spight »

#me too. :D
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.
Uberdude
Judan
Posts: 6727
Joined: Thu Nov 24, 2011 11:35 am
Rank: UK 4 dan
GD Posts: 0
KGS: Uberdude 4d
OGS: Uberdude 7d
Location: Cambridge, UK
Has thanked: 436 times
Been thanked: 3718 times

Re: Elf teaches us how to play the black hole vs LeelaZero

Post by Uberdude »

Me too, but I didn't add it to the sgf so as not to spoil the ending. :)

Something I'd like to check here is did LZ fail to see Elf's good moves or see but misevaluate them?
User avatar
ez4u
Oza
Posts: 2414
Joined: Wed Feb 23, 2011 10:15 pm
Rank: Jp 6 dan
GD Posts: 0
KGS: ez4u
Location: Tokyo, Japan
Has thanked: 2351 times
Been thanked: 1332 times

Re: Elf teaches us how to play the black hole vs LeelaZero

Post by ez4u »

The question is what happens when you switch colors? Or else give LeelaZero 16k visits? :rambo:
Dave Sigaty
"Short-lived are both the praiser and the praised, and rememberer and the remembered..."
- Marcus Aurelius; Meditations, VIII 21
Uberdude
Judan
Posts: 6727
Joined: Thu Nov 24, 2011 11:35 am
Rank: UK 4 dan
GD Posts: 0
KGS: Uberdude 4d
OGS: Uberdude 7d
Location: Cambridge, UK
Has thanked: 436 times
Been thanked: 3718 times

Re: Elf teaches us how to play the black hole vs LeelaZero

Post by Uberdude »

Uberdude wrote:Something I'd like to check here is did LZ fail to see Elf's good moves or see but misevaluate them?


So the first move of Elf's that stood out to me as a strong move was q7. Leela #145 doesn't consider this at all until 1k nodes (the 2 extends are main choices), around 2k it notices it is strong and examines it more and is #1 by 3k with a 27.4% winrate. It gave the inside hane a few moves prior 25.2%. I then updated to the latest #149 network and it found q7 within 300 playouts and was #1 in under 1k.
Uberdude
Judan
Posts: 6727
Joined: Thu Nov 24, 2011 11:35 am
Rank: UK 4 dan
GD Posts: 0
KGS: Uberdude 4d
OGS: Uberdude 7d
Location: Cambridge, UK
Has thanked: 436 times
Been thanked: 3718 times

Re: Elf teaches us how to play the black hole vs LeelaZero

Post by Uberdude »

Next nice move of Elf, this cutting across knight's move in response to white j16. It's the natural shape weakness so I think plenty of strong humans could find it, but the timing is smart. Letting LeelaZero #150 analyse for ages she considers k15 practically the only move with 120k playouts and 28.4%, e17 gets 5 (not 5k). Once played LZ quickly realises it's a good move, black win up to 35.3%, a 7% swing which is pretty huge for 1 move. A severe blindspot, looks like LZ needs to be more exploratory. The continuation is as expected.

Click Here To Show Diagram Code
[go]$$Wcm54
$$ ---------------------------------------
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . X O . . . |
$$ | . . . O 2 . O . X . . . . . X . . . . |
$$ | . . O , . O . O 1 X . . . . . , O . . |
$$ | . . X X . X X . . . . . . . . X O . . |
$$ | . . . . . . . . . . . . . . . X . . . |
$$ | . . . . . . . . . . . . . . X . O . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . X . . |
$$ | . . O , . . . . . , . . . . . , O . . |
$$ | . . . . . . . . . . . . . . . . . O . |
$$ | . . . . . . . . . . . . . . . . O . O |
$$ | . . O . X . . . . . . . . . . X X O . |
$$ | . . . . . . . . . . . . . . . X . X . |
$$ | . . . . . . . . . . . . X . . X . X . |
$$ | . . O , . . . . . , . . . . X O X . . |
$$ | . . . . X . . . O . O . . O O O X . . |
$$ | . . . . . . . . . . . . . . O X X . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ ---------------------------------------[/go]
Bill Spight
Honinbo
Posts: 10905
Joined: Wed Apr 21, 2010 1:24 pm
Has thanked: 3651 times
Been thanked: 3373 times

Re: Elf teaches us how to play the black hole vs LeelaZero

Post by Bill Spight »

Uberdude wrote:Next nice move of Elf, this cutting across knight's move in response to white j16. It's the natural shape weakness so I think plenty of strong humans could find it, but the timing is smart. Letting LeelaZero #150 analyse for ages she considers k15 practically the only move with 120k playouts and 28.4%, e17 gets 5 (not 5k). Once played LZ quickly realises it's a good move, black win up to 35.3%, a 7% swing which is pretty huge for 1 move. A severe blindspot, looks like LZ needs to be more exploratory.

(Emphasis mine.)

Is that the result of self-play? Or is it just one of those things that we can expect and have to put up with? OC, no player is going to be perfect, but it seems like the current Zero bots have deep, if contained, weaknesses that they might not have if they had trained against a variety of opponents.
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.
moha
Lives in gote
Posts: 311
Joined: Wed May 31, 2017 6:49 am
Rank: 2d
GD Posts: 0
Been thanked: 45 times

Re: Elf teaches us how to play the black hole vs LeelaZero

Post by moha »

I think such blind spots are a tricky problem also involving the search code of the bot. Even in training it will only learn moves that its search can find within the low selfplay visit limit (in each position it is trained towards search results). So if there is a blind spot in the net, that can only get fixed if the bot doesn't rely too much on its policy during selfplay search - otherwise it will just keep reinforcing the oversight.

It seems this problem affects bots to different extent though, with LZ search being one of the most rigid / least exploratory. But it also uses ELF net for some selfplay games now, so some blind spots should start to close (ELF based search can see different moves).
Uberdude
Judan
Posts: 6727
Joined: Thu Nov 24, 2011 11:35 am
Rank: UK 4 dan
GD Posts: 0
KGS: Uberdude 4d
OGS: Uberdude 7d
Location: Cambridge, UK
Has thanked: 436 times
Been thanked: 3718 times

Re: Elf teaches us how to play the black hole vs LeelaZero

Post by Uberdude »

A relevant game I came across today, Alexander Dinerstein 3p shows us how not to use the black hole opening against the much stronger Mi Yuting 9p:

jokkebk
Dies in gote
Posts: 44
Joined: Tue Feb 01, 2011 4:47 am
Rank: EGF 1 kyu
GD Posts: 0
KGS: finity
Has thanked: 2 times
Been thanked: 14 times

Re: Elf teaches us how to play the black hole vs LeelaZero

Post by jokkebk »

Alexander is a great teacher to us all. I hope we'll see many more educational games in the future!
Bill Spight
Honinbo
Posts: 10905
Joined: Wed Apr 21, 2010 1:24 pm
Has thanked: 3651 times
Been thanked: 3373 times

Re: Elf teaches us how to play the black hole vs LeelaZero

Post by Bill Spight »

moha wrote:I think such blind spots are a tricky problem also involving the search code of the bot. Even in training it will only learn moves that its search can find within the low selfplay visit limit (in each position it is trained towards search results). So if there is a blind spot in the net, that can only get fixed if the bot doesn't rely too much on its policy during selfplay search - otherwise it will just keep reinforcing the oversight.


IIUC, Monte Carlo bots are guaranteed to find the correct play in infinite time. I would be surprised if the Zero bots don't meet the same criterion. OC, that does not mean that they cannot develop blind spots that will last for millennia. :lol:
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.
moha
Lives in gote
Posts: 311
Joined: Wed May 31, 2017 6:49 am
Rank: 2d
GD Posts: 0
Been thanked: 45 times

Re: Elf teaches us how to play the black hole vs LeelaZero

Post by moha »

Bill Spight wrote:
moha wrote:I think such blind spots are a tricky problem also involving the search code of the bot. Even in training it will only learn moves that its search can find within the low selfplay visit limit (in each position it is trained towards search results). So if there is a blind spot in the net, that can only get fixed if the bot doesn't rely too much on its policy during selfplay search - otherwise it will just keep reinforcing the oversight.
IIUC, Monte Carlo bots are guaranteed to find the correct play in infinite time. I would be surprised if the Zero bots don't meet the same criterion. OC, that does not mean that they cannot develop blind spots that will last for millennia. :lol:
This convergence you mention means the bot will find the best move if it is run in a given position for infinity (though even this only holds if all moves are guaranteed to get infinite further visits, which may not always be the case with NN-based pruning). But there is no infinite search during selfplay, and the training problem I wrote above may not show the same convergence.

In a given position 2-3k visits may give the same results as the current raw policy net (which can be an oversight, and, say, 100k visits could find a better move - but this will never happen). In this case no learning takes place - the network is just just trained towards its current output, reinforcing it. So the blind spot could only be fixed if the bot comes across a position where it manifests AND the visit limit is enough for the search to find the correct move (or at least slightly more correct evaluations) EVEN if started from the wrong policy distribution. This latter part, the level of exploration is where various bots differ: some only look at moves that have decent policy weights, some spend a few visits on less promising moves as well.
Post Reply