It is currently Wed Apr 24, 2024 9:42 pm

All times are UTC - 8 hours [ DST ]




Post new topic Reply to topic  [ 36 posts ]  Go to page Previous  1, 2
Author Message
Offline
 Post subject: Re: Leela Zero Stuck
Post #21 Posted: Tue Jan 09, 2018 5:43 pm 
Honinbo

Posts: 10905
Liked others: 3651
Was liked: 3374
Uberdude wrote:
LeelaZero having ladder troubles against a human on OGS
moha wrote:
I would expect the network to realize it's own vision range problem (from selfplay), and preemptively avoid unclear ladders. It's interesting this does not happen.
Bill Spight wrote:
Perhaps mutual blindness? Neither version of LeelaZero whether the ladder works, and so it is never played out, and LeelaZero never learns. (Or it will take a long time.) As for avoiding unclear ladders, not much is clear, is it? Especially in the opening. ;)
There is a point in playing out unread ladders though, for randomization. So during selfplay the side behind may play it.


How often will the next step of a ladder occur with quasi-random play?

_________________
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.

Top
 Profile  
 
Offline
 Post subject: Re: Leela Zero Stuck
Post #22 Posted: Tue Jan 09, 2018 5:59 pm 
Dies in gote

Posts: 34
Liked others: 0
Was liked: 3
Rank: some SDK
Universal go server handle: kwhyte
I think the net will first learn to trust that all unclear ladders work. It is a much worse mistake to run out of a long ladder then eventually works than to chase stones in a long ladder that eventually fails. It is true that the next step of a ladder is always likely to be somewhat likely in quasi-randomized play since the net learns quickly that you should consider all ataris, so playing the ladder out for a few steps should still happen sometimes, but for ladders long enough to get the net into trouble that's a lot of coincidence (and all the random trials where it plays the ladder out a third of the way across the board and then plays away are going to end badly - which will it learn faster, don't play unclear ladders or don't stop playing a ladder that's already gone a few steps?).

Top
 Profile  
 
Offline
 Post subject: Re: Leela Zero Stuck
Post #23 Posted: Tue Jan 09, 2018 6:48 pm 
Lives in gote

Posts: 311
Liked others: 0
Was liked: 45
Rank: 2d
Bill Spight wrote:
moha wrote:
There is a point in playing out unread ladders though, for randomization. So during selfplay the side behind may play it.
How often will the next step of a ladder occur with quasi-random play?
Why quasi-random play?

Top
 Profile  
 
Offline
 Post subject: Re: Leela Zero Stuck
Post #24 Posted: Tue Jan 09, 2018 6:51 pm 
Honinbo

Posts: 10905
Liked others: 3651
Was liked: 3374
With probabilistic reasoning, perhaps there is a threshold, partly dependent upon circumstances, where the ladder is long enough for both sides to play out, because losing the long ladder will lose the game. Playing it out is the only chance to win.

_________________
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.

Top
 Profile  
 
Offline
 Post subject: Re: Leela Zero Stuck
Post #25 Posted: Tue Jan 09, 2018 9:06 pm 
Honinbo

Posts: 10905
Liked others: 3651
Was liked: 3374
moha wrote:
Bill Spight wrote:
moha wrote:
There is a point in playing out unread ladders though, for randomization. So during selfplay the side behind may play it.
How often will the next step of a ladder occur with quasi-random play?
Why quasi-random play?


Because I did not think that the choice of plays was completely random. :-|

_________________
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.

Top
 Profile  
 
Offline
 Post subject: Re: Leela Zero Stuck
Post #26 Posted: Wed Jan 10, 2018 3:53 am 
Judan

Posts: 6725
Location: Cambridge, UK
Liked others: 436
Was liked: 3719
Rank: UK 4 dan
KGS: Uberdude 4d
OGS: Uberdude 7d
Is Bill's point that the quasi-random play of monte carlo rollouts will not be very good at playing ladders so it won't learn them well, but moha's counter point that Leela Zero doesn't use monte carlo rollouts?

Top
 Profile  
 
Offline
 Post subject: Re: Leela Zero Stuck
Post #27 Posted: Wed Jan 10, 2018 5:54 am 
Lives in gote

Posts: 311
Liked others: 0
Was liked: 45
Rank: 2d
Uberdude wrote:
Is Bill's point that the quasi-random play of monte carlo rollouts will not be very good at playing ladders so it won't learn them well, but moha's counter point that Leela Zero doesn't use monte carlo rollouts?
I actually have trouble following the logic myself. :)

My original idea was that - as seen with AlphaGo vs. humans - a bot can understand something like "danger, unclear". And avoid it (even at some cost) when ahead but seek it when behind. With a smallnet Zero selfplay, the bot has no reason to fear being exploited (since the opponent cannot see/read the ladder either), so what remains is "random" probability (e.g. ladders are a bit more likely to work on empty boards than later stages) with huge variance on game result.

I guess Bill may hint at difficulties for the early network in collecting experience at all, thus reaching this point? At first stages with close to random play, ladders are not played. Deepmind also noted that - unlike humans - a bot learns about ladders relatively late.

Top
 Profile  
 
Offline
 Post subject: Re: Leela Zero Stuck
Post #28 Posted: Wed Jan 10, 2018 6:46 am 
Honinbo

Posts: 10905
Liked others: 3651
Was liked: 3374
Uberdude wrote:
Is Bill's point that the quasi-random play of monte carlo rollouts will not be very good at playing ladders so it won't learn them well, but moha's counter point that Leela Zero doesn't use monte carlo rollouts?


No, my point is that if you have two players who share a blindness, it is difficult for them to learn from each other to overcome that blindness. It has nothing to do with AI per se. Humans have the same problem.

It was moha who brought up randomness in the play, and I thought that it was not pure randomness, hence "quasi-random".

_________________
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.


Last edited by Bill Spight on Wed Jan 10, 2018 7:05 am, edited 1 time in total.
Top
 Profile  
 
Offline
 Post subject: Re: Leela Zero Stuck
Post #29 Posted: Wed Jan 10, 2018 6:52 am 
Honinbo

Posts: 10905
Liked others: 3651
Was liked: 3374
moha wrote:
Deepmind also noted that - unlike humans - a bot learns about ladders relatively late.


Right. Even human beginners can apply logic. (Ladders are logical.) Could bots learn ladders using logic? Of course. But not the bots that are currently in vogue.

_________________
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.

Top
 Profile  
 
Offline
 Post subject: Re: Leela Zero Stuck
Post #30 Posted: Tue Jan 16, 2018 4:37 am 
Judan

Posts: 6725
Location: Cambridge, UK
Liked others: 436
Was liked: 3719
Rank: UK 4 dan
KGS: Uberdude 4d
OGS: Uberdude 7d
Something I found interesting to ponder about: Leela Zero is about 1 dan strength now I believe (+/- a few stones) and it likes to play early 3-3 invasions just like AlphaGo (both regular and Zero version) does. It's not very strong yet so it's not some respected oracle like AG that we would emulate, but does this mean it has also discovered some objective truth that early 3-3s are good (which is basically what we have surmised from AG), or is this more like some bias or quirk of how machine learning Go programs work and they get stuck playing them. AG never outgrew this trait and got really strong, but is it really strong because of or despite these 3-3 invasions?

Top
 Profile  
 
Offline
 Post subject: Re: Leela Zero Stuck
Post #31 Posted: Tue Jan 16, 2018 10:50 am 
Dies in gote

Posts: 52
Liked others: 2
Was liked: 12
I've also been wondering about the early 3-3 playing style. One thing that is on my mind, is the settings under which the network is trained.

LeelaZ/AlphaGo plays every training game with only 1600 playouts, added noise, and extra randomization in the opening.

The value which the network puts to each move, is based on games under these settings. If an opening is 47% winrate for black, it is because to the network, it looks like something where it would win 47% of the time as black under the above settings.

So you can ask - what kind of strategy can you develop when you and your opponent have limited reading ability, and you're pseudo drunk? If killing groups inside a moyo is harder in general and requires precise reading, then taking guaranteed territory sounds pretty tempting, but that's of course just speculation :)

So to expand upon the question of whether or not these moves are objectively good for any skill level, we can also ask if they're objectively good under any time setting/reading ability.

Would a network trained with 100.000 playouts in its training games play differently?


This post by Yakago was liked by: Waylon
Top
 Profile  
 
Offline
 Post subject: Re: Leela Zero Stuck
Post #32 Posted: Tue Jan 16, 2018 12:59 pm 
Honinbo

Posts: 10905
Liked others: 3651
Was liked: 3374
Yakago wrote:
I've also been wondering about the early 3-3 playing style. One thing that is on my mind, is the settings under which the network is trained.

LeelaZ/AlphaGo plays every training game with only 1600 playouts, added noise, and extra randomization in the opening.

The value which the network puts to each move, is based on games under these settings. If an opening is 47% winrate for black, it is because to the network, it looks like something where it would win 47% of the time as black under the above settings.

So you can ask - what kind of strategy can you develop when you and your opponent have limited reading ability, and you're pseudo drunk? If killing groups inside a moyo is harder in general and requires precise reading, then taking guaranteed territory sounds pretty tempting, but that's of course just speculation :)

So to expand upon the question of whether or not these moves are objectively good for any skill level, we can also ask if they're objectively good under any time setting/reading ability.

Would a network trained with 100.000 playouts in its training games play differently?


A quick note. The evaluation in terms of percentage of wins assumes errors in play. Otherwise the evaluation will be 100% or 0%. ;) We cannot therefore identify objectively good or bad plays.

_________________
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.

Top
 Profile  
 
Offline
 Post subject: Re: Leela Zero Stuck
Post #33 Posted: Tue Jan 16, 2018 7:17 pm 
Tengen

Posts: 4380
Location: North Carolina
Liked others: 499
Was liked: 733
Rank: AGA 3k
GD Posts: 65
OGS: Hyperpape 4k
Bill Spight wrote:
A quick note. The evaluation in terms of percentage of wins assumes errors in play. Otherwise the evaluation will be 100% or 0%. ;) We cannot therefore identify objectively good or bad plays.
This is good pedantry, but I think we can improve it ;).

The evaluation in terms of percentage of wins is the product of a process which incorporates errors in play. Otherwise the evaluation will be 100% or 0%. ;) We cannot therefore identify objectively good or bad plays without some probability of error (and we do not know how big that error is).

Big picture: we don't know how far AlphaGo was from perfect play, so at some level, we can cast doubt on all of its evaluations. It's possible that the early 3-3 points are just a bias that were never exposed because humans aren't strong enough to take advantage of them. However, if we want to assess whether a move is objectively good, in most cases our best shot will be the AlphaGo oracle—for now.

_________________
Occupy Babel!

Top
 Profile  
 
Offline
 Post subject: Re: Leela Zero Stuck
Post #34 Posted: Wed Jan 17, 2018 1:28 am 
Lives in gote

Posts: 340
Location: Spain
Liked others: 181
Was liked: 41
Rank: Low
Also: there is no reason to assume AlphaGo is any more biased than we are. All we know is it's better than us, so, in principle, we should assume its biases are better than ours, too. ;-)


This post by luigi was liked by: Waylon
Top
 Profile  
 
Offline
 Post subject: Re: Leela Zero Stuck
Post #35 Posted: Fri Jan 19, 2018 12:23 am 
Judan

Posts: 6725
Location: Cambridge, UK
Liked others: 436
Was liked: 3719
Rank: UK 4 dan
KGS: Uberdude 4d
OGS: Uberdude 7d
luigi wrote:
All we know is it's better than us, so, in principle, we should assume its biases are better than ours, too.

My point is this bias of liking early 3-3 invasions is not only present when it is better than us though. AG and Leela Zero like them early in their training when they are weak players, less than 1 dan. It seems odd to me that such a weak player could have discovered something closer to the objective truth of go than humans did collectively over millenia of play (well I suppose opening at 4-4 is relatively recent if we don't count the ancient chinese cross 4-4 starting position). So something like Bill suggested that a preference for corner over centre territory arrising from the way tromp taylor rules score incomplete positions seems a plausible avenue of investigation. Or maybe it's just because they are simple. Just thinking out loud AG Lee didn't do early 3-3s but Master did. Can we explain this as it needing a lot of self play to rid itself of the human bias of not playing them, whereas training from zero it can start liking them early. I suppose AG/L Zero also start doing knight move approaches to corner stones when they are weak, but as that agrees with established human knowledge I'm not surprised that they don't "grow out" of this habit or see it as remarkable.

Top
 Profile  
 
Offline
 Post subject: Re: Leela Zero Stuck
Post #36 Posted: Fri Jan 26, 2018 8:47 am 
Lives with ko

Posts: 180
Liked others: 15
Was liked: 23
Rank: Beginner
Since a few days Leela Zero is on a 6x128 Network and since a fix yesterday with great improvements: http://zero.sjeng.org/

It will likely soon catch up the original Leela!

Top
 Profile  
 
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 36 posts ]  Go to page Previous  1, 2

All times are UTC - 8 hours [ DST ]


Who is online

Users browsing this forum: No registered users and 1 guest


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Jump to:  
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group