Exploring LZ's search algorithm: worked examples

Bill Spight · Post by **Bill Spight** » Thu Jan 16, 2020 8:38 am

dfan wrote:
Bill Spight wrote: Well, considering that the square go board, unlike the chess board, has no essentially identifiable sides, or quadrants, or semi-quadrants, and thus has eightfold symmetry, why shouldn't the neural nets have eightfold symmetry?
I was speaking from an implementation point of view (thus my question about the form of the constraint on the convolutional kernels), rather than the mathematics of the desired output (which I agree "should be" symmetrical).
I don't know what assumptions you are making, to make things so difficult.
I'm not sure what this sentence means, but I'm also not sure it's worth pursuing! If you want to spell it out more I will try to respond, but I also don't mind just dropping it.

I've got the general idea, thanks.

As far as I am concerned, going into detail about implementation would not mean much to me until I get my new axe. Then I can play around with different ideas.

Bill Spight · Post by **Bill Spight** » Thu Jan 16, 2020 8:40 am

Mike Novack wrote:
Bill Spight wrote: I wondered if they count somewhat like parrots do. Parrots can count to 6 in the sense that they can distinguish between there being 5 objects in a small enough space and there being 6 objects there, but the difference between 6 objects and 7 objects gives them trouble. I have imagined that the bots gradually learn to distinguish between N and N-1 dame as N increases.
Not just parrots. We humans can count small numbers of things if we want to, because we have learned counting as a method for when the number of things is too great for "number recognition". But we do not ordinarily count small numbers of objects. We recognize how many (just like the parrots). I believe that there have been recent papers about AI neural nets being able to learn this ability. Keep in mind, OUR brains ARE neural nets.

Rainman excepted, I think humans may not in general be as good as parrots. For instance, there are human languages with no numbers greater than three.

xela · Post by **xela** » Thu Jan 16, 2020 10:04 pm

Bill Spight wrote:Rainman excepted, I think humans may not in general be as good as parrots. For instance, there are human languages with no numbers greater than three. :o

Can't think of too many parrot languages with numbers greater than three either.

Maharani · Post by **Maharani** » Thu Jan 16, 2020 10:39 pm

xela wrote:
Bill Spight wrote:Rainman excepted, I think humans may not in general be as good as parrots. For instance, there are human languages with no numbers greater than three.
Can't think of too many parrot languages with numbers greater than three either.

Lol

jann · Post by **jann** » Fri Jan 17, 2020 12:49 am

NNs like to start from random (non-random inits learn slower, if at all), and enforcing symmetries reduces randomness. Some papers showed that most NN convergences are direct results of separatable, luckily initialized weight subsets (the rest may even be left out). And because of the randomised use of the net in selfplay, if just one symmetry learns / finds out sth useful, that will also be taught to other symmetries soon.

xela · Post by **xela** » Fri Jan 17, 2020 9:12 pm

Bill has suggested an interesting position for analysis. It's an example of "LZ ignores a good move".

The game is Sakaguchi-Hayashi, GoGoD 1861-12-18e. After

there's an interesting fight happening in the bottom left.

Click Here To Show Diagram Code: [go]$$Bc Black to play $$ +---------------------------------------+ $$ | . . . . . . . . . . . . . . . . . . . | $$ | . . . . . . . . . . . . . . . . . . . | $$ | . . . . X . . . . . . . . . O . . . . | $$ | . . . . . . . . . . . . . . . . X . . | $$ | . . . . . . . . . . . . . . . . . . . | $$ | . . . . . . . . . . . . . . . . . . . | $$ | . . . . . . . . . . . . . . . . . . . | $$ | . . . . . . . . . . . . . . . . . . . | $$ | . . . . . . . . . . . . . . . . . . . | $$ | . . . . . . . . . . . . . . . . . . . | $$ | . . . . . . . . . . . . . . . . . . . | $$ | . . . . . . . . . . . . . . . . . . . | $$ | . . . O O O X . . . . . . . . . . . . | $$ | . O b O X X X X . . . . . . . . . . . | $$ | . a X X O O . . . . . . . . . . X . . | $$ | . . X O . O . O X . . . . . O . . . . | $$ | . . X O O . c . X . . . . . . O . . . | $$ | . . e . . f d . . . . . . . . . . . . | $$ | . . . . . . . . . . . . . . . . . . . | $$ +---------------------------------------+[/go]

Looking at the policy network, a is the "first instinct" move (70-80%), and all of b through d are interesting (3-10%). Possibly e and f are worth a quick look too (around 1%). LZ-258, ELF and KataGo 1.2 all broadly agree on the policy values. (Installing KataGo 1.3 is on my to-do list!)

What would you play here? Spoilers behind the cut.

Here's the full game for your entertainment. More detailed AI analysis to come soon...

xela · Post by **xela** » Sat Jan 18, 2020 6:12 am

Here's a trace of 10,000 playouts (SGf summary only, the CSV files for this many playouts are huge!) I'll give you three versions:

LZ-258 from the starting position: it spends most of its time exploring B5, and only gives 43 playouts to G3. (Playing with different random number seeds, I can persuade it to give over 60 visits to G3, but the evaluations don't change much.)
LZ-258 analysing the position after G3 has been played.
LZ with the ELF network from the starting position. It still looks at B5 first, but starts paying serious attention to G3 after about 2,000 playouts, and looks at G3 exclusively from playout 4,319 onwards.

Remember that the difference between LZ-258 and LZ-ELF here is only the network. It's still the same software running the same search algorithm. So how do the different networks affect the choice of B5 or G3?

The policy values tell us that both networks will look at B5 first. But that's not the full story. LZ-258 gets to G3 at playout 93, and on a first look thinks that it's a 60% winrate for white, not that much different from the initial 59% for B5. But after a few playouts, G3 starts to look worse, so LZ-258 gives up on it. LZ-ELF starts off similarly, looking at G3 from playout 21, and scoring G3 at 46% for white compared with 49% for B5, again in the same ballpark. But on more playouts, LZ-ELF rates B5 as getting a lot worse for black, while the rating for G3 doesn't change much. So the difference isn't between those two positions specifically, but further down the tree.

Looking at the variations after B5, I haven't yet found a massive difference between 258 and ELF. But here's something interesting for the G3 variations:

Click Here To Show Diagram Code: [go]$$Bc $$ | . . . . . . . . . . $$ | . . . O O O X . . . $$ | a O 6 O X X X X . . $$ | . 5 X X O O . . . . $$ | b . X O . O . O X . $$ | . . X O O . 1 . X . $$ | . 3 2 4 . . . . . . $$ | . c . . . . . . . . $$ +--------------------[/go]

Both networks have a as the first instinct. But it's not actually a good move. ELF spends 15 playouts on it, and then gives 994 playouts to c (and has another look at a later on). Meanwhile it's been spending a lot of time on B5 at the start, so it's not until around playout number 3,000 that c is seen to be clearly better than a in this diagram.

LZ-258 gives 18 playouts to a, but then its second choice is b, which also doesn't work well (and is discarded after four playouts). Starting from the initial position, LZ-258 will never even look at c (the policy value is around 1%). By playout number 5,000 it's given up on G3 (at least for the time being; it will come back by playout number 100,000 if you let it run that long). Maybe this is a little blind spot in LZ-258's network that skews the evaluations slightly. Not the full story, there are still a lot of other variations to look at.

Finally, G3 in the initial position has a policy value of around 7% for LZ-258, or around 10% for ELF. Does that 3% difference have a big influence on the search? I suspect not -- we've seen examples of where 1% moves can get a lot of playouts if the evaluations are promising -- but it's hard to say for sure (I've tried scribbling down some equations based on the UCT formulae, and the maths gets pretty messy). I might come back to that later. Or I might get distracted by something else...

xela · Post by **xela** » Tue Mar 03, 2020 5:01 pm

I was going to post some ladder positions here, but it got complicated, so I started a separate thread for them.

Life In 19x19

Exploring LZ's search algorithm: worked examples

Re: Exploring LZ's search algorithm: worked examples

Re: Exploring LZ's search algorithm: worked examples

Re: Exploring LZ's search algorithm: worked examples

Re: Exploring LZ's search algorithm: worked examples

Re: Exploring LZ's search algorithm: worked examples

Re: Exploring LZ's search algorithm: worked examples

Re: Exploring LZ's search algorithm: worked examples

Re: Exploring LZ's search algorithm: worked examples