Re: Leela Zero Stuck
Posted: Sun Jan 07, 2018 1:31 pm
Worth reading for those wondering about what GCP has planned for the future:
https://github.com/gcp/leela-zero/issues/591
https://github.com/gcp/leela-zero/issues/591
Life in 19x19. Go, Weiqi, Baduk... Thats the life.
https://lifein19x19.com/
I would expect the network to realize it's own vision range problem (from selfplay), and preemptively avoid unclear ladders. It's interesting this does not happen.Uberdude wrote:LeelaZero having ladder troubles against a human on OGS
Perhaps mutual blindness? Neither version of LeelaZero whether the ladder works, and so it is never played out, and LeelaZero never learns. (Or it will take a long time.) As for avoiding unclear ladders, not much is clear, is it? Especially in the opening.moha wrote:I would expect the network to realize it's own vision range problem (from selfplay), and preemptively avoid unclear ladders. It's interesting this does not happen.Uberdude wrote:LeelaZero having ladder troubles against a human on OGS
There is a point in playing out unread ladders though, for randomization. So during selfplay the side behind may play it.Bill Spight wrote:Perhaps mutual blindness? Neither version of LeelaZero whether the ladder works, and so it is never played out, and LeelaZero never learns. (Or it will take a long time.) As for avoiding unclear ladders, not much is clear, is it? Especially in the opening.moha wrote:I would expect the network to realize it's own vision range problem (from selfplay), and preemptively avoid unclear ladders. It's interesting this does not happen.Uberdude wrote:LeelaZero having ladder troubles against a human on OGS
Uberdude wrote:LeelaZero having ladder troubles against a human on OGS
How often will the next step of a ladder occur with quasi-random play?moha wrote:I would expect the network to realize it's own vision range problem (from selfplay), and preemptively avoid unclear ladders. It's interesting this does not happen.There is a point in playing out unread ladders though, for randomization. So during selfplay the side behind may play it.Bill Spight wrote:Perhaps mutual blindness? Neither version of LeelaZero whether the ladder works, and so it is never played out, and LeelaZero never learns. (Or it will take a long time.) As for avoiding unclear ladders, not much is clear, is it? Especially in the opening.
Why quasi-random play?Bill Spight wrote:How often will the next step of a ladder occur with quasi-random play?moha wrote:There is a point in playing out unread ladders though, for randomization. So during selfplay the side behind may play it.
Because I did not think that the choice of plays was completely random.moha wrote:Why quasi-random play?Bill Spight wrote:How often will the next step of a ladder occur with quasi-random play?moha wrote:There is a point in playing out unread ladders though, for randomization. So during selfplay the side behind may play it.
I actually have trouble following the logic myself.Uberdude wrote:Is Bill's point that the quasi-random play of monte carlo rollouts will not be very good at playing ladders so it won't learn them well, but moha's counter point that Leela Zero doesn't use monte carlo rollouts?
No, my point is that if you have two players who share a blindness, it is difficult for them to learn from each other to overcome that blindness. It has nothing to do with AI per se. Humans have the same problem.Uberdude wrote:Is Bill's point that the quasi-random play of monte carlo rollouts will not be very good at playing ladders so it won't learn them well, but moha's counter point that Leela Zero doesn't use monte carlo rollouts?
Right. Even human beginners can apply logic. (Ladders are logical.) Could bots learn ladders using logic? Of course. But not the bots that are currently in vogue.moha wrote:Deepmind also noted that - unlike humans - a bot learns about ladders relatively late.