AlphaZero paper discussion (Mastering Go, Chess, and Shogi)

For discussing go computing, software announcements, etc.
Uberdude
Judan
Posts: 6727
Joined: Thu Nov 24, 2011 11:35 am
Rank: UK 4 dan
GD Posts: 0
KGS: Uberdude 4d
OGS: Uberdude 7d
Location: Cambridge, UK
Has thanked: 436 times
Been thanked: 3718 times

Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho

Post by Uberdude »

I don't understand much of backgammon, bur found it interesting that TD-Gammon upended some opening theory and made humans start playing moves they previously thought were bad, rather like AlphaGo and the early 3-3 invasions.
User avatar
djhbrown
Lives in gote
Posts: 392
Joined: Tue Sep 15, 2015 5:00 pm
Rank: NR
GD Posts: 0
Has thanked: 23 times
Been thanked: 43 times

Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho

Post by djhbrown »

Uberdude wrote:His recent output is not of professional quality.
Please can someone who is competent to judge explain why Uberdude who has never manifested any evidence of professional competence in AI has licence to make repeated personal abuse attacks upon me in this forum over several years without ever receiving a reprimand from a moderator?

Being good at Go does not make one good at being human.
i shrink, therefore i swarm
Kirby
Honinbo
Posts: 9553
Joined: Wed Feb 24, 2010 6:04 pm
GD Posts: 0
KGS: Kirby
Tygem: 커비라고해
Has thanked: 1583 times
Been thanked: 1707 times

Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho

Post by Kirby »

dfan wrote:
moha wrote:
dfan wrote:if you handed a bunch of 1971-era AI researchers 2017-level hardware, they wouldn't be able to create AlphaGo
They weren't hopeless though. :)

Indeed, though note that TD-Gammon was developed 21 years after 1971 and benefited from (and extended!) the theories of reinforcement learning developed during those decades. (I saw Tesauro talk at a conference last year about his experience creating TD-Gammon and it was quite interesting.)


I agree that advancements have been made from AI outside of hardware. But it's also hard to separate the two. For example, a number of research advancements may have been made possible with increased computing power and availability.

It's somewhat of a hypothetical argument, similar to how people talk about who would be stronger between a "modern day Shusaku" and, for example, Ke Jie. It's nice to think about as a thought experiment and make various reasoning, but the idea can't extend the limits of its hypothetical nature.

As to whether 1971 researchers could replicate AlphaGo, I'd say that dfan is probably right, but who really knows?
be immersed
Kirby
Honinbo
Posts: 9553
Joined: Wed Feb 24, 2010 6:04 pm
GD Posts: 0
KGS: Kirby
Tygem: 커비라고해
Has thanked: 1583 times
Been thanked: 1707 times

Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho

Post by Kirby »

djhbrown wrote:
Uberdude wrote:His recent output is not of professional quality.
Please can someone who is competent to judge explain why Uberdude who has never manifested any evidence of professional competence in AI has licence to make repeated personal abuse attacks upon me in this forum over several years without ever receiving a reprimand from a moderator?

Being good at Go does not make one good at being human.


OK. As an admin, I'd like not to take sides on the matter. Let's remain cordial with one another. For all in the thread, please refrain from personal attacks, whether that means questioning an individual's professional credentials, or questioning their qualities as human beings.

Let's keep the discussion related to topics related to AlphaZero, AI, Go, and the like - not about individuals.
be immersed
dfan
Gosei
Posts: 1598
Joined: Wed Apr 21, 2010 8:49 am
Rank: AGA 2k Fox 3d
GD Posts: 61
KGS: dfan
Has thanked: 891 times
Been thanked: 534 times
Contact:

Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho

Post by dfan »

Kirby wrote:I agree that advancements have been made from AI outside of hardware. But it's also hard to separate the two. For example, a number of research advancements may have been made possible with increased computing power and availability.

Oh, this is 100% the case. There was a lot of neural network research in the 1980s that seemed like a dead end because it would require too much computing power to be useful. Thirty years later...

As to whether 1971 researchers could replicate AlphaGo, I'd say that dfan is probably right, but who really knows?

Maybe they'd get there eventually! But a lot of AI research has been done in the last 46 years, and I'm pretty reluctant to believe that it all could have been reproduced in a couple of years or something if only computers were faster. A lot of very smart people did a lot of very hard thinking for a long time to get where we are now.
Kirby
Honinbo
Posts: 9553
Joined: Wed Feb 24, 2010 6:04 pm
GD Posts: 0
KGS: Kirby
Tygem: 커비라고해
Has thanked: 1583 times
Been thanked: 1707 times

Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho

Post by Kirby »

Probably correct, dfan.
be immersed
pookpooi
Lives in sente
Posts: 727
Joined: Sat Aug 21, 2010 12:26 pm
GD Posts: 10
Has thanked: 44 times
Been thanked: 218 times

Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho

Post by pookpooi »

In a rare occasion American Go Association post an article from chess.com on their E-Journal but IMHO they should add details on AZ outclass AGZ 20 blocks as well, otherwise the article is not related to Go at all.
User avatar
djhbrown
Lives in gote
Posts: 392
Joined: Tue Sep 15, 2015 5:00 pm
Rank: NR
GD Posts: 0
Has thanked: 23 times
Been thanked: 43 times

Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho

Post by djhbrown »

dfan wrote:There was a lot of neural network research in the 1980s that seemed like a dead end because it would require too much computing power to be useful.
That's an overstatement and a misattribution of cause, since it was not lack of computing power but lack of functionality that held NNs up - the breakthrough was Le Cann's idea of convolutions. Go-wise, the idea of trying out Monte-Carlo search was also a quantum leap, so putting 2 and 2 together was a smart move.

So far, so good...

Where to next? A Go program that can talk? Or one that can see eyes? NNs can separate images containing a cat from those that don't, but can they draw a line around the cat?
i shrink, therefore i swarm
Gomoto
Gosei
Posts: 1733
Joined: Sun Nov 06, 2016 6:56 am
GD Posts: 0
Location: Earth
Has thanked: 621 times
Been thanked: 310 times

Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho

Post by Gomoto »

I for one do not care if AI is not there yet.

If it plays go and wins, I like it. :ugeek:
Gomoto
Gosei
Posts: 1733
Joined: Sun Nov 06, 2016 6:56 am
GD Posts: 0
Location: Earth
Has thanked: 621 times
Been thanked: 310 times

Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho

Post by Gomoto »

I learned some things about life, while playing go.

Therefore if it plays go, it knows some things about life too. ;-)
jeromie
Lives in sente
Posts: 902
Joined: Fri Jan 31, 2014 7:12 pm
Rank: AGA 3k
GD Posts: 0
Universal go server handle: jeromie
Location: Fort Collins, CO
Has thanked: 319 times
Been thanked: 287 times

Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho

Post by jeromie »

djhbrown wrote: can they draw a line around the cat?

This looks pretty good to me. :)
https://github.com/s9xie/hed

On a side note, my focus area for my master's degree in computer engineering was intelligent systems. Unfortunately, neural networks were out of vogue at my school in the early 2000s. I wasn't thrilled with my research area; I sincerely wish I could have studied something akin to what is happening in machine learning now!
User avatar
HermanHiddema
Gosei
Posts: 2011
Joined: Tue Apr 20, 2010 10:08 am
Rank: Dutch 4D
GD Posts: 645
Universal go server handle: herminator
Location: Groningen, NL
Has thanked: 202 times
Been thanked: 1086 times

Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho

Post by HermanHiddema »

djhbrown wrote:
Uberdude wrote:His recent output is not of professional quality.
Please can someone who is competent to judge explain why Uberdude who has never manifested any evidence of professional competence in AI has licence to make repeated personal abuse attacks upon me in this forum over several years without ever receiving a reprimand from a moderator?

Being good at Go does not make one good at being human.


Questioning the quality of someone's work is not a personal attack.

If you post content, people should be free to criticize that content.
Revilo
Dies in gote
Posts: 37
Joined: Sun Jul 24, 2016 11:03 pm
Rank: IGS 9k EGF 6k
GD Posts: 0
KGS: Revilo
IGS: Revilo
Location: Frankfurt, Germany
Has thanked: 2 times
Been thanked: 19 times
Contact:

Re: AlphaZero paper discussion (not the same as AlphaGo Zero

Post by Revilo »

Uberdude wrote:I have a few questions for the chess experts here:
- looking at the chess openings is AlphaZero playing the long standard opening book lines or has it found a way to diverge early without playing bad moves? My impression of chess was human knowledge of the opening was closer to perfect play than in go and is sharper so there was less scope for novelty into unexplored areas without playing suboptimal moves.


I've just browsed through the games (https://lichess.org/study/EOddRjJ8) and I have to say that I'm quite impressed. It actually hasn't really innovated in the opening, but what can be clearly seen is that it has no qualms about positional sacrifices, even large ones. This is something that the alphabeta bean counters do not tend to do just like that - the reason being that piece values are fix and positional bonuses and maluses rarely add up enough to compensate for a piece - a pawn or an exchange (i.e. of rook against bishop or knight) sometimes maybe, but a full piece? Doesn't happen.

Alpha apparently likes to play a gambit in the Queen's Indian Defense that has been around for a long time already, and which has become very popular in the wake of early games of Garry Kasparov, who used this as his main weapon in the early 80s before he became world champion.

So what does Alpha make of it? We'll have a look at game 10 (see above link).

White's gambit move is the 7th, pushing the pawn to d5. Alpha's first major deviation from established theory is the 12th move, with the 14th being the first completely new move. So what happened then? Only five moves later Alpha sacrifices its knight on h6 for no immediate material compensation - it's just that Black's rook and knight are still at home, the Black king looks vulnerable on h6 and every White piece is going to be efficiently developed about two or three moves later.

I will have to check what the latest Komodo or Houdini think about this sacrifice, but I'm confident that they are not going to like it much. Moreover, in the moves following the sacrifice, White just continues to develop calmly. It would take a lot of confidence and positional judgement for a human grandmaster to play like this, but it is conceivable. Human-style play for sure. Long lasting positional compensation isn't something the bean counters like very much, though.

A similar positional piece sacrifice can be seen in game 9, White's 30th move. The point is White's very nice follow up on the 32nd move, after which White ends up a piece down but Black is tied up nicely and its extra piece, the bishop at b7, doesn't do any relevant work. Alpha converts its advantage without any further fireworks 20 moves later. This sacrifice is probably also quite a leap for a traditional engine (alphabeta + hand crafted evaluation function).


Uberdude wrote:- Is the play of stockfish near its peak strength, i.e if it has more time or resources does it get significantly better (anyone try at home) and not play the moves that let AZ beat it? I wonder if perhaps neural networks bots are better at blitz than tree search bots (in training you essentially transfer the skill from tree search into one huge function which is quick to compute).*
Edit: Now I read the paper the 100 game match was not 1 seconds a move like for the Elo evaluation in the graph, but 1 minute a move with 64 threads and 1 GB hash which sounds better but still I'm not clear how far from peak strength and diminishing returns that is (and could be a lot smaller than the 4 TPUs AZ got). Looking at the kibitz on the TCEC match many chess players are dismissive of the conditions, saying the specs for stockfish engine are unfair/small.


I've been out of the trade for a while but I'd say that the specs do not seem too shabby. Also, 70000k evals per second vs. 80k - well, how much additional hardware do they actually suggest to throw at Alpha? :) Of course, the reason is that multithreaded alphabeta search does not scale up so well and the search has to be deep enough to compensate for the relative dumbness of the evaluation function. So they would probably have rather had something like tournament time controls (2h for 40 moves and so on) instead of a fixed time per move - then Stockfish could have used its time management (make forced moves immediately, use saved up time when problems appear (a sudden "fail-low" because a deep resource was discovered by the search - in these cases engines often use additional time to search deeper and fix the variation).
Last edited by Revilo on Fri Dec 08, 2017 5:49 am, edited 1 time in total.
moha
Lives in gote
Posts: 311
Joined: Wed May 31, 2017 6:49 am
Rank: 2d
GD Posts: 0
Been thanked: 45 times

Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho

Post by moha »

dfan wrote:Maybe they'd get there eventually! But a lot of AI research has been done in the last 46 years, and I'm pretty reluctant to believe that it all could have been reproduced in a couple of years or something if only computers were faster. A lot of very smart people did a lot of very hard thinking for a long time to get where we are now.
I think advances in theory also depend on hardware though. If it takes a week to try out something, progress will be slower than if answers appear in a minute or two. :)

Nonetheless, I still find it amazing how close TD-Gammon was to AGZ, 25 years ago (the biggest difference probably residual convnets - go would probably be impossible with the simple networks of that time). But since NNs started to move again recently (with some of their longstanding problems solved), we will probably see more drastic advances from now on.

One strong point of this network + search approach seems that it's hard to imagine a game now, where this wouldn't dominate humans. Even for games created especially as "tough for computers". A human facing a complex task does the same (apply some intuition, then think ahead and compare in a few promising lines - with less depth and accuracy OC). So if a human can do it, a program can probably do it too. Even Starcraft :) - but that will probably need the next advance in NNs.
moha
Lives in gote
Posts: 311
Joined: Wed May 31, 2017 6:49 am
Rank: 2d
GD Posts: 0
Been thanked: 45 times

Re: AlphaZero paper discussion (not the same as AlphaGo Zero

Post by moha »

Revilo wrote:I've been out of the trade for a while but I'd say that the specs do not seem too shabby.
Where were they actually described? I only recall seeing things like "1G hash, 64 threads" - but 64 at which kind of hardware? (OC, we can roughly guess from the pos/s given.)
Post Reply