AlphaZero paper discussion (Mastering Go, Chess, and Shogi)
-
Uberdude
- Judan
- Posts: 6727
- Joined: Thu Nov 24, 2011 11:35 am
- Rank: UK 4 dan
- GD Posts: 0
- KGS: Uberdude 4d
- OGS: Uberdude 7d
- Location: Cambridge, UK
- Has thanked: 436 times
- Been thanked: 3718 times
Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho
I don't understand much of backgammon, bur found it interesting that TD-Gammon upended some opening theory and made humans start playing moves they previously thought were bad, rather like AlphaGo and the early 3-3 invasions.
- djhbrown
- Lives in gote
- Posts: 392
- Joined: Tue Sep 15, 2015 5:00 pm
- Rank: NR
- GD Posts: 0
- Has thanked: 23 times
- Been thanked: 43 times
Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho
Please can someone who is competent to judge explain why Uberdude who has never manifested any evidence of professional competence in AI has licence to make repeated personal abuse attacks upon me in this forum over several years without ever receiving a reprimand from a moderator?Uberdude wrote:His recent output is not of professional quality.
Being good at Go does not make one good at being human.
i shrink, therefore i swarm
-
Kirby
- Honinbo
- Posts: 9553
- Joined: Wed Feb 24, 2010 6:04 pm
- GD Posts: 0
- KGS: Kirby
- Tygem: 커비라고해
- Has thanked: 1583 times
- Been thanked: 1707 times
Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho
dfan wrote:moha wrote:They weren't hopeless though.dfan wrote:if you handed a bunch of 1971-era AI researchers 2017-level hardware, they wouldn't be able to create AlphaGo
Indeed, though note that TD-Gammon was developed 21 years after 1971 and benefited from (and extended!) the theories of reinforcement learning developed during those decades. (I saw Tesauro talk at a conference last year about his experience creating TD-Gammon and it was quite interesting.)
I agree that advancements have been made from AI outside of hardware. But it's also hard to separate the two. For example, a number of research advancements may have been made possible with increased computing power and availability.
It's somewhat of a hypothetical argument, similar to how people talk about who would be stronger between a "modern day Shusaku" and, for example, Ke Jie. It's nice to think about as a thought experiment and make various reasoning, but the idea can't extend the limits of its hypothetical nature.
As to whether 1971 researchers could replicate AlphaGo, I'd say that dfan is probably right, but who really knows?
be immersed
-
Kirby
- Honinbo
- Posts: 9553
- Joined: Wed Feb 24, 2010 6:04 pm
- GD Posts: 0
- KGS: Kirby
- Tygem: 커비라고해
- Has thanked: 1583 times
- Been thanked: 1707 times
Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho
djhbrown wrote:Please can someone who is competent to judge explain why Uberdude who has never manifested any evidence of professional competence in AI has licence to make repeated personal abuse attacks upon me in this forum over several years without ever receiving a reprimand from a moderator?Uberdude wrote:His recent output is not of professional quality.
Being good at Go does not make one good at being human.
OK. As an admin, I'd like not to take sides on the matter. Let's remain cordial with one another. For all in the thread, please refrain from personal attacks, whether that means questioning an individual's professional credentials, or questioning their qualities as human beings.
Let's keep the discussion related to topics related to AlphaZero, AI, Go, and the like - not about individuals.
be immersed
-
dfan
- Gosei
- Posts: 1598
- Joined: Wed Apr 21, 2010 8:49 am
- Rank: AGA 2k Fox 3d
- GD Posts: 61
- KGS: dfan
- Has thanked: 891 times
- Been thanked: 534 times
- Contact:
Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho
Kirby wrote:I agree that advancements have been made from AI outside of hardware. But it's also hard to separate the two. For example, a number of research advancements may have been made possible with increased computing power and availability.
Oh, this is 100% the case. There was a lot of neural network research in the 1980s that seemed like a dead end because it would require too much computing power to be useful. Thirty years later...
As to whether 1971 researchers could replicate AlphaGo, I'd say that dfan is probably right, but who really knows?
Maybe they'd get there eventually! But a lot of AI research has been done in the last 46 years, and I'm pretty reluctant to believe that it all could have been reproduced in a couple of years or something if only computers were faster. A lot of very smart people did a lot of very hard thinking for a long time to get where we are now.
-
Kirby
- Honinbo
- Posts: 9553
- Joined: Wed Feb 24, 2010 6:04 pm
- GD Posts: 0
- KGS: Kirby
- Tygem: 커비라고해
- Has thanked: 1583 times
- Been thanked: 1707 times
-
pookpooi
- Lives in sente
- Posts: 727
- Joined: Sat Aug 21, 2010 12:26 pm
- GD Posts: 10
- Has thanked: 44 times
- Been thanked: 218 times
Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho
In a rare occasion American Go Association post an article from chess.com on their E-Journal but IMHO they should add details on AZ outclass AGZ 20 blocks as well, otherwise the article is not related to Go at all.
- djhbrown
- Lives in gote
- Posts: 392
- Joined: Tue Sep 15, 2015 5:00 pm
- Rank: NR
- GD Posts: 0
- Has thanked: 23 times
- Been thanked: 43 times
Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho
That's an overstatement and a misattribution of cause, since it was not lack of computing power but lack of functionality that held NNs up - the breakthrough was Le Cann's idea of convolutions. Go-wise, the idea of trying out Monte-Carlo search was also a quantum leap, so putting 2 and 2 together was a smart move.dfan wrote:There was a lot of neural network research in the 1980s that seemed like a dead end because it would require too much computing power to be useful.
So far, so good...
Where to next? A Go program that can talk? Or one that can see eyes? NNs can separate images containing a cat from those that don't, but can they draw a line around the cat?
i shrink, therefore i swarm
-
Gomoto
- Gosei
- Posts: 1733
- Joined: Sun Nov 06, 2016 6:56 am
- GD Posts: 0
- Location: Earth
- Has thanked: 621 times
- Been thanked: 310 times
Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho
I learned some things about life, while playing go.
Therefore if it plays go, it knows some things about life too.
Therefore if it plays go, it knows some things about life too.
-
jeromie
- Lives in sente
- Posts: 902
- Joined: Fri Jan 31, 2014 7:12 pm
- Rank: AGA 3k
- GD Posts: 0
- Universal go server handle: jeromie
- Location: Fort Collins, CO
- Has thanked: 319 times
- Been thanked: 287 times
Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho
djhbrown wrote: can they draw a line around the cat?
This looks pretty good to me.
https://github.com/s9xie/hed
On a side note, my focus area for my master's degree in computer engineering was intelligent systems. Unfortunately, neural networks were out of vogue at my school in the early 2000s. I wasn't thrilled with my research area; I sincerely wish I could have studied something akin to what is happening in machine learning now!
- HermanHiddema
- Gosei
- Posts: 2011
- Joined: Tue Apr 20, 2010 10:08 am
- Rank: Dutch 4D
- GD Posts: 645
- Universal go server handle: herminator
- Location: Groningen, NL
- Has thanked: 202 times
- Been thanked: 1086 times
Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho
djhbrown wrote:Please can someone who is competent to judge explain why Uberdude who has never manifested any evidence of professional competence in AI has licence to make repeated personal abuse attacks upon me in this forum over several years without ever receiving a reprimand from a moderator?Uberdude wrote:His recent output is not of professional quality.
Being good at Go does not make one good at being human.
Questioning the quality of someone's work is not a personal attack.
If you post content, people should be free to criticize that content.
-
Revilo
- Dies in gote
- Posts: 37
- Joined: Sun Jul 24, 2016 11:03 pm
- Rank: IGS 9k EGF 6k
- GD Posts: 0
- KGS: Revilo
- IGS: Revilo
- Location: Frankfurt, Germany
- Has thanked: 2 times
- Been thanked: 19 times
- Contact:
Re: AlphaZero paper discussion (not the same as AlphaGo Zero
Uberdude wrote:I have a few questions for the chess experts here:
- looking at the chess openings is AlphaZero playing the long standard opening book lines or has it found a way to diverge early without playing bad moves? My impression of chess was human knowledge of the opening was closer to perfect play than in go and is sharper so there was less scope for novelty into unexplored areas without playing suboptimal moves.
I've just browsed through the games (https://lichess.org/study/EOddRjJ8) and I have to say that I'm quite impressed. It actually hasn't really innovated in the opening, but what can be clearly seen is that it has no qualms about positional sacrifices, even large ones. This is something that the alphabeta bean counters do not tend to do just like that - the reason being that piece values are fix and positional bonuses and maluses rarely add up enough to compensate for a piece - a pawn or an exchange (i.e. of rook against bishop or knight) sometimes maybe, but a full piece? Doesn't happen.
Alpha apparently likes to play a gambit in the Queen's Indian Defense that has been around for a long time already, and which has become very popular in the wake of early games of Garry Kasparov, who used this as his main weapon in the early 80s before he became world champion.
So what does Alpha make of it? We'll have a look at game 10 (see above link).
White's gambit move is the 7th, pushing the pawn to d5. Alpha's first major deviation from established theory is the 12th move, with the 14th being the first completely new move. So what happened then? Only five moves later Alpha sacrifices its knight on h6 for no immediate material compensation - it's just that Black's rook and knight are still at home, the Black king looks vulnerable on h6 and every White piece is going to be efficiently developed about two or three moves later.
I will have to check what the latest Komodo or Houdini think about this sacrifice, but I'm confident that they are not going to like it much. Moreover, in the moves following the sacrifice, White just continues to develop calmly. It would take a lot of confidence and positional judgement for a human grandmaster to play like this, but it is conceivable. Human-style play for sure. Long lasting positional compensation isn't something the bean counters like very much, though.
A similar positional piece sacrifice can be seen in game 9, White's 30th move. The point is White's very nice follow up on the 32nd move, after which White ends up a piece down but Black is tied up nicely and its extra piece, the bishop at b7, doesn't do any relevant work. Alpha converts its advantage without any further fireworks 20 moves later. This sacrifice is probably also quite a leap for a traditional engine (alphabeta + hand crafted evaluation function).
Uberdude wrote:- Is the play of stockfish near its peak strength, i.e if it has more time or resources does it get significantly better (anyone try at home) and not play the moves that let AZ beat it? I wonder if perhaps neural networks bots are better at blitz than tree search bots (in training you essentially transfer the skill from tree search into one huge function which is quick to compute).*
Edit: Now I read the paper the 100 game match was not 1 seconds a move like for the Elo evaluation in the graph, but 1 minute a move with 64 threads and 1 GB hash which sounds better but still I'm not clear how far from peak strength and diminishing returns that is (and could be a lot smaller than the 4 TPUs AZ got). Looking at the kibitz on the TCEC match many chess players are dismissive of the conditions, saying the specs for stockfish engine are unfair/small.
I've been out of the trade for a while but I'd say that the specs do not seem too shabby. Also, 70000k evals per second vs. 80k - well, how much additional hardware do they actually suggest to throw at Alpha?
Last edited by Revilo on Fri Dec 08, 2017 5:49 am, edited 1 time in total.
-
moha
- Lives in gote
- Posts: 311
- Joined: Wed May 31, 2017 6:49 am
- Rank: 2d
- GD Posts: 0
- Been thanked: 45 times
Re: AlphaZero paper discussion (Mastering Go, Chess, and Sho
I think advances in theory also depend on hardware though. If it takes a week to try out something, progress will be slower than if answers appear in a minute or two.dfan wrote:Maybe they'd get there eventually! But a lot of AI research has been done in the last 46 years, and I'm pretty reluctant to believe that it all could have been reproduced in a couple of years or something if only computers were faster. A lot of very smart people did a lot of very hard thinking for a long time to get where we are now.
Nonetheless, I still find it amazing how close TD-Gammon was to AGZ, 25 years ago (the biggest difference probably residual convnets - go would probably be impossible with the simple networks of that time). But since NNs started to move again recently (with some of their longstanding problems solved), we will probably see more drastic advances from now on.
One strong point of this network + search approach seems that it's hard to imagine a game now, where this wouldn't dominate humans. Even for games created especially as "tough for computers". A human facing a complex task does the same (apply some intuition, then think ahead and compare in a few promising lines - with less depth and accuracy OC). So if a human can do it, a program can probably do it too. Even Starcraft
-
moha
- Lives in gote
- Posts: 311
- Joined: Wed May 31, 2017 6:49 am
- Rank: 2d
- GD Posts: 0
- Been thanked: 45 times
Re: AlphaZero paper discussion (not the same as AlphaGo Zero
Where were they actually described? I only recall seeing things like "1G hash, 64 threads" - but 64 at which kind of hardware? (OC, we can roughly guess from the pos/s given.)Revilo wrote:I've been out of the trade for a while but I'd say that the specs do not seem too shabby.