Different Quality of Play between Current AI and AlphaGo 0
-
RobertJasiek
- Judan
- Posts: 6273
- Joined: Tue Apr 27, 2010 8:54 pm
- GD Posts: 0
- Been thanked: 797 times
- Contact:
Different Quality of Play between Current AI and AlphaGo 0
Although some current AI programs are said to be 2 stones stronger than top pro level. However, they are much weaker than AlphaGo Zero. E.g., Leela Zero (via AI Sensei) is described as having occasional problems with ladders and sekis, although not every weak or lazy user notices them. I have witnessed another occasional weakness: wrong LD status assessment. Therefore, never simply trust AI. When will current AI reach AlphaGo Zero level on typical computers?
-
Uberdude
- Judan
- Posts: 6727
- Joined: Thu Nov 24, 2011 11:35 am
- Rank: UK 4 dan
- GD Posts: 0
- KGS: Uberdude 4d
- OGS: Uberdude 7d
- Location: Cambridge, UK
- Has thanked: 436 times
- Been thanked: 3718 times
Re: Different Quality of Play between Current AI and AlphaGo
To be fair we don't know how strong AG0 is on typical computers, just on high end hardware. The AlphaGo teaching tool (better Master not Zero) is on 10 million playouts, maybe it too made embarrassing ladder mistakes on 3000 playouts like LZ does and DeepMind wanted to hide that bad publicity. Does anyone know how many playouts Master was getting in the online series? I seem to recall that was on a single machine with a TPU or two in sub 30 seconds.
-
Mike Novack
- Lives in sente
- Posts: 1045
- Joined: Mon Aug 09, 2010 9:36 am
- GD Posts: 0
- Been thanked: 182 times
Re: Different Quality of Play between Current AI and AlphaGo
Well perhaps we should distinguish three, not jus two levels of computers.
a) Typical computers
b) high end gaming machines (probably a couple K to buy and maybe 300w to run)
c) high end hardware, equivalent of a mini supercomputer. Maybe even water cooled.
a) Typical computers
b) high end gaming machines (probably a couple K to buy and maybe 300w to run)
c) high end hardware, equivalent of a mini supercomputer. Maybe even water cooled.
Re: Different Quality of Play between Current AI and AlphaGo
There were some claims on LZ discord that most of today's bots, including LZ and Minigo already reached AG0 levels (raw nets I guess, hw oc makes difference).
This question has some practical consequences since LZ for example is struggling to gain further strength at the moment, and it is unclear if this is already the limitation of 40b networks (which AG0 used as well), or a consequence of some other problem.
This question has some practical consequences since LZ for example is struggling to gain further strength at the moment, and it is unclear if this is already the limitation of 40b networks (which AG0 used as well), or a consequence of some other problem.
-
Bill Spight
- Honinbo
- Posts: 10905
- Joined: Wed Apr 21, 2010 1:24 pm
- Has thanked: 3651 times
- Been thanked: 3373 times
Re: Different Quality of Play between Current AI and AlphaGo
hw = hardware?jann wrote:There were some claims on LZ discord that most of today's bots, including LZ and Minigo already reached AG0 levels (raw nets I guess, hw oc makes difference).
There is no guarantee that the fitness landscape for go is unimodal. Perhaps LZ has climbed the wrong hill. OC, in infinite time it will find its way to another hill.This question has some practical consequences since LZ for example is struggling to gain further strength at the moment, and it is unclear if this is already the limitation of 40b networks (which AG0 used as well), or a consequence of some other problem.
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
- jlt
- Gosei
- Posts: 1786
- Joined: Wed Dec 14, 2016 3:59 am
- GD Posts: 0
- Has thanked: 185 times
- Been thanked: 495 times
Re: Different Quality of Play between Current AI and AlphaGo
LeelaZero has 16 million self-play games, while AlphaZero has 29 million, so Leelazero may be a bit less strong than AlphaZero, but is probably not very far.
How many playouts are necessary to avoid problems with ladders, life and death, semeais, etc.?
How many playouts are necessary to avoid problems with ladders, life and death, semeais, etc.?
-
Bill Spight
- Honinbo
- Posts: 10905
- Joined: Wed Apr 21, 2010 1:24 pm
- Has thanked: 3651 times
- Been thanked: 3373 times
Re: Different Quality of Play between Current AI and AlphaGo
There may be path dependency.jlt wrote:LeelaZero has 16 million self-play games, while AlphaZero has 29 million, so Leelazero may be a bit less strong than AlphaZero, but is probably not very far.
Well, what do you mean by a problem? Certainly if human amateurs can do better with such things in some positions, that may be considered a problem. John Tromp has shown that ladders can pose very difficult problems, and even some "simple" ladder problems are well over 100 go moves deep. Similarly, Berlekamp and Wolfe composed endgame problems that are easy for humans who know the secret, but are almost 100 moves deep, and the candidate plays at each turn are very similar, so that it is unclear how much MCTS resembles brute force for such positions.How many playouts are necessary to avoid problems with ladders, life and death, semeais, etc.?
For the Elf commentaries, candidate plays, except for the actual game play, with fewer than 1500 playouts were not reported. If Elf's top choice had few competitors, it typically got more than 100k playouts. OC, that was not enough to avoid occasional reading errors where the human choice was demonstrably better.
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
- jlt
- Gosei
- Posts: 1786
- Joined: Wed Dec 14, 2016 3:59 am
- GD Posts: 0
- Has thanked: 185 times
- Been thanked: 495 times
Re: Different Quality of Play between Current AI and AlphaGo
Bill Spight wrote:Well, what do you mean by a problem?How many playouts are necessary to avoid problems with ladders, life and death, semeais, etc.?
Let me ask a more precise question. Are there examples of human games (pro or amateur) in which a recent version of LeelaZero with 100000 playouts chooses a wrong move because of a misread ladder, or life-and-death problem, or semeai?
If the answer is yes, then by which number should I replace 100000 so that the answer becomes "no"?
-
Gomoto
- Gosei
- Posts: 1733
- Joined: Sun Nov 06, 2016 6:56 am
- GD Posts: 0
- Location: Earth
- Has thanked: 621 times
- Been thanked: 310 times
Re: Different Quality of Play between Current AI and AlphaGo
I trust AI, even if it makes some mistakes from time to time. I even trust my fellow human beeings, even if they make some mistakes from time to time. The concept to only accept flawless results is inherently dangerous, allthough many people do not realize this, instead of the obvious examples in history and personal experiences.Therefore, never simply trust AI. When will current AI reach AlphaGo Zero level on typical computers?
It is better to be aware that mistakes will happen and react to them in proper ways.
(I acknowledge that the quote says never SIMPLY trust AI.)
Re: Different Quality of Play between Current AI and AlphaGo
Yes.Bill Spight wrote:hw = hardware?
In my experience random differences in end results are usually small with different training runs using identical parameters. Seemingly minor differences having unexpectedly large effect on the final net is more common. (In that case the above mentioned question: what did LZ do wrong?)There is no guarantee that the fitness landscape for go is unimodal. Perhaps LZ has climbed the wrong hill.This question has some practical consequences since LZ for example is struggling to gain further strength at the moment, and it is unclear if this is already the limitation of 40b networks (which AG0 used as well), or a consequence of some other problem.
It is not easy to compare LZ nets to AG0 nets since the published games not only use high playouts, but are also hand-selected (which may explain some brilliancies). But I doubt LZ would be near AG0 atm. Comparing selfplay numbers also unreliable since there ARE/were significant differences, so a factor of 2 in selfplay amount can easily slip left or right.jlt wrote:LeelaZero has 16 million self-play games, while AlphaZero has 29 million, so Leelazero may be a bit less strong than AlphaZero, but is probably not very far.
The question about ladders is even more difficult. There were some recent examples where LZ (on normal low-ish playouts) misplayed ladders with only a few disturbing stones in vicinity. Low and even mid playouts must rely on net guess about ladder to an extent, since actual playouts may not go along the ladder line for the player the net thinks the ladder is bad for (thus wont see its real outcome until very high playouts).
-
Bill Spight
- Honinbo
- Posts: 10905
- Joined: Wed Apr 21, 2010 1:24 pm
- Has thanked: 3651 times
- Been thanked: 3373 times
Re: Different Quality of Play between Current AI and AlphaGo
Yes, I lost faith in the DeepMind team when they only published selected games, which made them useless for scientific purposes. Good PR, I suppose. You got the same kind of thing in chess. They knew better.jann wrote:It is not easy to compare LZ nets to AG0 nets since the published games not only use high playouts, but are also hand-selected (which may explain some brilliancies).jlt wrote:LeelaZero has 16 million self-play games, while AlphaZero has 29 million, so Leelazero may be a bit less strong than AlphaZero, but is probably not very far.
At least the Master vs. human games were not selected, and the Elf commentaries are comprehensive. (Different team.
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
-
Kirby
- Honinbo
- Posts: 9553
- Joined: Wed Feb 24, 2010 6:04 pm
- GD Posts: 0
- KGS: Kirby
- Tygem: 커비라고해
- Has thanked: 1583 times
- Been thanked: 1707 times
Re: Different Quality of Play between Current AI and AlphaGo
Except that they only chose to reveal that Master was associated with AlphaGo *after* it showed an impressive resultBill Spight wrote:At least the Master vs. human games were not selected
be immersed
-
Bill Spight
- Honinbo
- Posts: 10905
- Joined: Wed Apr 21, 2010 1:24 pm
- Has thanked: 3651 times
- Been thanked: 3373 times
Re: Different Quality of Play between Current AI and AlphaGo
That's true.Kirby wrote:Except that they only chose to reveal that Master was associated with AlphaGo *after* it showed an impressive resultBill Spight wrote:At least the Master vs. human games were not selected
But the games were still representative. No selection bias.
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
-
hydrogenpi7
- Dies in gote
- Posts: 63
- Joined: Sat Mar 25, 2017 3:19 pm
- GD Posts: 0
- Been thanked: 3 times
Re: Different Quality of Play between Current AI and AlphaGo
jann wrote:There were some claims on LZ discord that most of today's bots, including LZ and Minigo already reached AG0 levels (raw nets I guess, hw oc makes difference).
This question has some practical consequences since LZ for example is struggling to gain further strength at the moment, and it is unclear if this is already the limitation of 40b networks (which AG0 used as well), or a consequence of some other problem.
So using something like a Huawei Atlas 900 cluster and the latest LZ, that means it can beat the AlphaGoZero?
Would Huawei and Google we willing to do a match?
-
hydrogenpi7
- Dies in gote
- Posts: 63
- Joined: Sat Mar 25, 2017 3:19 pm
- GD Posts: 0
- Been thanked: 3 times
Re: Different Quality of Play between Current AI and AlphaGo
also for what it is worth, had they played Lee with the Fan version of AG, good chance Lee would have won the match, had they played Ke Jie with the Lee version of AlphaGo, good chance Ke Jie might have won some games etc... I think they were conservative in that they wanted to assure wins at each step up the AI strength ladder, while in retrospect it seems clear the Fan version of AlphaGo was very weak and not "superhuman", certaintly current LZ is already past even the Lee version of AlphaGoBill Spight wrote:Yes, I lost faith in the DeepMind team when they only published selected games, which made them useless for scientific purposes. Good PR, I suppose. You got the same kind of thing in chess. They knew better.jann wrote:It is not easy to compare LZ nets to AG0 nets since the published games not only use high playouts, but are also hand-selected (which may explain some brilliancies).jlt wrote:LeelaZero has 16 million self-play games, while AlphaZero has 29 million, so Leelazero may be a bit less strong than AlphaZero, but is probably not very far.
At least the Master vs. human games were not selected, and the Elf commentaries are comprehensive. (Different team.)