Different Quality of Play between Current AI and AlphaGo 0

For discussing go computing, software announcements, etc.
RobertJasiek
Judan
Posts: 6272
Joined: Tue Apr 27, 2010 8:54 pm
GD Posts: 0
Been thanked: 797 times
Contact:

Different Quality of Play between Current AI and AlphaGo 0

Post by RobertJasiek »

Although some current AI programs are said to be 2 stones stronger than top pro level. However, they are much weaker than AlphaGo Zero. E.g., Leela Zero (via AI Sensei) is described as having occasional problems with ladders and sekis, although not every weak or lazy user notices them. I have witnessed another occasional weakness: wrong LD status assessment. Therefore, never simply trust AI. When will current AI reach AlphaGo Zero level on typical computers?
Uberdude
Judan
Posts: 6727
Joined: Thu Nov 24, 2011 11:35 am
Rank: UK 4 dan
GD Posts: 0
KGS: Uberdude 4d
OGS: Uberdude 7d
Location: Cambridge, UK
Has thanked: 436 times
Been thanked: 3718 times

Re: Different Quality of Play between Current AI and AlphaGo

Post by Uberdude »

To be fair we don't know how strong AG0 is on typical computers, just on high end hardware. The AlphaGo teaching tool (better Master not Zero) is on 10 million playouts, maybe it too made embarrassing ladder mistakes on 3000 playouts like LZ does and DeepMind wanted to hide that bad publicity. Does anyone know how many playouts Master was getting in the online series? I seem to recall that was on a single machine with a TPU or two in sub 30 seconds.
Mike Novack
Lives in sente
Posts: 1045
Joined: Mon Aug 09, 2010 9:36 am
GD Posts: 0
Been thanked: 182 times

Re: Different Quality of Play between Current AI and AlphaGo

Post by Mike Novack »

Well perhaps we should distinguish three, not jus two levels of computers.

a) Typical computers
b) high end gaming machines (probably a couple K to buy and maybe 300w to run)
c) high end hardware, equivalent of a mini supercomputer. Maybe even water cooled.
jann
Lives in gote
Posts: 445
Joined: Tue May 14, 2019 8:00 pm
GD Posts: 0
Been thanked: 37 times

Re: Different Quality of Play between Current AI and AlphaGo

Post by jann »

There were some claims on LZ discord that most of today's bots, including LZ and Minigo already reached AG0 levels (raw nets I guess, hw oc makes difference).

This question has some practical consequences since LZ for example is struggling to gain further strength at the moment, and it is unclear if this is already the limitation of 40b networks (which AG0 used as well), or a consequence of some other problem.
Bill Spight
Honinbo
Posts: 10905
Joined: Wed Apr 21, 2010 1:24 pm
Has thanked: 3651 times
Been thanked: 3373 times

Re: Different Quality of Play between Current AI and AlphaGo

Post by Bill Spight »

jann wrote:There were some claims on LZ discord that most of today's bots, including LZ and Minigo already reached AG0 levels (raw nets I guess, hw oc makes difference).
hw = hardware?
This question has some practical consequences since LZ for example is struggling to gain further strength at the moment, and it is unclear if this is already the limitation of 40b networks (which AG0 used as well), or a consequence of some other problem.
There is no guarantee that the fitness landscape for go is unimodal. Perhaps LZ has climbed the wrong hill. OC, in infinite time it will find its way to another hill. :)
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.
User avatar
jlt
Gosei
Posts: 1786
Joined: Wed Dec 14, 2016 3:59 am
GD Posts: 0
Has thanked: 185 times
Been thanked: 495 times

Re: Different Quality of Play between Current AI and AlphaGo

Post by jlt »

LeelaZero has 16 million self-play games, while AlphaZero has 29 million, so Leelazero may be a bit less strong than AlphaZero, but is probably not very far.

How many playouts are necessary to avoid problems with ladders, life and death, semeais, etc.?
Bill Spight
Honinbo
Posts: 10905
Joined: Wed Apr 21, 2010 1:24 pm
Has thanked: 3651 times
Been thanked: 3373 times

Re: Different Quality of Play between Current AI and AlphaGo

Post by Bill Spight »

jlt wrote:LeelaZero has 16 million self-play games, while AlphaZero has 29 million, so Leelazero may be a bit less strong than AlphaZero, but is probably not very far.
There may be path dependency.
How many playouts are necessary to avoid problems with ladders, life and death, semeais, etc.?
Well, what do you mean by a problem? Certainly if human amateurs can do better with such things in some positions, that may be considered a problem. John Tromp has shown that ladders can pose very difficult problems, and even some "simple" ladder problems are well over 100 go moves deep. Similarly, Berlekamp and Wolfe composed endgame problems that are easy for humans who know the secret, but are almost 100 moves deep, and the candidate plays at each turn are very similar, so that it is unclear how much MCTS resembles brute force for such positions.

For the Elf commentaries, candidate plays, except for the actual game play, with fewer than 1500 playouts were not reported. If Elf's top choice had few competitors, it typically got more than 100k playouts. OC, that was not enough to avoid occasional reading errors where the human choice was demonstrably better.
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.
User avatar
jlt
Gosei
Posts: 1786
Joined: Wed Dec 14, 2016 3:59 am
GD Posts: 0
Has thanked: 185 times
Been thanked: 495 times

Re: Different Quality of Play between Current AI and AlphaGo

Post by jlt »

Bill Spight wrote:
How many playouts are necessary to avoid problems with ladders, life and death, semeais, etc.?
Well, what do you mean by a problem?

Let me ask a more precise question. Are there examples of human games (pro or amateur) in which a recent version of LeelaZero with 100000 playouts chooses a wrong move because of a misread ladder, or life-and-death problem, or semeai?

If the answer is yes, then by which number should I replace 100000 so that the answer becomes "no"?
Gomoto
Gosei
Posts: 1733
Joined: Sun Nov 06, 2016 6:56 am
GD Posts: 0
Location: Earth
Has thanked: 621 times
Been thanked: 310 times

Re: Different Quality of Play between Current AI and AlphaGo

Post by Gomoto »

Therefore, never simply trust AI. When will current AI reach AlphaGo Zero level on typical computers?
I trust AI, even if it makes some mistakes from time to time. I even trust my fellow human beeings, even if they make some mistakes from time to time. The concept to only accept flawless results is inherently dangerous, allthough many people do not realize this, instead of the obvious examples in history and personal experiences.

It is better to be aware that mistakes will happen and react to them in proper ways.

(I acknowledge that the quote says never SIMPLY trust AI.)
jann
Lives in gote
Posts: 445
Joined: Tue May 14, 2019 8:00 pm
GD Posts: 0
Been thanked: 37 times

Re: Different Quality of Play between Current AI and AlphaGo

Post by jann »

Bill Spight wrote:hw = hardware?
Yes.
This question has some practical consequences since LZ for example is struggling to gain further strength at the moment, and it is unclear if this is already the limitation of 40b networks (which AG0 used as well), or a consequence of some other problem.
There is no guarantee that the fitness landscape for go is unimodal. Perhaps LZ has climbed the wrong hill.
In my experience random differences in end results are usually small with different training runs using identical parameters. Seemingly minor differences having unexpectedly large effect on the final net is more common. (In that case the above mentioned question: what did LZ do wrong?)
jlt wrote:LeelaZero has 16 million self-play games, while AlphaZero has 29 million, so Leelazero may be a bit less strong than AlphaZero, but is probably not very far.
It is not easy to compare LZ nets to AG0 nets since the published games not only use high playouts, but are also hand-selected (which may explain some brilliancies). But I doubt LZ would be near AG0 atm. Comparing selfplay numbers also unreliable since there ARE/were significant differences, so a factor of 2 in selfplay amount can easily slip left or right.

The question about ladders is even more difficult. There were some recent examples where LZ (on normal low-ish playouts) misplayed ladders with only a few disturbing stones in vicinity. Low and even mid playouts must rely on net guess about ladder to an extent, since actual playouts may not go along the ladder line for the player the net thinks the ladder is bad for (thus wont see its real outcome until very high playouts).
Bill Spight
Honinbo
Posts: 10905
Joined: Wed Apr 21, 2010 1:24 pm
Has thanked: 3651 times
Been thanked: 3373 times

Re: Different Quality of Play between Current AI and AlphaGo

Post by Bill Spight »

jann wrote:
jlt wrote:LeelaZero has 16 million self-play games, while AlphaZero has 29 million, so Leelazero may be a bit less strong than AlphaZero, but is probably not very far.
It is not easy to compare LZ nets to AG0 nets since the published games not only use high playouts, but are also hand-selected (which may explain some brilliancies).
Yes, I lost faith in the DeepMind team when they only published selected games, which made them useless for scientific purposes. Good PR, I suppose. You got the same kind of thing in chess. They knew better.

At least the Master vs. human games were not selected, and the Elf commentaries are comprehensive. (Different team. :))
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.
Kirby
Honinbo
Posts: 9553
Joined: Wed Feb 24, 2010 6:04 pm
GD Posts: 0
KGS: Kirby
Tygem: 커비라고해
Has thanked: 1583 times
Been thanked: 1707 times

Re: Different Quality of Play between Current AI and AlphaGo

Post by Kirby »

Bill Spight wrote:At least the Master vs. human games were not selected
Except that they only chose to reveal that Master was associated with AlphaGo *after* it showed an impressive result ;-)
be immersed
Bill Spight
Honinbo
Posts: 10905
Joined: Wed Apr 21, 2010 1:24 pm
Has thanked: 3651 times
Been thanked: 3373 times

Re: Different Quality of Play between Current AI and AlphaGo

Post by Bill Spight »

Kirby wrote:
Bill Spight wrote:At least the Master vs. human games were not selected
Except that they only chose to reveal that Master was associated with AlphaGo *after* it showed an impressive result ;-)
That's true. :mrgreen:

But the games were still representative. No selection bias.
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.
hydrogenpi7
Dies in gote
Posts: 63
Joined: Sat Mar 25, 2017 3:19 pm
GD Posts: 0
Been thanked: 3 times

Re: Different Quality of Play between Current AI and AlphaGo

Post by hydrogenpi7 »

jann wrote:There were some claims on LZ discord that most of today's bots, including LZ and Minigo already reached AG0 levels (raw nets I guess, hw oc makes difference).

This question has some practical consequences since LZ for example is struggling to gain further strength at the moment, and it is unclear if this is already the limitation of 40b networks (which AG0 used as well), or a consequence of some other problem.

So using something like a Huawei Atlas 900 cluster and the latest LZ, that means it can beat the AlphaGoZero?

Would Huawei and Google we willing to do a match?
hydrogenpi7
Dies in gote
Posts: 63
Joined: Sat Mar 25, 2017 3:19 pm
GD Posts: 0
Been thanked: 3 times

Re: Different Quality of Play between Current AI and AlphaGo

Post by hydrogenpi7 »

Bill Spight wrote:
jann wrote:
jlt wrote:LeelaZero has 16 million self-play games, while AlphaZero has 29 million, so Leelazero may be a bit less strong than AlphaZero, but is probably not very far.
It is not easy to compare LZ nets to AG0 nets since the published games not only use high playouts, but are also hand-selected (which may explain some brilliancies).
Yes, I lost faith in the DeepMind team when they only published selected games, which made them useless for scientific purposes. Good PR, I suppose. You got the same kind of thing in chess. They knew better.

At least the Master vs. human games were not selected, and the Elf commentaries are comprehensive. (Different team. :))
also for what it is worth, had they played Lee with the Fan version of AG, good chance Lee would have won the match, had they played Ke Jie with the Lee version of AlphaGo, good chance Ke Jie might have won some games etc... I think they were conservative in that they wanted to assure wins at each step up the AI strength ladder, while in retrospect it seems clear the Fan version of AlphaGo was very weak and not "superhuman", certaintly current LZ is already past even the Lee version of AlphaGo
Post Reply