AlphaGo Teach discussion (Go Tool from DeepMind)

For discussing go computing, software announcements, etc.
pookpooi
Lives in sente
Posts: 727
Joined: Sat Aug 21, 2010 12:26 pm
GD Posts: 10
Has thanked: 44 times
Been thanked: 218 times

AlphaGo Teach discussion (Go Tool from DeepMind)

Post by pookpooi »

Is this the tool you expect from DeepMind?

https://alphagoteach.deepmind.com

From Aja Huang's facebook

大家好,很高興向大家宣布,AlphaGo教學工具上線了。
這是一個AlphaGo教學的開局庫,相關細節如下:
1. 本教學工具總共收錄了約6000個近代圍棋史上主要的開局變化,從23萬個人類棋譜中收集而來。
2. 所有盤面都有AlphaGo評估的勝率,以及AlphaGo推薦的下法。
3. 所有AlphaGo的勝率與推薦下法,AlphaGo都思考將近10分鐘(1000萬次模擬)。
4. 每一個開局變化,AlphaGo都固定延伸20步棋。加上AlphaGo的下法,整個教學工具約有2萬個分支變化,37萬個盤面。
受限於投稿《自然》雜誌的時程,本教學工具使用的版本是AlphaGo Master。希望大家享受AlphaGo教學工具中的創新下法,也能從中有所收穫

Translation provided by Reddit user AngelLeliel
Hello everyone, I am happy to announce that AlphaGo teaching tool is online. This is a opening library of AlphaGo teaching, details are below: The teaching tool has collected about 6,000 major opening variations in the history of modern Go, collected from 230,000 human plays. All boards have the AlphaGo estimated winning rates, as well as AlphaGo's recommended moves. All estimated winning rates and recommend moves are evaluated by AlphaGo for about 10 minutes (10 million simulations). AlphaGo also plays 20 more moves on every opening variation. With AlphaGo's moves, the entire teaching tools have about 20,000 branches, 370,000 game state. Due to the timeline for submission of the Nature, AlphaGo Master is the version used in this educational tool. Hopefully, everyone will enjoy these innovative moves from AlphaGo, and also benefit from it.
Last edited by pookpooi on Mon Dec 11, 2017 10:58 am, edited 3 times in total.
GoEye2012
Lives with ko
Posts: 271
Joined: Mon Oct 01, 2012 4:45 am
Rank: IGS 7d
GD Posts: 0
IGS: goeye
Location: Hong Kong
Has thanked: 2 times
Been thanked: 49 times
Contact:

Re: AlphaGo Teach discussion

Post by GoEye2012 »

pookpooi wrote:Is this the tool you expect from DeepMind?

https://alphagoteach.deepmind.com


A little disappointed.
gowan
Gosei
Posts: 1628
Joined: Thu Apr 29, 2010 4:40 am
Rank: senior player
GD Posts: 1000
Has thanked: 546 times
Been thanked: 450 times

Re: AlphaGo Teach discussion

Post by gowan »

Early in my go career I played through many pro games. I began to imitate pro moves in my own games but being 15 ranks weaker, of course I didn't understand many of the moves I was imitating. My imporession of AlphaGoTeach is that it doesn't teach in the sense of explaining the moves it recommends so it seems to put us in the position I was in may years ago, imitating things I didn't understand. It isn't clear whether this will even work for weaker players more than a few moves into the game. Would AlphaGoTeach work with opening moves of double digit kyu players? Presumably these games would include moves AlphaGo would not have seen in its database. Also, what is the meaning of the probability estimates in games of weak players. Does it mean the probability of a weak black player winning or the probability of AlphaGo winning from that point on?
Gomoto
Gosei
Posts: 1733
Joined: Sun Nov 06, 2016 6:56 am
GD Posts: 0
Location: Earth
Has thanked: 621 times
Been thanked: 310 times

Re: AlphaGo Teach discussion (Go Tool from DeepMind)

Post by Gomoto »

I for one, love it.

It is working fine here with firefox, some problems with google chrome :lol:
jeromie
Lives in sente
Posts: 902
Joined: Fri Jan 31, 2014 7:12 pm
Rank: AGA 3k
GD Posts: 0
Universal go server handle: jeromie
Location: Fort Collins, CO
Has thanked: 319 times
Been thanked: 287 times

Re: AlphaGo Teach discussion (Go Tool from DeepMind)

Post by jeromie »

I believe Deepmind made an announcement that Ke Jie was going to work with them when they were developing this tool. (I’ll have to search and see if I can find that quote.) I wonder if he has had early access to it, and if that has contributed to his changing style?
moha
Lives in gote
Posts: 311
Joined: Wed May 31, 2017 6:49 am
Rank: 2d
GD Posts: 0
Been thanked: 45 times

Re: AlphaGo Teach discussion (Go Tool from DeepMind)

Post by moha »

Gomoto wrote:It is working fine here with firefox, some problems with google chrome :lol:
I cannot seem to explore variations in Chrome, even though I see they are there when I load the book into another program...
User avatar
Bonobo
Oza
Posts: 2223
Joined: Fri Dec 23, 2011 6:39 pm
Rank: OGS 9k
GD Posts: 0
OGS: trohde
Universal go server handle: trohde
Location: Germany
Has thanked: 8262 times
Been thanked: 924 times
Contact:

Re: AlphaGo Teach discussion (Go Tool from DeepMind)

Post by Bonobo »

Thanks, pookpooi — I’d have wished for some sort of Nuremberg Funnel, of course :D
________

Gomoto wrote:some problems with google chrome
Works fine here with Google Chrome version 63.0.3239.84 on macOS Sierra 10.12.6 … takes a while to load, though.

__________________________________________________________________

As for me and my dwindling learning abilities …
Image
“The only difference between me and a madman is that I’m not mad.” — Salvador Dali ★ Play a slooooow correspondence game with me on OGS? :)
pookpooi
Lives in sente
Posts: 727
Joined: Sat Aug 21, 2010 12:26 pm
GD Posts: 10
Has thanked: 44 times
Been thanked: 218 times

Re: AlphaGo Teach discussion (Go Tool from DeepMind)

Post by pookpooi »

jeromie wrote:I believe Deepmind made an announcement that Ke Joe was going to work with them when they were developing this tool. (I’ll have to search and see if I can find that quote.) I wonder if he has had early access to it, and if that has contributed to his changing style?


We’re also working on a teaching tool - one of the top requests we’ve received throughout this week. The tool will show AlphaGo’s analysis of Go positions, providing an insight into how the program thinks, and hopefully giving all players and fans the opportunity to see the game through the lens of AlphaGo. We’re particularly honoured that our first collaborator in this effort will be the great Ke Jie, who has agreed to work with us on a study of his match with AlphaGo. We’re excited to hear his insights into these amazing games, and to have the chance to share some of AlphaGo’s own analysis too.

https://deepmind.com/blog/alphagos-next-move/

Maybe his recent extreme AlphaGo style is because he's absorbing so much information from this collaboration?
User avatar
EdLee
Honinbo
Posts: 8859
Joined: Sat Apr 24, 2010 6:49 pm
GD Posts: 312
Location: Santa Barbara, CA
Has thanked: 349 times
Been thanked: 2070 times

Post by EdLee »

1. The teaching tool has collected a total of about 6,000 major changes in the history of modern Go, collected from 230,000 human chess records.
1. This teaching tool has compiled about 6,000 mainstream opening variations in modern Go, gleaned from 230,000 human games.
as well as AlphaGo's recommended downside.
...as well as AlphaGo's own continuations.
All AlphaGo wins and recommends the next law, AlphaGo are thinking about 10 minutes (10 million simulations).
All of AlphaGo's win rates and variations are based on a thinking time of about 10 minutes per move (10 million simulations).
Every opening change, AlphaGo are fixed extended 20 chess.
For each opening variation, AlphaGo provides a fixed continuation of 20 moves.
Coupled with the AlphaGo method, the entire teaching tools about 20,000 branch changes, 370,000 disk.
Together with AlphaGo's moves, the teaching tool includes a total of about 20,000 variations and 370,000 full-board positions.
Uberdude
Judan
Posts: 6727
Joined: Thu Nov 24, 2011 11:35 am
Rank: UK 4 dan
GD Posts: 0
KGS: Uberdude 4d
OGS: Uberdude 7d
Location: Cambridge, UK
Has thanked: 436 times
Been thanked: 3718 times

Re: AlphaGo Teach discussion (Go Tool from DeepMind)

Post by Uberdude »

moha wrote:
Gomoto wrote:It is working fine here with firefox, some problems with google chrome :lol:
I cannot seem to explore variations in Chrome, even though I see they are there when I load the book into another program...

Yup, I've told a friend who works at DeepMind. Fails in Chrome on Win 7 for me, but works in Firefox. Funny how building a superhuman go bot is easier than making a cross-browser web UI :lol:

P.S. I actually have a bit of a concern in that rather encouraging creativity, this could make Go more like chess in terms of having a rigid opening book (this giant variation file, 7 MB sgf), will people want to keep playing openings that AlphaGo said was bad?
User avatar
EdLee
Honinbo
Posts: 8859
Joined: Sat Apr 24, 2010 6:49 pm
GD Posts: 312
Location: Santa Barbara, CA
Has thanked: 349 times
Been thanked: 2070 times

Post by EdLee »

will people want to keep playing openings that AlphaGo said was bad?
Given an AG-evaluated "bad" opening (say, sanrensei), if it requires 3 stones beyond top human level to exploit this 'mistake', then it makes little difference against a human opponent, so people are happy to continue to play it (against humans).

OTOH, if AG exposes an established human opening as bad, and humans subsequently figure out how to exploit it, then some people may refrain from it.
jeromie
Lives in sente
Posts: 902
Joined: Fri Jan 31, 2014 7:12 pm
Rank: AGA 3k
GD Posts: 0
Universal go server handle: jeromie
Location: Fort Collins, CO
Has thanked: 319 times
Been thanked: 287 times

Re: AlphaGo Teach discussion (Go Tool from DeepMind)

Post by jeromie »

Uberdude wrote:I actually have a bit of a concern in that rather encouraging creativity, this could make Go more like chess in terms of having a rigid opening book (this giant variation file, 7 MB sgf), will people want to keep playing openings that AlphaGo said was bad?


I don't think that will be a problem (yet), for a few reasons:
  • From my (very brief, obviously) time playing with this tool, it looks like the common human moves are only about one percentage point worse than AlphaGo's top choice. For all but the very top players, that's a negligible difference.
  • This tool is based on AlphaGo Master, and we've already seen that AlphaGo Zero makes different choices in the opening. That leaves a degree of uncertainty in the evaluation that I believe will keep this from being rigidly followed.
  • This tool only takes us through the fuseki, so you have to be able to follow up from the final position in an AlphaGo like manner for it to be useful.
  • Because the tool is based entirely on whole board positions (which makes sense based on how AlphaGo evaluates things), if one player deviates from the moves AlphaGo shows the tool can't show how to refute it.

That may be more of a concern when we have general access to a computer program that can play at AlphaGo's level, though. At the very least, people will be able to hunt for refutations to common joseki / fuseki, and it's likely that some will fall out of favor. But that's been true of every era of Go.
Bill Spight
Honinbo
Posts: 10905
Joined: Wed Apr 21, 2010 1:24 pm
Has thanked: 3651 times
Been thanked: 3373 times

Re: AlphaGo Teach discussion (Go Tool from DeepMind)

Post by Bill Spight »

Uberdude wrote:Funny how building a superhuman go bot is easier than making a cross-browser web UI :lol:


Just wait for AlphaChrome Zero. :mrgreen:
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.
Bill Spight
Honinbo
Posts: 10905
Joined: Wed Apr 21, 2010 1:24 pm
Has thanked: 3651 times
Been thanked: 3373 times

Re: AlphaGo Teach discussion (Go Tool from DeepMind)

Post by Bill Spight »

jeromie wrote:From my (very brief, obviously) time playing with this tool, it looks like the common human moves are only about one percentage point worse than AlphaGo's top choice. For all but the very top players, that's a negligible difference.


From what I have seen so far, which, admittedly, isn't much, 4% pts. is within the margin of error.
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.
Bill Spight
Honinbo
Posts: 10905
Joined: Wed Apr 21, 2010 1:24 pm
Has thanked: 3651 times
Been thanked: 3373 times

Re: AlphaGo Teach discussion (Go Tool from DeepMind)

Post by Bill Spight »

From the web site:

each move’s winning probability was computed by running an independent search of 10 million simulations from that position.


Simulations of what, pray tell?
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins

Visualize whirled peas.

Everything with love. Stay safe.
Post Reply