AlphaGo Teach discussion (Go Tool from DeepMind)
-
pookpooi
- Lives in sente
- Posts: 727
- Joined: Sat Aug 21, 2010 12:26 pm
- GD Posts: 10
- Has thanked: 44 times
- Been thanked: 218 times
AlphaGo Teach discussion (Go Tool from DeepMind)
Is this the tool you expect from DeepMind?
https://alphagoteach.deepmind.com
From Aja Huang's facebook
大家好,很高興向大家宣布,AlphaGo教學工具上線了。
這是一個AlphaGo教學的開局庫,相關細節如下:
1. 本教學工具總共收錄了約6000個近代圍棋史上主要的開局變化,從23萬個人類棋譜中收集而來。
2. 所有盤面都有AlphaGo評估的勝率,以及AlphaGo推薦的下法。
3. 所有AlphaGo的勝率與推薦下法,AlphaGo都思考將近10分鐘(1000萬次模擬)。
4. 每一個開局變化,AlphaGo都固定延伸20步棋。加上AlphaGo的下法,整個教學工具約有2萬個分支變化,37萬個盤面。
受限於投稿《自然》雜誌的時程,本教學工具使用的版本是AlphaGo Master。希望大家享受AlphaGo教學工具中的創新下法,也能從中有所收穫
Translation provided by Reddit user AngelLeliel
Hello everyone, I am happy to announce that AlphaGo teaching tool is online. This is a opening library of AlphaGo teaching, details are below: The teaching tool has collected about 6,000 major opening variations in the history of modern Go, collected from 230,000 human plays. All boards have the AlphaGo estimated winning rates, as well as AlphaGo's recommended moves. All estimated winning rates and recommend moves are evaluated by AlphaGo for about 10 minutes (10 million simulations). AlphaGo also plays 20 more moves on every opening variation. With AlphaGo's moves, the entire teaching tools have about 20,000 branches, 370,000 game state. Due to the timeline for submission of the Nature, AlphaGo Master is the version used in this educational tool. Hopefully, everyone will enjoy these innovative moves from AlphaGo, and also benefit from it.
https://alphagoteach.deepmind.com
From Aja Huang's facebook
大家好,很高興向大家宣布,AlphaGo教學工具上線了。
這是一個AlphaGo教學的開局庫,相關細節如下:
1. 本教學工具總共收錄了約6000個近代圍棋史上主要的開局變化,從23萬個人類棋譜中收集而來。
2. 所有盤面都有AlphaGo評估的勝率,以及AlphaGo推薦的下法。
3. 所有AlphaGo的勝率與推薦下法,AlphaGo都思考將近10分鐘(1000萬次模擬)。
4. 每一個開局變化,AlphaGo都固定延伸20步棋。加上AlphaGo的下法,整個教學工具約有2萬個分支變化,37萬個盤面。
受限於投稿《自然》雜誌的時程,本教學工具使用的版本是AlphaGo Master。希望大家享受AlphaGo教學工具中的創新下法,也能從中有所收穫
Translation provided by Reddit user AngelLeliel
Hello everyone, I am happy to announce that AlphaGo teaching tool is online. This is a opening library of AlphaGo teaching, details are below: The teaching tool has collected about 6,000 major opening variations in the history of modern Go, collected from 230,000 human plays. All boards have the AlphaGo estimated winning rates, as well as AlphaGo's recommended moves. All estimated winning rates and recommend moves are evaluated by AlphaGo for about 10 minutes (10 million simulations). AlphaGo also plays 20 more moves on every opening variation. With AlphaGo's moves, the entire teaching tools have about 20,000 branches, 370,000 game state. Due to the timeline for submission of the Nature, AlphaGo Master is the version used in this educational tool. Hopefully, everyone will enjoy these innovative moves from AlphaGo, and also benefit from it.
Last edited by pookpooi on Mon Dec 11, 2017 10:58 am, edited 3 times in total.
-
GoEye2012
- Lives with ko
- Posts: 271
- Joined: Mon Oct 01, 2012 4:45 am
- Rank: IGS 7d
- GD Posts: 0
- IGS: goeye
- Location: Hong Kong
- Has thanked: 2 times
- Been thanked: 49 times
- Contact:
-
gowan
- Gosei
- Posts: 1628
- Joined: Thu Apr 29, 2010 4:40 am
- Rank: senior player
- GD Posts: 1000
- Has thanked: 546 times
- Been thanked: 450 times
Re: AlphaGo Teach discussion
Early in my go career I played through many pro games. I began to imitate pro moves in my own games but being 15 ranks weaker, of course I didn't understand many of the moves I was imitating. My imporession of AlphaGoTeach is that it doesn't teach in the sense of explaining the moves it recommends so it seems to put us in the position I was in may years ago, imitating things I didn't understand. It isn't clear whether this will even work for weaker players more than a few moves into the game. Would AlphaGoTeach work with opening moves of double digit kyu players? Presumably these games would include moves AlphaGo would not have seen in its database. Also, what is the meaning of the probability estimates in games of weak players. Does it mean the probability of a weak black player winning or the probability of AlphaGo winning from that point on?
-
jeromie
- Lives in sente
- Posts: 902
- Joined: Fri Jan 31, 2014 7:12 pm
- Rank: AGA 3k
- GD Posts: 0
- Universal go server handle: jeromie
- Location: Fort Collins, CO
- Has thanked: 319 times
- Been thanked: 287 times
Re: AlphaGo Teach discussion (Go Tool from DeepMind)
I believe Deepmind made an announcement that Ke Jie was going to work with them when they were developing this tool. (I’ll have to search and see if I can find that quote.) I wonder if he has had early access to it, and if that has contributed to his changing style?
-
moha
- Lives in gote
- Posts: 311
- Joined: Wed May 31, 2017 6:49 am
- Rank: 2d
- GD Posts: 0
- Been thanked: 45 times
Re: AlphaGo Teach discussion (Go Tool from DeepMind)
I cannot seem to explore variations in Chrome, even though I see they are there when I load the book into another program...Gomoto wrote:It is working fine here with firefox, some problems with google chrome
- Bonobo
- Oza
- Posts: 2223
- Joined: Fri Dec 23, 2011 6:39 pm
- Rank: OGS 9k
- GD Posts: 0
- OGS: trohde
- Universal go server handle: trohde
- Location: Germany
- Has thanked: 8262 times
- Been thanked: 924 times
- Contact:
Re: AlphaGo Teach discussion (Go Tool from DeepMind)
Thanks, pookpooi — I’d have wished for some sort of Nuremberg Funnel, of course
________
__________________________________________________________________
As for me and my dwindling learning abilities …
________
Works fine here with Google Chrome version 63.0.3239.84 on macOS Sierra 10.12.6 … takes a while to load, though.Gomoto wrote:some problems with google chrome
__________________________________________________________________
As for me and my dwindling learning abilities …
“The only difference between me and a madman is that I’m not mad.” — Salvador Dali ★ Play a slooooow correspondence game with me on OGS? 
-
pookpooi
- Lives in sente
- Posts: 727
- Joined: Sat Aug 21, 2010 12:26 pm
- GD Posts: 10
- Has thanked: 44 times
- Been thanked: 218 times
Re: AlphaGo Teach discussion (Go Tool from DeepMind)
We’re also working on a teaching tool - one of the top requests we’ve received throughout this week. The tool will show AlphaGo’s analysis of Go positions, providing an insight into how the program thinks, and hopefully giving all players and fans the opportunity to see the game through the lens of AlphaGo. We’re particularly honoured that our first collaborator in this effort will be the great Ke Jie, who has agreed to work with us on a study of his match with AlphaGo. We’re excited to hear his insights into these amazing games, and to have the chance to share some of AlphaGo’s own analysis too.jeromie wrote:I believe Deepmind made an announcement that Ke Joe was going to work with them when they were developing this tool. (I’ll have to search and see if I can find that quote.) I wonder if he has had early access to it, and if that has contributed to his changing style?
https://deepmind.com/blog/alphagos-next-move/
Maybe his recent extreme AlphaGo style is because he's absorbing so much information from this collaboration?
- EdLee
- Honinbo
- Posts: 8859
- Joined: Sat Apr 24, 2010 6:49 pm
- GD Posts: 312
- Location: Santa Barbara, CA
- Has thanked: 349 times
- Been thanked: 2070 times
1. This teaching tool has compiled about 6,000 mainstream opening variations in modern Go, gleaned from 230,000 human games.1. The teaching tool has collected a total of about 6,000 major changes in the history of modern Go, collected from 230,000 human chess records.
...as well as AlphaGo's own continuations.as well as AlphaGo's recommended downside.
All of AlphaGo's win rates and variations are based on a thinking time of about 10 minutes per move (10 million simulations).All AlphaGo wins and recommends the next law, AlphaGo are thinking about 10 minutes (10 million simulations).
For each opening variation, AlphaGo provides a fixed continuation of 20 moves.Every opening change, AlphaGo are fixed extended 20 chess.
Together with AlphaGo's moves, the teaching tool includes a total of about 20,000 variations and 370,000 full-board positions.Coupled with the AlphaGo method, the entire teaching tools about 20,000 branch changes, 370,000 disk.
-
Uberdude
- Judan
- Posts: 6727
- Joined: Thu Nov 24, 2011 11:35 am
- Rank: UK 4 dan
- GD Posts: 0
- KGS: Uberdude 4d
- OGS: Uberdude 7d
- Location: Cambridge, UK
- Has thanked: 436 times
- Been thanked: 3718 times
Re: AlphaGo Teach discussion (Go Tool from DeepMind)
Yup, I've told a friend who works at DeepMind. Fails in Chrome on Win 7 for me, but works in Firefox. Funny how building a superhuman go bot is easier than making a cross-browser web UImoha wrote:I cannot seem to explore variations in Chrome, even though I see they are there when I load the book into another program...Gomoto wrote:It is working fine here with firefox, some problems with google chrome
P.S. I actually have a bit of a concern in that rather encouraging creativity, this could make Go more like chess in terms of having a rigid opening book (this giant variation file, 7 MB sgf), will people want to keep playing openings that AlphaGo said was bad?
- EdLee
- Honinbo
- Posts: 8859
- Joined: Sat Apr 24, 2010 6:49 pm
- GD Posts: 312
- Location: Santa Barbara, CA
- Has thanked: 349 times
- Been thanked: 2070 times
Given an AG-evaluated "bad" opening (say, sanrensei), if it requires 3 stones beyond top human level to exploit this 'mistake', then it makes little difference against a human opponent, so people are happy to continue to play it (against humans).will people want to keep playing openings that AlphaGo said was bad?
OTOH, if AG exposes an established human opening as bad, and humans subsequently figure out how to exploit it, then some people may refrain from it.
-
jeromie
- Lives in sente
- Posts: 902
- Joined: Fri Jan 31, 2014 7:12 pm
- Rank: AGA 3k
- GD Posts: 0
- Universal go server handle: jeromie
- Location: Fort Collins, CO
- Has thanked: 319 times
- Been thanked: 287 times
Re: AlphaGo Teach discussion (Go Tool from DeepMind)
I don't think that will be a problem (yet), for a few reasons:Uberdude wrote:I actually have a bit of a concern in that rather encouraging creativity, this could make Go more like chess in terms of having a rigid opening book (this giant variation file, 7 MB sgf), will people want to keep playing openings that AlphaGo said was bad?
- From my (very brief, obviously) time playing with this tool, it looks like the common human moves are only about one percentage point worse than AlphaGo's top choice. For all but the very top players, that's a negligible difference.
- This tool is based on AlphaGo Master, and we've already seen that AlphaGo Zero makes different choices in the opening. That leaves a degree of uncertainty in the evaluation that I believe will keep this from being rigidly followed.
- This tool only takes us through the fuseki, so you have to be able to follow up from the final position in an AlphaGo like manner for it to be useful.
- Because the tool is based entirely on whole board positions (which makes sense based on how AlphaGo evaluates things), if one player deviates from the moves AlphaGo shows the tool can't show how to refute it.
-
Bill Spight
- Honinbo
- Posts: 10905
- Joined: Wed Apr 21, 2010 1:24 pm
- Has thanked: 3651 times
- Been thanked: 3373 times
Re: AlphaGo Teach discussion (Go Tool from DeepMind)
Just wait for AlphaChrome Zero.Uberdude wrote:Funny how building a superhuman go bot is easier than making a cross-browser web UI
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
-
Bill Spight
- Honinbo
- Posts: 10905
- Joined: Wed Apr 21, 2010 1:24 pm
- Has thanked: 3651 times
- Been thanked: 3373 times
Re: AlphaGo Teach discussion (Go Tool from DeepMind)
From what I have seen so far, which, admittedly, isn't much, 4% pts. is within the margin of error.jeromie wrote:From my (very brief, obviously) time playing with this tool, it looks like the common human moves are only about one percentage point worse than AlphaGo's top choice. For all but the very top players, that's a negligible difference.
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
-
Bill Spight
- Honinbo
- Posts: 10905
- Joined: Wed Apr 21, 2010 1:24 pm
- Has thanked: 3651 times
- Been thanked: 3373 times
Re: AlphaGo Teach discussion (Go Tool from DeepMind)
From the web site:
Simulations of what, pray tell?each move’s winning probability was computed by running an independent search of 10 million simulations from that position.
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
