Number of playouts vs bayesian estimate of win rate
Posted: Sat Apr 18, 2020 4:07 am
I have just begun reading up on how the new amazing Go bots work, so please correct me if anything I write here is misguided or outright wrong.
In the thread "How much do strong bots agree on moves? Case study", the quality of moves in the AlphaGo Master games is discussed primarily in terms of "nth ranked choice" by KataGo, and secondarily in terms of number of playouts. Thread here:
viewtopic.php?f=18&t=17359
Three related questions popped into my head while reading that thread.
1. In a given board position, it could be that there are 10 moves with negligible difference in win rates, so that choosing among them is mostly a matter of personal preference. In another position, there could be only one good move, and even the second best choice has a much lower win rate. Given this variability, why discuss move quality in terms of "nth ranked choice" at all? To me the nth choice carries very little information on move quality. Am I wrong?
2. The SGF also included number of playouts for the top 3 moves. This gives a bit more information I believe. To my (very fuzzy) understanding, each playout is selected according to estimated win rates (Bayesian adjusted by number of playouts with a bit of random noise to spread moves out proportionally). The final move is the one with the most playouts, which should also be the move with the highest Bayesian win rate. Two moves with similar number of playouts should have similar Bayesian adjusted win rates. I'm in way over my head here, but is this roughly correct? Is there a direct relation between playouts and estimated win rate? For example, if one move has 10k playouts and another only 1k, it possible to say anything about difference in expected win rate?
3. When discussing move quality, aren't win rates (again, Bayesian adjusted by number of playouts) the best measure? Why do experts often use fuzzier measures like nth choice or number of playouts?
Thanks in advance for educating me!
In the thread "How much do strong bots agree on moves? Case study", the quality of moves in the AlphaGo Master games is discussed primarily in terms of "nth ranked choice" by KataGo, and secondarily in terms of number of playouts. Thread here:
viewtopic.php?f=18&t=17359
Three related questions popped into my head while reading that thread.
1. In a given board position, it could be that there are 10 moves with negligible difference in win rates, so that choosing among them is mostly a matter of personal preference. In another position, there could be only one good move, and even the second best choice has a much lower win rate. Given this variability, why discuss move quality in terms of "nth ranked choice" at all? To me the nth choice carries very little information on move quality. Am I wrong?
2. The SGF also included number of playouts for the top 3 moves. This gives a bit more information I believe. To my (very fuzzy) understanding, each playout is selected according to estimated win rates (Bayesian adjusted by number of playouts with a bit of random noise to spread moves out proportionally). The final move is the one with the most playouts, which should also be the move with the highest Bayesian win rate. Two moves with similar number of playouts should have similar Bayesian adjusted win rates. I'm in way over my head here, but is this roughly correct? Is there a direct relation between playouts and estimated win rate? For example, if one move has 10k playouts and another only 1k, it possible to say anything about difference in expected win rate?
3. When discussing move quality, aren't win rates (again, Bayesian adjusted by number of playouts) the best measure? Why do experts often use fuzzier measures like nth choice or number of playouts?
Thanks in advance for educating me!