Winrate already means confidence in winning, and uncertainity is measured by visit counts. So we already have [estimate,visits] which MCTS is based on. You can, for example, adjust down the weight of the just performed visit based on some error term, or adjust its resulting value estimate as above. In both cases the error term is assimilated. And in the end the answer to whether A>B will come in the form of an ultimate estimate from search, which contains all known information - and this is what the user sees (with visit totals). To get an idea about the accuracy of this you need further, external information.chut wrote:The error term, if it exists, would be a measure of how confident we have of a certain move. That could be used to guide MCTS to focus more on the more uncertain branch right?I still don't see how. If a bot had a winrate estimate and an error term, it could transform those to a better estimate with no error term. Why would it keep and use the lower quality estimate in this case? And the error term doesn't apply to the corrected estimate.
Btw I would guess if you test the actual correlation to game outcomes you may find it is reasonably accurate near 50% and 100%, and not necessarily linear but monotonic in between. So 75% may be somewhat off from 0.75, but still measurably better than 70%. The correctness of this relative estimate difference is what bot strength is based on.