twogtp 1.5.1, 3200 visits for LZ and 1600 visits for KG, all games by resignation, no error, no duplicate game
KataGo wins 31-19 = 62%
details : (KG always appears as W, because of the command -alternate)
Code: Select all
gogui-twogtp -black "C:\\Users\\jm\\gogui151\\LZ017\\leelaz.exe ..." -white "C:\\Users\\jm\\gogui151\\kata134\\katago.exe ..." -games 25 -sgffile C:\\Users\\jm\\gogui151\\kata134_white_LZ270b.dat -auto -komi 7.5
gogui-twogtp -black "C:\\Users\\jm\\gogui151\\kata134\\katago.exe ..." -white "C:\\Users\\jm\\gogui151\\LZ017\\leelaz.exe ..." -games 25 -sgffile C:\\Users\\jm\\gogui151\\kata134_black_LZ270b.dat -auto -komi 7.5You're right, with -alternate, results are is a bit strange to read, but with -alternate, there's only one stat file, with all results and details on the same page, I like that. So... I don't know what's bestAnd wrote:it is possible without an “-alternate” in one bat file
Not at all, you're right, I'll try to run longer tests, and give more details about the settings.lightvector wrote:came off as critical...
Code: Select all
# Resignation occurs if for at least resignConsecTurns in a row,
# the winLossUtility (which is on a [-1,1] scale) is below resignThreshold.
allowResignation = true
resignThreshold = -0.90
resignConsecTurns = 3Yep, the scale is -1 to 1, so it means 5%. It would have to be a weird -1 to 0 scale for it to mean 10%. And yeah, to conduct a careful comparison between different bots you'd want to take care with the resignation and temperature settings (and optimal threading and tuning, and playouts vs visits, etc). Generally, since winrates are trained on at self-play conditions and are affected by the neural net's own Bayesian-like uncertainty, whereas match conditions are played with less noise and the search helps reduce uncertainty in a way that the raw net cannot do alone, it's very rare for the game to turn around after around 5%-10% winrate in match conditions (far less than 5% to 10% of the time). I agree 20% is a bit on the high side. Although I haven't studied the exact false-resign rate so I'm not entirely sure about this, despite that it should be less than 20% of resigned games, it might still be high enough to be noticeable.ez4u wrote:My mistake!!! I misremembered that the "-r" parameter is the one for randomness, but it is not. That is "-m".![]()
However, instead of the question on randomness, don't we have an issue with when they resign? Vargo's gogui2gtp command is setting "-r 20", which AFAIK tells LZ to resign if the winrate falls below 20%. Meanwhile in Katago's default configuration...Hence Katago will only resign if it is below 5% (because the scale is -1 to 1?, or is it 10%?) for three moves in a row?Code: Select all
# Resignation occurs if for at least resignConsecTurns in a row, # the winLossUtility (which is on a [-1,1] scale) is below resignThreshold. allowResignation = true resignThreshold = -0.90 resignConsecTurns = 3
In any case, this does not seem to be symmetrical between the engines. My naive interpretation is that this represents an advantage in match conditions for Katago. Am I reading this correctly?