emerus wrote:xela wrote:I really think an example would help.
3rd game I opened:
Game here
Not +/-10 but I am not going to look very hard for something that I've seen at least 1/10 of the games I open into KataGo. If you are a user of KataGo and haven't noticed this by now, then you should look for it.
How often do you think professionals in post-AI age actually have such a large (>5 scoreMean) deficit by move 41? KataGo thinks it is like 10% of the time. It is ludicrous to me.
OK, thanks! Now you're a much stronger player than me, so probably I'm about to learn something important here. But so far I still feel as though I'm missing something. To me it's not looking all that ludicrous.
For anyone else who wants to check it out: we're looking at this game --
Position at move 41:
$$Bc19m41
$$ +---------------------------------------+
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . X O . O X . . |
$$ | . . . . . . . . . . O . . . O X . . . |
$$ | . . . O . . . . . X . . X . . . X . . |
$$ | . . . . . . . . . . . X . O O . . . . |
$$ | . . . . . . . . . . . . . O X X . . . |
$$ | . . . . . . . . . . . . . . . . 1 . . |
$$ | . . . . . . . . . . . . . . . O . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . O . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . O . . . |
$$ | . . X . . . . . . . . . . . X . . . . |
$$ | . . . O . . . . . . . . . . . O O . . |
$$ | . . X O . . . . . . . . . X . . X . . |
$$ | . . X O . X . O . . O . X . . X . . . |
$$ | . . X X O O . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ +---------------------------------------+
- Click Here To Show Diagram Code
[go]$$Bc19m41
$$ +---------------------------------------+
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . X O . O X . . |
$$ | . . . . . . . . . . O . . . O X . . . |
$$ | . . . O . . . . . X . . X . . . X . . |
$$ | . . . . . . . . . . . X . O O . . . . |
$$ | . . . . . . . . . . . . . O X X . . . |
$$ | . . . . . . . . . . . . . . . . 1 . . |
$$ | . . . . . . . . . . . . . . . O . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . O . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . O . . . |
$$ | . . X . . . . . . . . . . . X . . . . |
$$ | . . . O . . . . . . . . . . . O O . . |
$$ | . . X O . . . . . . . . . X . . X . . |
$$ | . . X O . X . O . . O . X . . X . . . |
$$ | . . X X O O . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ +---------------------------------------+[/go]
At move 40, KataGo on my machine with 20,000 playouts has white just over 7 "points" ahead, with 71% winrate. In other words, white has caught up on the board. We know this is possible because white used to win sometimes in the no-komi era. And recently Bill has posted
26 games where one player is at a 90% winrate during the opening.
Back to our Li-Chen game: KataGo doesn't like move 41, so the "average score" changes to W+8, 72% winrate. (On a small number of playouts it actually says W+9, but the number adjusts past a few thousand playouts.) In the first 40 moves, there's no single move that KataGo thinks is a blunder, it's more a matter of several small "errors" adding up to a white lead. I can see a few black moves that go against Uberdude's descriptions of AI style on these forums -- move 5, black approaching a 4-4 instead of making an enclosure from 3-4; move 7 pincer; move 27 hane, so no surprise that KataGo judges things this way.
The idea that "white has caught up on the board" is something I find useful. Looking at the diagram, I can see that black has territory in three places, whereas all that white has is potential -- a framework on the right, maybe a chance to attack black's stones at the top, and first move at top left. So KataGo is trying to teach me that this potential is almost exactly equal to a certain amount of solid territory.
Then KataGo thinks there's some mistakes by both players in the next few moves after 41, and by move 100, black has caught up again. The rest of the game is pretty dramatic. White does indeed attack the black group at the top, and there's a capturing race in the centre. KataGo thinks white doesn't get enough out of the attack, and at move 122 it's looking like a won game for black. But then if KataGo is to be believed, move 153 is a blunder: black just needed to connect against a peep but tried to be too clever, and it's suddenly a close game again. There are a few more swings back and forth in the early endgame. The final result is W+0.5. Overall an interesting game, thanks for sharing this one!
I tried with some other bots. ELF is known for giving more extreme winrates. But here, ELF says W is up 72% at move 40 and 78% at move 41, still less extreme than Bill's examples. LZ with network number 242 has 76% and 76%: it doesn't think move 41 is bad, but agrees that black has fallen behind earlier. An older, gentler LZ (network 157) has 65% and 67%. They're all telling much the same story.
So what's the misleading bit here? Is it that it looked like white was "miles ahead" yet it ended up as a very close game? Are you saying that the position is even at move 41 and all the AIs are giving us the wrong judgement, you don't think it's likely that black made mistakes early in the game then white made mistakes later? Or are you happy with a 70% or 80% winrate at move 40 but don't like to see this translated into a score difference?
Personally I actually would expect to see large swings in the opening, and more than 1/10 of the time. I suspect that many pros aren't going to be happy playing safe, conventional opening moves all the time. There will often be at least one person at the board who thinks they are stronger than the opponent (or better prepared, or luckier on that day) and that the best way to get the win is to unbalance the game. So you take a risk and depart from the usual patterns -- if it pays off, you secure a massive territory or kill a group, you're +15 or more, the opponent resigns. If it doesn't pay off, you're -15 and you're the one resigning. A lot of games do end by resignation, so it seems obvious that even pros make significant mistakes in well over 10% of their games. Why shouldn't some of those mistakes happen before move 40?
Seriously, these are genuine questions, I'm not trying to criticise you. But it's obvious that your instincts are very different from mine here (and you spoke earlier of "anyone who understands networks or computer programming" -- I've done a fair bit of study on those topics), so I want to see what I can learn from this conversation. Thanks again for replying to my first question and showing us an interesting game.