A model to understand komaster and komonster
Posted: Mon Jan 03, 2022 2:03 pm
Komaster is a term developed by Berlekamp & students. It regards a ko where one side has so many more ko threats that they can win the ko even without losing any points even if they haven't yet ended the ko.
In this post, I solve
a) Direct ko
b) Direct ko + RIGHT enlargement
in the cases
i) Neutral threat environment
ii) RIGHT is komaster (some komonster)
NTE appears in Bill's paper (see next post), and I misunderstood it in this post, so although my calculations are close to the right answer, they are simpler than the correct NTE calculations. I suspect that the examples in this post are simple enough that my calculations are sufficient. However, in general, my method should give results that are close to correct, but slightly off.
Direct ko
Neutral threat environment
A direct ko has 4 positions, A-B-C-D. Say the two players are LEFT and RIGHT. Let the ko swing be K, so the difference in points between winning and losing the ko is K points.
Say LEFT moves left locally when they can and RIGHT moves right. We score for RIGHT. Let A have score 0 and D have score K.
Normally if ko threats are balanced, we can estimate this as
A = 0 , B = K/3, C = 2K/3, D = K
by taking the average. The average can be argued by Jasiek's summary that all moves values (gote/sente etc) use the same unit in miai counting and playing the move with highest gain is such a good default strategy that we can assume as an estimate that a transition from B to C is only worth playing if a transition from B to A by RIGHT is worth playing by LEFT. This forces us to count B as (A+C)/2). The gain of every move here is K/3 in miai counting (or 2K/3 in deiri swing).
(Thermography provides a model to "prove" these numbers, at temperature T=0)
Komaster
However, if RIGHT is ko master, then we can value as if RIGHT has already won the ko in position C. Hence, our new estimate looks more like
A = 0, B = K/2, C = K, D = K.
(This doesn't necessarily mean RIGHT will actually win the ko in position C, but only that such a result is the optimal result for both, so perhaps RIGHT can lose the ko and get a compensation of K elsewhere).
Temperature
Technically we also need to include a temperature by including a tax of T for every move played and a bonus of T/2 for sente. We can think of T as the largest move in the environment of the ko.
NTE
So normally we might have
A = 3T/2, B = K/3+T/2, C=2K-T/2, D = K-3T/2
When K<3T, A>D, so it isn't worth playing in the ko.
Ko master
A = 3T/2, B = K/2+T/2 , C = K-T/2, D = K-3T/2
Ko monster
This means that RIGHT has so many ko threats that they can even keep playing in the environment (respond to every move in the environment that the opponent resorts to playing as a ko threat) until the temperature goes down to t<T.
Now we might have something more like
A = 3T/2, B = K/2+(3T-t)/4 , C = K-t/2, D = K-3t/2
where all the values have shifted in RIGHT's favour.
Hopefully this was a correct understanding.
Now for some more tentative analysis that might be stretching my understanding too far.
Evaluating a multi-stage ko seems a bit like an extension of evaluation of a hyperactive ko. This one where only in the C position, both players have another option. RIGHT may risk moving to E where LEFT has many local ko threats but might gain K+L upon winning the ko, or LEFT might defend at E by moving to say F at the cost of a move instead of moving to B (this removes the possibility of RIGHT winning E entirely).
Of course, RIGHT can't profit from this in a neutral threat environment since LEFT has local threats, but it will change the equilibrium temperature of the ko.
Direct ko + RIGHT enlargement option
NTE
If RIGHT's best play is to end the ko with D, then since D leaves a threat of {0|L-T} behind (if L>T) then we can assume playing L is the largest move. (if L<T, we can forget the possibility of playing E).
If L>T, LEFT must defend and at D, RIGHT expects to gain a move even though they don't get L.
A = 3T/2, B = (K+T)/3 + T/2, C = 2(K+T)/3 - T/2 , D = K+T - 3T/2
If RIGHT's best play is to play one move towards E from C, then we just write E on the right, and enter a different stage of ko where RIGHT has spent a move (like an approach ko) but has hooked on E.
A2 = T/2, B2 = (K+L)/3 - T/2, C2 = 2(K+L)/3 - 3T/2 , E = K+L - 5T/2
Since the transition occurred at state C, we want to know when equilibrium occurs, namely when C=C2 (with a transition)
we have C2-C = 2(L-T)/3 - T = 0 so L=5T/2 is the critical loss above which RIGHT wants to play towards E. We can see (as expected) that this is pretty large. This is because playing normally to D would have gotten profit from the existence of E anyway. Curiously, it is independent of K.
Similarly, if LEFT's best play is to play one move to F from C, we again enter a different stage of ko
A3 = 5T/2, B3 = K/3 - 3T/2, F = C3 = 2K/3 + T/2 , D3 = K - T/2
Equating at C again, we have
C3-C = 2T/3 - T = 0. So T=0. So only when T=0 does LEFT consider to block and wipe out the possibility of E completely. (unless C2 is relevant)
C3-C2 = 2T - 2L/3 = 0. So L=3T. When L>3T, LEFT prefers to block the possibility of E.
Summary
- T=0: LEFT can block and move to F at no cost if RIGHT might win the ko
-L<T: Both sides should treat the threat of E as a bonus of (L/2) added onto the size of the ko (with K->K+L/2). As usual, moves gain K/3. If K_old + L/2 = K_new>3T, the ko is worth fighting. So the ko is a bit bigger with higher temperature. RIGHT's profit from the existence of E is always rho x L/2 (which appears at T=L/2), where rho = 0 in A, 1/3 in B, 2/3 in C, 1 in D. C=2(K+L/2)/3-T/2
-T<L<5T/2: In this case, connecting at D is sente and K_new = K_old+T. RIGHT's profit from the existence of E is rho x T. C = 2(K+T)/3 - T/2
-5T/2<L<3T: RIGHT ignores D and fights the ko directly. C = 2(K+L)/3 - 3T/2
-L>3T: LEFT blocks the possibility of RIGHT enlarging the ko. C = 2K/3 +T/2
It is most interesting that there is a sweet spot at 5T/2<L<3T where fighting over E occurs. This is because it is large enough to be interesting for RIGHT but also LEFT has to pay a penalty of T to prevent it. This is why they say "Don't play go if you fear ko." The point is that, preventing ko variations has a cost, and hence there is a region where playing ko is optimal for both sides. Only if the ko is really big should LEFT spend a move to prevent it.
RIGHT Ko master
A = 3T/2, B = (K+T)/2 + T/2, C = K+T - T/2 , D = K+T - 3T/2
A2 = T/2, B2 = (K+L/2 - T/2, C2 = K+L - 3T/2 , E = K+L - 5T/2
A3 = 5T/2, B3 = K/2 - 3T/2, F = C3 = K + T/2 , D3 = K - T/2
C2=C when L=2T
C3=C always
C3=C2 when L=2T
Summary
We have C = K+T/2 in all cases
-L<2T: RIGHT shouldn't try to enlarge the ko (RIGHT will win the ko and threaten L anyway). Similarly, LEFT should not both stopping RIGHT from enlarging the ko.
-L>2T: RIGHT should try to enlarge the ko. However LEFT should never give RIGHT the chance and should immediately block the ko enlargement by spending a move.
In this post, I solve
a) Direct ko
b) Direct ko + RIGHT enlargement
in the cases
i) Neutral threat environment
ii) RIGHT is komaster (some komonster)
NTE appears in Bill's paper (see next post), and I misunderstood it in this post, so although my calculations are close to the right answer, they are simpler than the correct NTE calculations. I suspect that the examples in this post are simple enough that my calculations are sufficient. However, in general, my method should give results that are close to correct, but slightly off.
Direct ko
Neutral threat environment
A direct ko has 4 positions, A-B-C-D. Say the two players are LEFT and RIGHT. Let the ko swing be K, so the difference in points between winning and losing the ko is K points.
Say LEFT moves left locally when they can and RIGHT moves right. We score for RIGHT. Let A have score 0 and D have score K.
Normally if ko threats are balanced, we can estimate this as
A = 0 , B = K/3, C = 2K/3, D = K
by taking the average. The average can be argued by Jasiek's summary that all moves values (gote/sente etc) use the same unit in miai counting and playing the move with highest gain is such a good default strategy that we can assume as an estimate that a transition from B to C is only worth playing if a transition from B to A by RIGHT is worth playing by LEFT. This forces us to count B as (A+C)/2). The gain of every move here is K/3 in miai counting (or 2K/3 in deiri swing).
(Thermography provides a model to "prove" these numbers, at temperature T=0)
Komaster
However, if RIGHT is ko master, then we can value as if RIGHT has already won the ko in position C. Hence, our new estimate looks more like
A = 0, B = K/2, C = K, D = K.
(This doesn't necessarily mean RIGHT will actually win the ko in position C, but only that such a result is the optimal result for both, so perhaps RIGHT can lose the ko and get a compensation of K elsewhere).
Temperature
Technically we also need to include a temperature by including a tax of T for every move played and a bonus of T/2 for sente. We can think of T as the largest move in the environment of the ko.
NTE
So normally we might have
A = 3T/2, B = K/3+T/2, C=2K-T/2, D = K-3T/2
When K<3T, A>D, so it isn't worth playing in the ko.
Ko master
A = 3T/2, B = K/2+T/2 , C = K-T/2, D = K-3T/2
Ko monster
This means that RIGHT has so many ko threats that they can even keep playing in the environment (respond to every move in the environment that the opponent resorts to playing as a ko threat) until the temperature goes down to t<T.
Now we might have something more like
A = 3T/2, B = K/2+(3T-t)/4 , C = K-t/2, D = K-3t/2
where all the values have shifted in RIGHT's favour.
Hopefully this was a correct understanding.
Evaluating a multi-stage ko seems a bit like an extension of evaluation of a hyperactive ko. This one where only in the C position, both players have another option. RIGHT may risk moving to E where LEFT has many local ko threats but might gain K+L upon winning the ko, or LEFT might defend at E by moving to say F at the cost of a move instead of moving to B (this removes the possibility of RIGHT winning E entirely).
Of course, RIGHT can't profit from this in a neutral threat environment since LEFT has local threats, but it will change the equilibrium temperature of the ko.
Direct ko + RIGHT enlargement option
NTE
If RIGHT's best play is to end the ko with D, then since D leaves a threat of {0|L-T} behind (if L>T) then we can assume playing L is the largest move. (if L<T, we can forget the possibility of playing E).
If L>T, LEFT must defend and at D, RIGHT expects to gain a move even though they don't get L.
A = 3T/2, B = (K+T)/3 + T/2, C = 2(K+T)/3 - T/2 , D = K+T - 3T/2
If RIGHT's best play is to play one move towards E from C, then we just write E on the right, and enter a different stage of ko where RIGHT has spent a move (like an approach ko) but has hooked on E.
A2 = T/2, B2 = (K+L)/3 - T/2, C2 = 2(K+L)/3 - 3T/2 , E = K+L - 5T/2
Since the transition occurred at state C, we want to know when equilibrium occurs, namely when C=C2 (with a transition)
we have C2-C = 2(L-T)/3 - T = 0 so L=5T/2 is the critical loss above which RIGHT wants to play towards E. We can see (as expected) that this is pretty large. This is because playing normally to D would have gotten profit from the existence of E anyway. Curiously, it is independent of K.
Similarly, if LEFT's best play is to play one move to F from C, we again enter a different stage of ko
A3 = 5T/2, B3 = K/3 - 3T/2, F = C3 = 2K/3 + T/2 , D3 = K - T/2
Equating at C again, we have
C3-C = 2T/3 - T = 0. So T=0. So only when T=0 does LEFT consider to block and wipe out the possibility of E completely. (unless C2 is relevant)
C3-C2 = 2T - 2L/3 = 0. So L=3T. When L>3T, LEFT prefers to block the possibility of E.
Summary
- T=0: LEFT can block and move to F at no cost if RIGHT might win the ko
-L<T: Both sides should treat the threat of E as a bonus of (L/2) added onto the size of the ko (with K->K+L/2). As usual, moves gain K/3. If K_old + L/2 = K_new>3T, the ko is worth fighting. So the ko is a bit bigger with higher temperature. RIGHT's profit from the existence of E is always rho x L/2 (which appears at T=L/2), where rho = 0 in A, 1/3 in B, 2/3 in C, 1 in D. C=2(K+L/2)/3-T/2
-T<L<5T/2: In this case, connecting at D is sente and K_new = K_old+T. RIGHT's profit from the existence of E is rho x T. C = 2(K+T)/3 - T/2
-5T/2<L<3T: RIGHT ignores D and fights the ko directly. C = 2(K+L)/3 - 3T/2
-L>3T: LEFT blocks the possibility of RIGHT enlarging the ko. C = 2K/3 +T/2
It is most interesting that there is a sweet spot at 5T/2<L<3T where fighting over E occurs. This is because it is large enough to be interesting for RIGHT but also LEFT has to pay a penalty of T to prevent it. This is why they say "Don't play go if you fear ko." The point is that, preventing ko variations has a cost, and hence there is a region where playing ko is optimal for both sides. Only if the ko is really big should LEFT spend a move to prevent it.
RIGHT Ko master
A = 3T/2, B = (K+T)/2 + T/2, C = K+T - T/2 , D = K+T - 3T/2
A2 = T/2, B2 = (K+L/2 - T/2, C2 = K+L - 3T/2 , E = K+L - 5T/2
A3 = 5T/2, B3 = K/2 - 3T/2, F = C3 = K + T/2 , D3 = K - T/2
C2=C when L=2T
C3=C always
C3=C2 when L=2T
Summary
We have C = K+T/2 in all cases
-L<2T: RIGHT shouldn't try to enlarge the ko (RIGHT will win the ko and threaten L anyway). Similarly, LEFT should not both stopping RIGHT from enlarging the ko.
-L>2T: RIGHT should try to enlarge the ko. However LEFT should never give RIGHT the chance and should immediately block the ko enlargement by spending a move.