Life In 19x19 http://lifein19x19.com/ |
|
Numerical evaluation theory of thickness http://lifein19x19.com/viewtopic.php?f=15&t=18732 |
Page 1 of 2 |
Author: | dhu163 [ Fri May 13, 2022 3:47 pm ] |
Post subject: | Numerical evaluation theory of thickness |
There used to be a lot of discussion in L19 about the true meaning of thickness. There is an old heuristic of counting 3 points for every height of a thick wall (note that the thickness of the wall doesn't directly contribute to this equation, and instead is normally discounted for the value of an attack by the opponent, assuming the wall can be defended). I will add a theorist's perspective. If a group G is completely alive in empty space, they regardless of how the opponent approaches, unless they occupy the empty positions neighbouring G, then G has at least 50% chance of occupying any (dame) point adjacent to G. There is also good chance of making territory locally too. Even if the opponent is strong, they still can't afford to play too loosely near to wall, or else they can be captured. If we estimate, a la Spight influence functions, the influence of strength decays by a factor of two with every move played, then under stone counting, the wall has a value of 1 + 1/2 + 1/4 + ... = 2 for each height. I can't really justify that the possibility of making territory doesn't change this value, but atm, I can't even tell you if it should increase or decrease this value. If the opponent has a strong group nearby, then they compete for value, perhaps by cancelling out value somewhere along the way so it becomes 1 + 1/2 + 1/4 + 1/4 = 2 (sente-gote tends to double the probability a boundary play occurs). This is no change at all. (note we haven't counted the value of the opponent's group) If the opponent doesn't have a strong group nearby, then pushing the value of the wall up to 3 per height seems a good estimate. There is probably an argument for why it must be less than 4. Now compared this to if G wasn't completely alive or could be cut. For every move that the opponent plays, it is more likely to be sente. If the group is killed, then though it still has lingering influence with threats to save it, this becomes a drastic reduction. For example, perhaps we should instead count influence as 0 + 0 + 1/4 if the capturer expects to answer threats two moves away from saving the group. Then for every move that the opponent plays nearby, if we assume the owner of G may ignore the threats (ignoring the sente reduction), we can add a value of (2-1/4)/(2^n) if the threat is n moves from capturing G to the opponent's moves. Accounting for the sente reduction (the amount depends on the local temperature), then we should add less value. However, be careful as cuts can be double attacks if the player has another group H nearby that depends on G for support, increasing the value. Overall, in summary, a good rule of thumb seems to be that having thickness can reduce the value even of strong moves by the opponent nearby by up to a factor of 2x. (and weak dead moves by as much as the global temperature). There were many weak assumptions in this derivation, but I think this is a good summary regardless. Playing near thickness is like playing in the centre. It is less valuable than the corner, but far from worthless. |
Author: | RobertJasiek [ Fri May 13, 2022 11:24 pm ] |
Post subject: | Re: Numerical evaluation theory of thickness |
See https://www.lifein19x19.com/viewtopic.php?f=17&t=18734 for the good definitions! |
Author: | kvasir [ Tue May 17, 2022 3:37 pm ] |
Post subject: | Re: Numerical evaluation theory of thickness |
Maybe influence functions go back all the way to Zobrist? The three points per length of wall should be compared to counter examples. Such rules seem to fail because they are too mechanical and there isn't a criteria when to not "count" thickness. Personally I like the QARTS system because it works with the players understanding (or lack of) and not against it. What I mean is that QARTS subtracts 20 for each weak group that isn't able to make eyes and 10 for weak groups that can make a single eye, the problem for the player is then to find the eyes and group safety is what most of us should be thinking about before counting thickness. It is also straight forward to extend to cases where you make judgment to subtract anywhere between 0-20 points for a weak group. I'm not sure if "the evaluation theory of thickness" is supposed to work in the opening, middlegame or the endgame? It can't be the case that thickness has the same value at every stage of the game. Probably there should on average be a gradual decline in the value of thickness from the opening to the endgame, and in the middle game it is probably more concreate in that you try to effect the thickness in the middle game. Every game is different though and there is the kind of whole board thickness that only really becomes useful in the endgame. My own experience with counting three points for walls in the late middlegame / early endgame is that it doesn't work: 1. because it overvalues the thickness when there aren't weaknesses to exploit; 2. it is a biased estimate and doesn't get you the right answer when you are close to solving the endgame; 3. it doesn't seem to guide where to play. Maybe others have a more positive experience (and I probably put it in a more negative way than needed)? |
Author: | RobertJasiek [ Wed May 18, 2022 12:25 am ] |
Post subject: | Re: Numerical evaluation theory of thickness |
kvasir wrote: 20 for each weak group that isn't able to make eyes and 10 for weak groups that can make a single eye Far too simplistic. Quote: I'm not sure if "the evaluation theory of thickness" is supposed to work in the opening, middlegame or the endgame? Always, of course! Quote: Probably there should on average be a gradual decline in the value of thickness from the opening to the endgame Groups representing thickness can survive or die so the connection and life aspects of thickness decrease or increase. For surviving thickness, its aspect of new territory potential can increase if a) the connection and life aspects of thickness increase or b) the opponent's stones in the environment become weaker to increase the new territory potential of thickness. Otherwise, for surviving thickness, its aspect of new territory potential decreases to eventually zero at the game end while it should be realised. The values of thickness must be reevaluated after each move. My model allows all that. |
Author: | John Fairbairn [ Wed May 18, 2022 2:13 am ] |
Post subject: | Re: Numerical evaluation theory of thickness |
I'm encouraged that at long last at least some people are making the distinction between thickness and influence. But there's still a long way to go if we want to get into synch with the Japanese pro usage of the words. I think in particular that the first step is to make a case that a numerical evaluation of thickness can (or even should) be made. To say it is necessary for computer algorithms to work is no sort of case for humans. In my now rather vast compendium of index of Go Wisdom concepts, thickness is one of the biggies. For example, in Kamakura there are about 40 instances for just 10 games, and that is without counting closely related topics such as thinness, walls, and influence. In not one of those instances, as far as I can recall (and likewise in all the other GW books), is there even a hint of a pro attaching a number to thickness. It is true that a couple of Japanese pros have written books in which they appear to put a value on potential territory associated with thickness, but (a) the value is on the territory not the thickness and (b) they don't appear to use such numbers in their own games/commentaries. I infer these books are just sops to lazy amateurs, and maybe even ghost-written by amateurs. The much quoted 3 points per stone heuristic is something I associate with Bill Spight, though I think he told me once that he got it from someone else - certainly not a pro. The related heuristic of 6 points per stone in a moyo is something I heard from Korean amateurs. I've never seen it linked with a pro. So, apart hearing why thickness should be counted, it appears we need also an explanation why amateurs cleave so much to counting thickness (and other things) whereas pros don't. In real, pro-commentary life, the way thickness is talked about is rather about the way it adumbrates the game. It provides context. It determines strategies (including strategic mistakes). It's a gross form of signposting. It tells you what you can or should do next, or shouldn't do. And, along those lines, the one phrase that comes up most often in pro talk about thickness is "keep away from thickness - including your own". That's seems a lot more valuable than numbers of spurious accuracy. |
Author: | RobertJasiek [ Wed May 18, 2022 2:29 am ] |
Post subject: | Re: Numerical evaluation theory of thickness |
Reply to John: https://www.lifein19x19.com/viewtopic.p ... 32#p273032 |
Author: | kvasir [ Wed May 18, 2022 9:48 am ] |
Post subject: | Re: Numerical evaluation theory of thickness |
RobertJasiek wrote: kvasir wrote: 20 for each weak group that isn't able to make eyes and 10 for weak groups that can make a single eye Far too simplistic. I think it was meant to accompany a positional estimate and a rough count that rounds to 5-10 points in the opening and middle game, but note that it is not a numerical evaluation of thickness but of the burden of weak groups, it was an example. In the context of rough estimates it doesn't appear too simplistic. It seems reasonable to assume the weak groups are going to survive and estimate the effect of having to take care of them, that is if one desires to compare weak groups to points. Also -20 points, while being a round number is also similar to 2 handicaps and 0.5 komi (-19 points by KataGo) and is not necessarily arbitrary and could be adjusted or based on an actual estimate of the cost of suffering the attack. My point was not that this is a fantastically accurate and sophisticated system, it was that it works with the player's understanding and not against it. If the player thinks he can handle the group without suffering then he need not subtract 20 points, and if more precise estimate is needed then the player can estimate how much he will suffer by envisioning the attack. In contrast, systems that tally up stones in walls seem to fail to accommodate the player and his thought process (the player checks weaknesses, groups safety, safe territory, potential, center control, and etc. -- importantly, using the ability to see a few moves ahead in the game) and instead work against the player by diverting his attention to something that is of little importance or at least is not accomplished with the the usual playing skills. I think a "numerical evaluation" of thickness is more useful if it builds on the players skills instead. RobertJasiek wrote: Quote: Probably there should on average be a gradual decline in the value of thickness from the opening to the endgame Groups representing thickness can survive or die so the connection and life aspects of thickness decrease or increase. For surviving thickness, its aspect of new territory potential can increase if a) the connection and life aspects of thickness increase or b) the opponent's stones in the environment become weaker to increase the new territory potential of thickness. Otherwise, for surviving thickness, its aspect of new territory potential decreases to eventually zero at the game end while it should be realised. The values of thickness must be reevaluated after each move. My model allows all that. In general it is probably easy to come up with numerical estimates for certain balanced positions but I have doubts about unbalanced positions (i.e. thickness vs. weakness, thickness vs. moyo, thickness vs. point lead, and etc.). I don't see in the other thread that you have a model that gives a numerical estimate that is comparable to territory, except for potential territory, possibly you never proposed to compare thickness with territory? Heuristics that give information about if the position is balanced in regard to thickness, or anything else, are certainly useful. |
Author: | RobertJasiek [ Wed May 18, 2022 10:55 am ] |
Post subject: | Re: Numerical evaluation theory of thickness |
kvasir wrote: systems that tally up stones in walls seem to fail to accommodate the player and his thought process There are different systems for numbers of stones in walls. Some systems are bad while others are good. Height of a wall for finding an extension is a weak system. Counting all stones of a wall has some meaning for tewari but not for assessing thickness. Counting only the significant, outer influence stones is a good system for the application 'influence stone difference'. Knowledge of sophisticated concepts or previously insufficient thought processes are never good excuses for ignoring the most basic concepts. One very basic concept is to count numbers of stones! Any player can do this! Any player ought to do this because it has important applications for efficiency, joseki evaluation, other balance evaluation, neutral stone difference and influence stone difference. Quote: if it builds on the players skills instead. Each player has the skill to count stones! Quote: In general it is probably easy to come up with numerical estimates for certain balanced positions No. It was hard because it is basic. Recognising the basic things is hard. I needed circa two decades before I could state the simplest numerical estimates for certain balanced and fighting positions. One such value is the neutral (or dead) stone difference of newly played stones. Very basic, very easy and very important but I had to develop more complicated concepts before I recognised this simple concept. Every professional applies it presumably subconsciously (unless making severe mistakes) but nobody taught it until I discovered it as an excplicit concept. Am I still the only one to teach it? Quote: but I have doubts about unbalanced positions (i.e. thickness vs. weakness, thickness vs. moyo, thickness vs. point lead, and etc.). I have developed some theory (lots of principles) for that but these topics are advanced. An exhaustive, profound, coherent theory is still missing. Quote: I don't see in the other thread that you have a model that gives a numerical estimate that is comparable to territory, For joseki evaluation, there is my model that relates stone difference, territory difference and influence stone difference to each other. For other theory, I could only provide bits comparing territory to thickness or influence so far. However, there is also quite some theory by me that relates them by a) partial application of numbers (e.g., only the territory balance or only the influence stone difference) or b) without numbers but using rough conditions, such as a player dominating a region. You know where to find this theory of mine. Quote: possibly you never proposed to compare thickness with territory? As before. Quote: Heuristics that give information about if the position is balanced in regard to thickness, or anything else, are certainly useful. Principles and procedures for that are even better, see above. |
Author: | dhu163 [ Sat May 21, 2022 10:13 am ] |
Post subject: | Re: Numerical evaluation theory of thickness |
Just to note down the thrust of my ideas. No formulae or concrete evaluations yet (though I am working on it). Weak points and weak groups theory |
Author: | dhu163 [ Tue May 24, 2022 7:35 am ] |
Post subject: | Re: Numerical evaluation theory of thickness |
I have done some whole board (computer assisted) calculations using equations I've worked out for influence functions, and am shocked to find that it predicts the first move as gaining 13.6 +/- 10. This is entirely mathematical (no tuning), computation time scaling approximately linearly with board size (361) (10 seconds, though perhaps more accurate with more time), so I am shocked at the accuracy. Perhaps Go is really simpler than you might think? A lot of the complexity comes from life and death in the middlegame, but somehow endgame style miai counting averages gives such an accurate result for the opening too ????? I'm inclined to keep the details a secret for now. I should test it out for larger boards, and improve the error. The key equations fit on one chalkboard line or around 15 lines of computer code. It's so simple I feel someone must have thought about this before but perhaps not had the resources to do the computation, or the confidence/inclination to publicly discuss it preAI? Perhaps its just the beginning for this idea, but for me it naturally closes several of the few conversations I've had with Go researchers. To be honest, I have spent time on the mathematical side, with little understanding of the history of ideas for Go programs beyond some GnuGo reading. It isn't clear what the advantages/disadvantages of my system are compared with classical programs. I have some hope my program has already reached 15k level with much room for tuning and improvement. edit: 10 mins later. Not including systematic errors (from not quite right equations), it gives 17.7 +/- 0.1 ish. at the moment, systematic error is +/- true gain of first move (I think this is close to true from miai counting bounds even when sente/gote exists). By miai counting bounds, this "should" be an overestimate for twice komi as W has the next move which I didn't take into account. I am preparing to take it into account, but this multiplies the equation code by 4 times (with lots of room for mistakes). edit: 4hrs later. Can seem to make useful progress with my new equations. I have written some down but the computation spits out nonsense (e.g. most of the time I get B+345 or something. I think I am missing some kind of balance to them). edit: 15mins later. Ah, I accidentally played 19 moves on a row rather than one move. That's why! Now it is working again. Hmm, the new value is 10.5 +/- 3 for the value of the first move. That is too low, but still not too bad given that it gives better move suggestions. It says tengen for the first move, and otherwise 2-10, preferring the sides to the corner, but at least its no longer nonsense! edit: 15 mins later. My main criticism of my engine is that it thinks 1 eyed groups are 100% alive. Perhaps this is why it likes the centres, then sides, then its favorite corner move is 2-2, and it likes to play 2-2 on any other corner move. I think my new equations are slightly insufficient and I need a few extra lines of computations, though I think I have the key variables. |
Author: | kvasir [ Sat May 28, 2022 5:37 am ] |
Post subject: | Re: Numerical evaluation theory of thickness |
RobertJasiek wrote: Knowledge of sophisticated concepts or previously insufficient thought processes are never good excuses for ignoring the most basic concepts. One very basic concept is to count numbers of stones! Any player can do this! Any player ought to do this because it has important applications for efficiency, joseki evaluation, other balance evaluation, neutral stone difference and influence stone difference. It certainly does helps to count stones in the center, I have assumed this is what you mean by influence stones but I am not sure if this is what you mean. What I tried to say above is that simple influence estimates can be useful if they can tell you if the game is balanced but I doubt the usefulness of methods that claim to predict how outside stones convert into territory based on simply counting the stones. That is not to say that such methods are total nonsense in every situation. When evaluating position with thickness it is important to judge the effect on the game. This can then help with making objective decisions about how to play. The alternative way of finding a target number for converting thickness into territory based on a heuristic can work in some situations but I can't agree that this is an objective way of playing. As an example, saying that one player has a wall of length 6 and now they must find 18 points of territory somewhere is not objective. On the other hand, it is certainly true (as you said) that some heuristics, principles and theories are better than others. RobertJasiek wrote: For other theory, I could only provide bits comparing territory to thickness or influence so far. However, there is also quite some theory by me that relates them by a) partial application of numbers (e.g., only the territory balance or only the influence stone difference) or b) without numbers but using rough conditions, such as a player dominating a region. You know where to find this theory of mine. I don't know if you mean your books or something in this forum. dhu163 wrote: I have done some whole board (computer assisted) calculations using equations I've worked out for influence functions, and am shocked to find that it predicts the first move as gaining 13.6 +/- 10. With all these theories floating around maybe someone can try their theories on this position from a pro game and explain how they help with playing. I marked the last move only to show who's turn it is. |
Author: | RobertJasiek [ Sat May 28, 2022 6:15 am ] |
Post subject: | Re: Numerical evaluation theory of thickness |
kvasir wrote: It certainly does helps to count stones in the center, I have assumed this is what you mean by influence stones but I am not sure if this is what you mean. It is immaterial whether center or not. What matters is the outside for the sake of future potential. The "influence stone difference" (around a region) is the number of Black's outside stones with significant influence minus the number of White's outside stones with significant influence. Quote: What I tried to say above is that simple influence estimates can be useful if they can tell you if the game is balanced Or unbalanced. (E.g., territory count 0, a clearly positive influence stone difference and Black to move suggest Black leads.) Quote: but I doubt the usefulness of methods that claim to predict how outside stones convert into territory based on simply counting the stones. That is not to say that such methods are total nonsense in every situation. Sure, usually, a simple count of excess influence stones does not equate a simple amount of new excess territory. However, there are special applications for which something similar is possible. E.g.: 1) Joseki evaluation. Although the influence stone difference does not predict the amount of future territory, it is related to the current local territory count and to the pure local stone difference. 2) When a player pushes to take, say, 2 extra points along the edge, we can say that the opponent's new influence stone should, from the opponent's perspective, make at least 2 new points later. Quote: I don't know if you mean your books or something in this forum. See https://www.lifein19x19.com/viewtopic.p ... 38#p273238 |
Author: | dhu163 [ Sat May 28, 2022 6:27 am ] | |||
Post subject: | Re: Numerical evaluation theory of thickness | |||
I'll work on it a bit more today, it is still very crude. Without any changes from birth a few days ago, my engine at no komi says B+1.2 +/- 2 (but as W is to play, probably -7 to this so W+6 approximately). I think it thinks that (15,18) not a real tesuji as it helps W, and (11,17) and (11,18) are too slow, so I also think W is ahead, but not but as much as it thinks. If I remove those two black stones, I get W+11 +/-3, so it thinks those B stones only gain an average of 6 points, much less than the 14 points expected for a good move. Suggestions for B to play: (2,5),(2,10),(3,9),(18,6),(8,11) Suggestions for W to play: (3,9),(2,10),(2,5),(2,8),(4,5) (numbering from top then left) These are pretty bad suggestions in my opinion, but actually the top suggestion for W is probably one of the top moves. For comparison, katago says with W to play, it is B+4 on the board, so I am off by 10 points. Katago suggests (14,13), (2,18), (5,12), (3,9), (14,8). some notes on difficulties several hours later: There are still problems, but I could cry. It predicts the 3-3 invasion as well as the small knight's move!!! On the other hand, computation is several times slower. Also I removed a factor of two in the equations to make the output much more correct. But I don't understand why that is correct. I suspect it cancels the error that it thinks one eye is enough for life. At the moment the komi prediction is B+8.34 +/- 0.11 after 1000 rounds. B+8.82 +/0.03 after 2000. Clearly my error estimate is not taking all error into account, but I'm not too bothered atm. In my opinion the number one problem now is LD understanding, and not using the chain rule (my calculus is only first order). However, rerunning on kvasir's position, it says W+22.16 +/- 0.07 after 2000 rounds, with move suggestions (2,12),(2,10),(3,2),(4,5),(7,5). After 3000, W+22.20 +/- 0.03, (7,5),(6,2),(2,10),(4,12),(2,12). I am impressed it found the attachment (4,12). What went wrong? I think it is underestimating the problems at weak points in general. It needs to anticipate B's possible attacks even if they are premature right now. OOTH, I have an idea for patching the 1 eye problem. (10 mins later) At least it no longer suggests tengen. Now opening is B+8.45 +/- 0.06, (8,2),(7,2),(2,16),(11,11),(14,10). I can attach a picture its thinking in Wgain.png (W is where W wants to play, B is where W doesn't). The control estimate is control.png Even though the engine is very weak, I would like to stress how much simpler to understand these equations are than neural networks, though it will still take math and go skill to interpret results. Compared to neural networks, pure equations are zero learning. I think its not bad at finding good shape points overall, but it often plays too deep or doesn't defend weak points.
|
Author: | RobertJasiek [ Sun May 29, 2022 10:41 am ] |
Post subject: | Re: Numerical evaluation theory of thickness |
I lack time for a full analysis. To start with, what is the LD and territory status in the lower right? |
Author: | kvasir [ Sun May 29, 2022 11:45 pm ] |
Post subject: | Re: Numerical evaluation theory of thickness |
It would be a mistake to not fight the ko but besides taking the ko stone white's shape appears alive or escaping to the outside. My mind is tainted by KataGo but I can suggest this black follow up, assuming no escape and no effect on the outside. Maybe it is useful for endgame purposes but I am not sure if it is because it is a very strong assumption, something that may never be realistic. If black tries the same thing right now: I was toying with a precise endgame evaluation of this group myself yesterday and chose this follow up for black. |
Author: | dhu163 [ Mon May 30, 2022 7:13 am ] | |||
Post subject: | Re: Numerical evaluation theory of thickness | |||
several more updates and in many ways it has mellowed, but it is certainly sharper around weak points and boundaries, though it still likes crude pushes and cuts. For around 3000 rounds, 3 minutes, B+9.71 +/- 0.13 (note it is much less confident) Top moves (6,7), (7,12), (15, 9), (12, 6), (16, 18) __ For opening board, 1000 rounds, B+2.96 +/- 0.01, top moves (16,14), (16, 15), (6, 18), (10,16), (15, 14) __ Now I think the main problem missing is that it does zero reading, and that its attention is too local (only on neighbours, so it can't easily even understand that corners are more valuable than the centre), and it misses double attacks, but I'm not sure its worth spending more time on this, though I am definitely happy with progress. Also, it assumes independence, which can go wrong in semeai for weak groups. It doesn't see snapback. __ The key correction required seems to be equations that find two eyes for every point counted as territory. __ 20220530 signing off for a while. The pictures are very pretty, but really not as justified as they could be. Getting balance for pure control estimates is too fiddly. I think flow variables for sente/gote/eyespace should be more useful. Perhaps I'll investigate. It seems difficult to combine value in without actually doing the reading. I probably can't expect too many nice general equations, but perhaps only special cases. 20220708 I find it difficult to think clearly about this problem. A tiny thought on how to make a balance equation: perhaps if a position is dangerous for your opponent (you have some threats), then you are more willing to take on danger yourself to claim the profit. Note that this is a choice though and you don't have to.
|
Author: | mart900 [ Thu Jun 30, 2022 1:28 am ] |
Post subject: | Re: Numerical evaluation theory of thickness |
New to this forum, I was reading this interesting thread and thought John hit the nail on the head. I assume it's allowed to bump month-old threads. John Fairbairn wrote: I'm encouraged that at long last at least some people are making the distinction between thickness and influence. But there's still a long way to go if we want to get into synch with the Japanese pro usage of the words. Responding specifically to the bolder part: I think it's because professionals, unlike amateurs, understand the game well enough to know that reducing it to math, while theoretically possible, is simply too complicated for a human brain. Decades of failed attempts to create strong go AI by explicitly programming strategic concepts basically confirms it.I think in particular that the first step is to make a case that a numerical evaluation of thickness can (or even should) be made. To say it is necessary for computer algorithms to work is no sort of case for humans. In my now rather vast compendium of index of Go Wisdom concepts, thickness is one of the biggies. For example, in Kamakura there are about 40 instances for just 10 games, and that is without counting closely related topics such as thinness, walls, and influence. In not one of those instances, as far as I can recall (and likewise in all the other GW books), is there even a hint of a pro attaching a number to thickness. It is true that a couple of Japanese pros have written books in which they appear to put a value on potential territory associated with thickness, but (a) the value is on the territory not the thickness and (b) they don't appear to use such numbers in their own games/commentaries. I infer these books are just sops to lazy amateurs, and maybe even ghost-written by amateurs. The much quoted 3 points per stone heuristic is something I associate with Bill Spight, though I think he told me once that he got it from someone else - certainly not a pro. The related heuristic of 6 points per stone in a moyo is something I heard from Korean amateurs. I've never seen it linked with a pro. So, apart hearing why thickness should be counted, it appears we need also an explanation why amateurs cleave so much to counting thickness (and other things) whereas pros don't. In real, pro-commentary life, the way thickness is talked about is rather about the way it adumbrates the game. It provides context. It determines strategies (including strategic mistakes). It's a gross form of signposting. It tells you what you can or should do next, or shouldn't do. And, along those lines, the one phrase that comes up most often in pro talk about thickness is "keep away from thickness - including your own". That's seems a lot more valuable than numbers of spurious accuracy. It comes down to a sort of Dunning-Kruger-like concept that I've noticed in other games as well. You would get new players asking oversimplified questions, and good players answering "it depends." In Go, for example you could have someone asking "How many points is a 4-stone wall worth?" and obvious to anyone with a basic understanding of the game is that you can't answer it except with "it depends." It seems amateurs, due to our limited understanding of the game, are understandably more likely to believe that it's reducible because they don't know the extent of this "it depends." They are more likely to think we can reduce the wall question to a few core concepts, run the numbers and get an answer, but the intricacies are lost on them and the answer is unlikely to be accurate. Pros would know this. Don't get me wrong, futile though it may be, attempting this is still no doubt an interesting and fun pursuit. And I suppose another reason we like to try to put it into numbers in particular is that numbers are unambiguous in their value. You ask a pro to explain the value of a thick group in one game, and then a similar thick group in another game, you're not going to know which of the two was worth more from their answer. |
Author: | dhu163 [ Thu Jul 07, 2022 5:09 pm ] |
Post subject: | Re: Numerical evaluation theory of thickness |
re: highlighted paragraph I assume your point is that in order to play a good move, you need to focus on profiting from the value of weak points before your opponent does, so think less about thickness. However, if you want to judge the score of a position, you start needing to think about how thickness affects life and death around. Or perhaps in other words, that isn't even the really difficult part of the game. A few averages will work OK to estimate thickness. But reading the boundary of life and death can take arbitrary amounts of time. Determining whether a move is 2,1,0,-1 moves away from threatening a large group or cutting point can change its value drastically. I mostly agree with both your points, and yet I still believe there are ways to improve the mathematics towards much better strength without needing too much more computing power, even if it might be beyond me. Of course, we default to asking the great strength of AI which is very efficient (though its training was very costly), but I don't think it is the end yet. |
Author: | dhu163 [ Thu Jul 14, 2022 7:35 am ] | ||||
Post subject: | Re: Numerical evaluation theory of thickness | ||||
I have realised there is a simple trick to improve the balance, though the crude means some unjustifiable assumptions. However, I think it has greatly improved strength, though it is much slower to compute (I think a priority needs to be assigned somehow to the computations to cut down unnecessary ones). Also, its first instinct to play around the 2nd line or where it isn't alive yet remains, though with more computation, it does seem to reconsider more. The maximum influence on a neighbour is bounded at 1, but this doesn't really appreciate the extra value of eyespace when weak, mostly by overestimating the value of weakly controlled areas. The score estimate seems to vary more freely now with more computation (less precise but it shows it is thinking about something), and there are also checkerboard patterns for gain that I can explain but can't easily get rid of. (they seem to act like a sign of life that it is computing something, a bit like oscillations have a non-zero energy). Another problem I don't much understand is that it outputs a gain of at most 2 per move normally when it should be 14. The checkerboard probably isn't helping, though the shielding is a nice feature. I think all these problems are linked to not checking for 2 eyes. And yet I'll just be content with the strength improvement for now. It still wasn't able to win at 15k, but as usual the centre shape was fine but it couldn't hold onto it. I attach output after 1000 rounds, 200 seconds. B+7.34 (accounting for who to play) lots of good top move suggestions were outputted, but at the end, it suggested nothing so good W to play: [(5, 2), (4, 7), (2, 17), (11, 10), (2, 7)] B to play: [(11, 5), (14, 7), (12, 5), (17, 14), (14, 6)] I can't completely claim it does zero reading now as it does some fuzzy (fractional) addition of imagining moves that it wants to play on the board for both sides. 30mins later: some tweaks and gain is up to 8 in some situations (though 2 still seems normal). However, it now wants to play on the 1st line. This seems to be because it overemphasises the principle that your opponent's best point is your own and both seem very determined to play on the 1st line. It doesn't blow up, but it reaches a high stable point. It lost again at 15k, but it was leading most of the game this time. But towards the end though that an eyeless stick in the opponent's area was alive and didn't both to defend its own area. My eyespace patch months ago was a bit too simple. But I may want to rehaul that whole part of the system. I like that the system tries to start from first principles. However, I think the next step in strength requires concepts like group and eyespace. As a mathematician, it is nice to get a fuzzy proof for why such concepts are necessary.
|
Author: | dhu163 [ Fri Jul 22, 2022 3:28 am ] |
Post subject: | Re: Numerical evaluation theory of thickness |
My intuitive feeling for eyespace theory. Meaning of temperature in go relative to thermodynamics Definitions philosophy, (e.g. eyespace) Back to the original question. What is the value of a wall? wordy thoughts. still no concrete calculation. Intuition for using semeai results for points evaluation.? An analysis of a basic model for miai defenses that could be extended to double weak points, double defenses, double attacks, etc. Not written into a paper because it is only one variant with several possibilities and I don't understand it well enough, just working through it now. Differential vs absolute Groups, komi Complexity measures Based on above, my naive conjecture is that komi in n dimensions simply grows as 4n-2 + f(n) where f(n) is increasing but small. At the least, I think that it is very unlikely to be higher than quadratic growth. Perhaps f(n)(n/2)(4n-2) is another reasonable possible form. My 2nd guess is [(4n-2)+n(6n-4)g((10n-6)/(4n-2), (2n-2)/(4n-2))+?]/2 for n=2, this gives (6 + 8 * 2* 0.878+?)/2 = 10.024 I'm not sure about my count of aji. If I say that the first move doesn't have 1 influence on the outside but reduced due to aji, how much should I scale down? Probably according to the value of follow ups. If the side moves are the same size to corner still, then counting them as sente isn't unreasonable, after which it is more like g((6n-4)/(4n-2), (6n-4)/(4n-2))=0, which simplifies to (6/2)=3 Somehow we want the answer to be (6+8)/2=7, but this seems like fiddling with equations that aren't justified. Somehow that would work in 2 dimensions, but I can't say I know what will happen in higher dimensions. If such a pattern continues, perhaps [(4n-2)+n/2(6n-4)]/2 = (3n+1)(n-1)/2 isn't an unreasonable guess. in n=3, this predicts k=10. The understanding is that the centre doesn't need to be counted not because it isn't valuable but because the influence is shielded from the centre by the influence of the opponent's forcing moves. Some seeps through as those forcing moves aren't alive, but only after the corner adds a shimari does it really complete it. By assuming side forcing moves are sente on the corner, this suggests that the standard corner moves aren't as good as you might think they are, because the commitment gives the opponent large moves on the sides. However, they main remain locally and globally best because if you say play on the side first, the opponent can play in the corner to counter that direction and may get a larger corner. |
Page 1 of 2 | All times are UTC - 8 hours [ DST ] |
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group http://www.phpbb.com/ |