This isn't quite right though, as the ratings are continuous, even though the ranks are cardinal. So between a 3.9k and a 3.0k we might expect the stronger to win 2/3 of the game, even though from the perspective of the stronger player he may feel frustration that he wins 2/3 of his games and never seems to rank up.emeraldemon wrote:It seems to me that the ideal rating & handicapping system would strive to handicap every match to a 50% win rate.
KGS ranking revisited
- jts
- Oza
- Posts: 2672
- Joined: Sat Sep 18, 2010 4:17 pm
- Rank: kgs 6k
- GD Posts: 0
- Has thanked: 310 times
- Been thanked: 637 times
Re: KGS ranking revisited
-
hyperpape
- Tengen
- Posts: 4382
- Joined: Thu May 06, 2010 3:24 pm
- Rank: AGA 3k
- GD Posts: 65
- OGS: Hyperpape 4k
- Location: Caldas da Rainha, Portugal
- Has thanked: 499 times
- Been thanked: 727 times
Re: KGS ranking revisited
One adaptation is to use all the variations of komi between 6.5 and 0.5 as appropriate. Of course this doesn't remove the problem entirely.
- wms
- Lives in gote
- Posts: 450
- Joined: Tue Apr 20, 2010 4:23 pm
- GD Posts: 0
- KGS: wms
- Location: Portland, OR USA
- Has thanked: 257 times
- Been thanked: 287 times
- Contact:
Re: KGS ranking revisited
A year or two ago somebody surveyed various rank algorithms applied to go. He used the KGS algorithm (I'd given him what he needed to recreate it exactly), Elo, a couple modern systems (Glicko I think was one?), and his own system. He then used ability to predict game outcomes as his metric of how good a system was. I was happy to hear that the KGS system placed second in his study, behind his own, but ahead of Elo and Glicko. But his system did not consider predictability of rank changes; so KGS' penchant for changing your rank when you don't play, or for it's occasional bumps where everybody goes up or down together, did not count against it.emeraldemon wrote:...There was a competition a while back looking for improvements to ELO that used basically this metric on historical chess data, I believe.
I'm terrible with names but probably somebody here on 19x19 will remember who did the study and where the results are.
-
yoyoma
- Lives in gote
- Posts: 653
- Joined: Mon Apr 19, 2010 8:45 pm
- GD Posts: 0
- Location: Austin, Texas, USA
- Has thanked: 54 times
- Been thanked: 213 times
Re: KGS ranking revisited
http://remi.coulom.free.fr/WHR/wms wrote:A year or two ago somebody surveyed various rank algorithms applied to go. He used the KGS algorithm (I'd given him what he needed to recreate it exactly), Elo, a couple modern systems (Glicko I think was one?), and his own system. He then used ability to predict game outcomes as his metric of how good a system was. I was happy to hear that the KGS system placed second in his study, behind his own, but ahead of Elo and Glicko. But his system did not consider predictability of rank changes; so KGS' penchant for changing your rank when you don't play, or for it's occasional bumps where everybody goes up or down together, did not count against it.emeraldemon wrote:...There was a competition a while back looking for improvements to ELO that used basically this metric on historical chess data, I believe.
I'm terrible with names but probably somebody here on 19x19 will remember who did the study and where the results are.
- emeraldemon
- Gosei
- Posts: 1744
- Joined: Sun May 02, 2010 1:33 pm
- GD Posts: 0
- KGS: greendemon
- Tygem: greendemon
- DGS: smaragdaemon
- OGS: emeraldemon
- Has thanked: 697 times
- Been thanked: 287 times
Re: KGS ranking revisited
Thanks for the link. wms, did the results of that study make you consider trying his algorithm?
-
RobertJasiek
- Judan
- Posts: 6283
- Joined: Tue Apr 27, 2010 8:54 pm
- GD Posts: 0
- Been thanked: 799 times
- Contact:
Re: KGS ranking revisited
Mainly my objection is the system's design errors.jts wrote:So your objection is not that it's erratic per se
-
RobertJasiek
- Judan
- Posts: 6283
- Joined: Tue Apr 27, 2010 8:54 pm
- GD Posts: 0
- Been thanked: 799 times
- Contact:
Re: KGS ranking revisited
This is a simplifying theory but not quite true. When the rating system is bad, then some players can play worse than usual because the system expects them to win much more than 50%, winning that much is tiring, and so they win less than they would if they were not forced into becoming tired. E.g., I (and others, from whom I have heard the same) can win ca. 10-12 games in a row, but then one becomes so tired than winning 20-24 games in a row is out of the question. Rather quickly lost games occur, first 1, then 2, then 4, then 8. The more tired the greater the percentage of lost games becomes.emeraldemon wrote:look at the average win-rate of every player over an appreciable number of games, and find the average distance from 50%.
-
witwit
- Beginner
- Posts: 8
- Joined: Tue Jun 22, 2010 4:50 pm
- Rank: KGS 7K
- GD Posts: 12
- KGS: witwit
- Has thanked: 1 time
- Been thanked: 2 times
Re: KGS ranking revisited
I certainly agree that sudden shifts in the system are not desired, but as other people have mentioned that is a separate issue from being "internally consistent". Trying to make KGS consistent with external systems is not a straightforward problem since there is no way to objectively measure accuracy like you can when judging the internal accuracy of the system, ie how well the system can predict the outcome of a game given their ratings.
It is entirely possible, however, that the player population on KGS, on average, plays enough games outside of the server to make the system more accurate than without this inflation. Moreover, the system obviously places less confidence in these inactivity inflated ranks meaning that if the increase in rank was not warranted the correction should in theory not take too long. Of course whether or not this works in practice depends on the playing habits of the player population, but I can say in the case of KGS that it works well enough for me.This is a simplifying theory but not quite true. When the rating system is bad, then some players can play worse than usual because the system expects them to win much more than 50%, winning that much is tiring, and so they win less than they would if they were not forced into becoming tired. E.g., I (and others, from whom I have heard the same) can win ca. 10-12 games in a row, but then one becomes so tired than winning 20-24 games in a row is out of the question. Rather quickly lost games occur, first 1, then 2, then 4, then 8. The more tired the greater the percentage of lost games becomes.
-
snorri
- Lives in sente
- Posts: 706
- Joined: Fri Jul 02, 2010 8:15 am
- GD Posts: 846
- Has thanked: 252 times
- Been thanked: 251 times
Re: KGS ranking revisited
Please don't. If game-to-game variance is greater than komi as I suspect it as for almost all amateur players or if the average systematic error is greater than that, as it almost certainly is, it doesn't gain anything and isn't worth the confusion it would cause. It's a false precision. When IGS switched to half-ranks and therefore had games with reverse komi for 1-stone differences, it took me some time to adjust. I'm okay with it and now I don't have to recheck the komi in every game but with a continuous komi system I'd have to, so I'd probably just manually set it to some common value before the game rather than some microrank-derived setting.hyperpape wrote:One adaptation is to use all the variations of komi between 6.5 and 0.5 as appropriate. Of course this doesn't remove the problem entirely.
-
RobertJasiek
- Judan
- Posts: 6283
- Joined: Tue Apr 27, 2010 8:54 pm
- GD Posts: 0
- Been thanked: 799 times
- Contact:
Re: KGS ranking revisited
Do you say that an objective external measure of internal accuracy cannot exist or that so far nobody has described such yet?witwit wrote:there is no way to objectively measure accuracy like you can when judging the internal accuracy of the system
- emeraldemon
- Gosei
- Posts: 1744
- Joined: Sun May 02, 2010 1:33 pm
- GD Posts: 0
- KGS: greendemon
- Tygem: greendemon
- DGS: smaragdaemon
- OGS: emeraldemon
- Has thanked: 697 times
- Been thanked: 287 times
Re: KGS ranking revisited
RobertJasiek wrote:This is a simplifying theory but not quite true. When the rating system is bad, then some players can play worse than usual because the system expects them to win much more than 50%, winning that much is tiring, and so they win less than they would if they were not forced into becoming tired. E.g., I (and others, from whom I have heard the same) can win ca. 10-12 games in a row, but then one becomes so tired than winning 20-24 games in a row is out of the question. Rather quickly lost games occur, first 1, then 2, then 4, then 8. The more tired the greater the percentage of lost games becomes.emeraldemon wrote:look at the average win-rate of every player over an appreciable number of games, and find the average distance from 50%.
If I understand, you're talking about a situation something like this:
A player is ranked incorrectly by a system, so the player is suggested even games against players he "should" win against say 80% of the time. But because it's tiring to win so much, he wins less: maybe 60% or something.
My instinct in this situation is to say that his true "should win" percentage is 60%, not 80%. Even if it's true that winning is more tiring than losing (which I'm not sure of), that seems to be a part of what's necessary to win. You can't say "Player A would beat Player B 80% of the time if Player A didn't have to win 80% of the time".
It is true that a person's past games can change how they play in the next games; are you trying to suggest that we should model this?
-
snorri
- Lives in sente
- Posts: 706
- Joined: Fri Jul 02, 2010 8:15 am
- GD Posts: 846
- Has thanked: 252 times
- Been thanked: 251 times
Re: KGS ranking revisited
So in a very real way, it's better to beat someone whose rating is going up than someone whose rating is trending down or staying flat, assuming of course that past performance says something about future results.jts wrote:Well, not necessarily. If your most recent partners decline, you'll decline to. It just assumes that, in the absence of evidence, you can still beat the same people and lose to the same people.
The worst would be to lose to someone whose rating is going down. Ah, but it's such a nuisance to check player's graphs before you play a game. Maybe there should be new stigma marks:
\ = losing record
/ = winning record
- = relatively flat record
Then one can advertise: "no ?~\" in the game description, which is more succinct than "no ?~ or losers"
But no, wait. Some people might take issue with people who only play opponents with a winning record in order to get that extra rating boost. So just like ~ you'd have to have another mark that says, in effect, that you don't play enough losers. I'm not sure what that mark should be...maybe *
-
witwit
- Beginner
- Posts: 8
- Joined: Tue Jun 22, 2010 4:50 pm
- Rank: KGS 7K
- GD Posts: 12
- KGS: witwit
- Has thanked: 1 time
- Been thanked: 2 times
Re: KGS ranking revisited
I meant that an objective measure of internal consistency does exist while an objective measure of consistency with external systems can only be defined by arbitrarily picking another system to compare against.RobertJasiek wrote:Do you say that an objective external measure of internal accuracy cannot exist or that so far nobody has described such yet?witwit wrote:there is no way to objectively measure accuracy like you can when judging the internal accuracy of the system
-
RobertJasiek
- Judan
- Posts: 6283
- Joined: Tue Apr 27, 2010 8:54 pm
- GD Posts: 0
- Been thanked: 799 times
- Contact:
Re: KGS ranking revisited
I am sure, based on the experience of dozens of thousands of games.emeraldemon wrote:Even if it's true that winning is more tiring than losing (which I'm not sure of),
Since one cannot say so, a rating system must avoid punishing players for becoming tired by having to win a too great percentage for too long a time.You can't say "Player A would beat Player B 80% of the time if Player A didn't have to win 80% of the time".
Not model - but avoid.are you trying to suggest that we should model this?
-
RobertJasiek
- Judan
- Posts: 6283
- Joined: Tue Apr 27, 2010 8:54 pm
- GD Posts: 0
- Been thanked: 799 times
- Contact:
Re: KGS ranking revisited
Arbitrarily picking another system is not an objective measure of consistency with external systems. Theoretical insight independent of particular external systems possibly can provide an objective measure. It is, however, still unclear which assumptions for theoretical insight can be called objective or arbitrary axioms. Getting a good answer on this is the real difficulty.witwit wrote:an objective measure of consistency with external systems can only be defined by arbitrarily picking another system to compare against.