Tami wrote:
I think that hits the nail on the head.
The KGS may be accurate for most players most of the time, but it seems to be based on the assumption that nobody ever improves. Once you have a stable rank, then it becomes extremely hard to change it, no matter how much you win or lose. And, I kind of agree with Robert Jasiek here, it is much easier to play worse than your mark than up to it because on more than one occasion I have been close to a promotion, lost a crucial game and then gone on to lose a string on games out of sheer frustration. I`m sure that experience is not unique. If only it wasn`t quite so like climbing a greasy pole, maybe not so many players would go on tilt so often.
The latest adjustment, the downward one that prompted this thread, came as a nasty surprise - I had been nursing my main account, the heavy one, toward 1d by steadfastly resisting tilty emotions whenever I did lose, and the adjustment undid all that. It also brought my 1d account temporarily back to 1k.
And, yes, rank and ratings graph are important to me. I have been putting effort into improving my go, and I was using these things to measure my progress. Maybe I have little talent for the game and I am only improving in small steps, but I still like to see my graph go upwards over the passing months.
For sure, I totally get it that the system is not intended for providing feedback on players` progress, but only for making a roughly 50-50 win/lose balance. Could it not be though that the 50-50 balance is merely an illusion of a mirage? If, in fact, there are many, many players of different strengths crammed into a small ratings band because of heaviness, then might not their mutual scores tend to even out over time, thereby giving the false impression of accuracy? (Strong 3k beats weak 3k, but weak 3k wins against weak 1k, who then narrowly beats strong 2k, who beats strong 3k, who goes on tilt and loses to weak 3k).
Still, if it`s never going to change, then that`s just too bad. At least it's still fun to play free games and watch broadcasts.
When I get the chance, I like to clear up misconceptions in threads like these. In an
earlier thread I've already addressed some of Robert's concerns of heaviness.
Cliff notes:
I don't have the time to do that sort of thing again (I figure that if someone playing 10 games/day for a year in one of the most stable rank ranges on KGS has no issue moving their rank when it is appropriate, the majority of us should be ok). What I have done though is attach an image, assuming that your Universal go server handle is the one you use on KGS:
It shows your rank graph as well as your winning percentage in rated games and total games played. It is broken down month by month with cumulative totals at the bottom. For reference, I added a horizontal line at 1.5k. I used August 1, 2011 as the start of the dataset and April 30, 2012 as the end. I'll let the L19 community decide if they feel the KGS server is doing a good job of handling these results or if there is an issue with "heaviness", or a bias of losses being weight more than wins.
Regarding the idea of rank "crunching" where you have several ranks in one band. This type of phenomenon is actually something the study linked yoyoma posted would have detected (unless all rank systems tested suffer from it). If you have KGS grouping people together they shouldn't be, then a superior system would be able to identify these mismatches and easily predict the winner in an even game (a full 1 stone difference should represent something like a 75-80% chance of winning for the stronger player in an even game). In the aggregate, the superior system would be able to correctly predict a much larger percentage of even game winners than KGS.
Another way to check for this "crunching" would be to look at winning percentages in handicap games. If players within a band actually represent a several stone span, then when you play someone outside the band there is a good chance there is a strength difference that is not being compensated for by the handicap. The result is that the player with black should have a severe disadvantage in these games (note: handicap games on KGS are already a half stone under-handicapped by default, but now they would be even more so.). If 3k was really "2k-3k-4k" and 4k was really "4k-5k-6k", and 5k was really "6k-7k-8k", it would mean your average 2 stone game should really be a 4 stone game, and black would be in trouble. In the end, you'd expect something like less than 20% of handicap games being won by black.