Re: World rankings
Posted: Mon Feb 14, 2011 12:32 am
How little or much can a single number express about relative player strengths?
1) The number can only describe what is measured. If tournament games are measured but non-tournament games are not measured, then it will not be noticed if different players are weaker or stronger in or outside tournaments. Same if some tournaments are but others are not measured.
2) Even if the numbers suggest a transitive ordering of the players because the numbers themselves have a transitive order, the players cannot be ordered transitively and linearly. This is so beacuse regular cyclical domination like A beats B beats C beats A occurs.
3) A multiple of a double round robin played within a reasonably short time (so that some players do not improve significantly yet) is the best tournament system for getting numbers: They are the number of wins. Double round robin ensures that each player gets each colour equally often. Even so, the numbers do not express a player's general strength but only the one under the particular tournament's conditions like komi, rules and thinking times.
4) A series of regularly occurring tournaments with the same conditions does not assess equally meaningful numbers for all players because some players will not be able to play all rounds or all occurrences of the tournaments. Rather some players' information will be more significant than others'.
5) A set of different tournaments in that all players of the same population play all games do not compare their strengths but only a partial aspect of their strengths because the tournaments will have a particular distribution of tournament conditions, which favour some of the players who play the strongest under them and which will be a disadvantage for others.
6) It is very hard to compare well players with (very) different numbers of played games.
7) Different tournament conditions cannot be expressed well by a single number. Rather every player can have different strengths under different conditions like being weak at lightning and strong at slow games or like being weak at 6.5 komi but strong at 7.5 komi or like being weak under area scoring but strong under territory scoring.
8) If numbers are determined by anything more complicated than simple number of wins, then (rather) arbitrary parameters (and system methods) are required and strengths depend also on how the ranking system is defined rather than only on how a player performs. To make things at least a bit fairer again, a space of various sets of parameters leading to a space of various ranking systems should be tested and expressed as a space of numbers per player rather than a single number.
9) Connecting different player populations (like from different countries) to each other is very difficult because a) relatively few inter-population games are played, b) different players play different numbers / percentages of games as inter-population games, c) tournament conditions in different populations differ, d) the measurement problems in one population occur also on the inter-population scale.
10) Theory of non-trivial ranking systems is being researched more than already well understood.
Conclusion: A world ranking (rating) system relying on a single determined number per player is nothing but an illusion.
1) The number can only describe what is measured. If tournament games are measured but non-tournament games are not measured, then it will not be noticed if different players are weaker or stronger in or outside tournaments. Same if some tournaments are but others are not measured.
2) Even if the numbers suggest a transitive ordering of the players because the numbers themselves have a transitive order, the players cannot be ordered transitively and linearly. This is so beacuse regular cyclical domination like A beats B beats C beats A occurs.
3) A multiple of a double round robin played within a reasonably short time (so that some players do not improve significantly yet) is the best tournament system for getting numbers: They are the number of wins. Double round robin ensures that each player gets each colour equally often. Even so, the numbers do not express a player's general strength but only the one under the particular tournament's conditions like komi, rules and thinking times.
4) A series of regularly occurring tournaments with the same conditions does not assess equally meaningful numbers for all players because some players will not be able to play all rounds or all occurrences of the tournaments. Rather some players' information will be more significant than others'.
5) A set of different tournaments in that all players of the same population play all games do not compare their strengths but only a partial aspect of their strengths because the tournaments will have a particular distribution of tournament conditions, which favour some of the players who play the strongest under them and which will be a disadvantage for others.
6) It is very hard to compare well players with (very) different numbers of played games.
7) Different tournament conditions cannot be expressed well by a single number. Rather every player can have different strengths under different conditions like being weak at lightning and strong at slow games or like being weak at 6.5 komi but strong at 7.5 komi or like being weak under area scoring but strong under territory scoring.
8) If numbers are determined by anything more complicated than simple number of wins, then (rather) arbitrary parameters (and system methods) are required and strengths depend also on how the ranking system is defined rather than only on how a player performs. To make things at least a bit fairer again, a space of various sets of parameters leading to a space of various ranking systems should be tested and expressed as a space of numbers per player rather than a single number.
9) Connecting different player populations (like from different countries) to each other is very difficult because a) relatively few inter-population games are played, b) different players play different numbers / percentages of games as inter-population games, c) tournament conditions in different populations differ, d) the measurement problems in one population occur also on the inter-population scale.
10) Theory of non-trivial ranking systems is being researched more than already well understood.
Conclusion: A world ranking (rating) system relying on a single determined number per player is nothing but an illusion.