Bill, it may be that much of the benefit comes just from slowing down. Also, I've been pausing in my reading and asking, "...ok, but would I play that if I were him?" and often the answer is, "... of course not!" Apparently System 1 can change its mind when you ask it a different question...
daniel_the_smith wrote:To go from 300+ candidates to a handful of reasonable ones is a *much* bigger decision than going on to pick the best out of that handful.
Some more random thoughts:
To get an idea of the amount of work System 1 performs for you, consider the case where you read a few lines 12 moves deep. Say you're inhumanly good at reading and consider 3 followups to every single move (so you consider 3^12 ~= 500,000 positions). Then System 1 contributes about
80 bits to your decision, and System 2 contributes about
17 bits to your decision. This is an enormous overestimate of how much System 2 contributes; typically I only read a couple lines that deep, nothing like 500,000. If I get that deep for about 20 lines (still an overestimate, but I read lots of lines only a couple moves deep), then System 1 is doing ~94 bits of work, and System 2 is doing 4-5 bits of work.
Say that System 2 evaluates 30 positions per minute (2 seconds per position). Then, in one minute, System 2 does 5 bits of work. In 5 minutes of thinking, System 2 could see 5*30 = 150 positions, nearly 8 bits of work. In one hour of thinking, System 2 could see 30*60 = 1800 postions, or almost 11 bits of work. I don't think anybody's System 2 operates much faster than this (2 or 4 times the speed, fine-- more would astonish me). Pros and amateurs alike. The large difference, then, must be how much work their System 1 does. If a 6 dan can read 20 moves ahead, their System 1 is doing something on the order of 160 bits of work. This explains why it's no effort for them to beat me; their System 1 does vastly more work than my Systems 1 and 2 together. They don't even have to get System 2 involved. If a pro can read 25 moves ahead, they're doing ~200 bits of work, more than the 6 dans can ever hope to do.
(On the "bits" I refer to above: the difficulty of a decision can be thought of as how many bits of information it takes to locate it within the search space. For example, if the problem is to identify a particular person on the planet, you need log2(7 billion) ~= 33 bits of information. If you know their gender, that's 1 bit (it eliminates half the population). If you know they're in New York, that's 10 more bits of information-- it only takes log2(8.7 million) ~= 22 bits of information to locate a person in New York. Every bit halves the space you have left to search. For comparison, if you needed 97 bits to locate someone on earth, that would imply that there were 2^97 people on the planet; if that were the case-- each person would have less than a
square micrometer to live in. And yes, I just discovered how cool
http://www.wolframalpha.com/ can be...)
(ETA: Huh, on my other computer in whatever font is being used, squiggles (~) look like minus signs (-). I'm talking about approximate bits, not negative bits above!)