Future multi-core CPUs have many benefits:
For instance, with several different cores you could run a fast core or two and some slower cores. The slower cores can take care of things like the background virus scan and the overhead from peripherals such as USB devices (I'm on an 800 MHz single-CPU system, and my mouse uses noticeable CPU resources if I move it about with Task Manager open). Or maybe they could process driver calls: I read somewhere that DX9 etc. sometimes uses up to 50% of the CPU resources to provide the GPU with what it requests, and that a key reason why the Xbox does so well is its efficient implementation of this.
The fast core could be used for gaming, and because, as the article says, there could be multiple levels of CPU speed on the full carrier, the heat output could be more reasonable while performance stays comparable with single-threaded, single-CPU situations.
Due to limitations in how much multithreading is practical, fast CPUs are still necessary in the package. Implementing cores of two speed grades on one CPU would enable good performance in all situations.
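To put a rough number on why a fast core still matters, here is a small sketch using Amdahl's law. That formula is my choice of illustration and is not from the article, and the 60% figure is a made-up workload, not a measurement.

```python
def amdahl_speedup(parallel_fraction: float, cores: int) -> float:
    """Amdahl's law: overall speedup when only part of the work can be spread over cores."""
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / cores)

# Hypothetical workload: 60% of the work can be multithreaded, 40% cannot.
for cores in (1, 2, 4, 8):
    print(cores, "cores ->", round(amdahl_speedup(0.6, cores), 2), "x")
```

With these assumed numbers the speedup can never reach 2.5x however many cores are added, because the 40% that runs on one thread is limited by how fast a single core is.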
For instance, a render farm sometimes uses thousands of CPUs to render all the frames in a movie. The high level of parallelism here suits many, many slower CPUs, because each can work on its own process and return its work to the central PC. However, in a real-time render such as a game, the next item to process is almost random: does the character stop? Jump? Die? Here it is not possible to render the next step simultaneously, as it is not planned. However, physics, A.I. and the player's interaction with the game (input and output, i.e. render) could be prime candidates for threads in this case.
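The split of a game frame into threads could be sketched roughly like this. All the function names and the state layout are hypothetical, and Python threads are only used to show the structure (they will not give a real speedup for CPU-bound work); a real engine would do this very differently.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical per-frame subsystems; names and state layout are made up for illustration.
def step_physics(state, dt):
    return {"y": state["y"] + state["vy"] * dt}

def step_ai(state):
    return {"enemy_alert": state["y"] > 1.0}

def read_input(state):
    return {"jump": False}  # stand-in for polling real input devices

def run_frame(pool, state, dt):
    # Each subsystem reads the same snapshot of the previous frame,
    # so the three jobs can run side by side.
    jobs = [
        pool.submit(step_physics, state, dt),
        pool.submit(step_ai, state),
        pool.submit(read_input, state),
    ]
    new_state = dict(state)
    for job in jobs:
        # The frame is only complete once every thread has finished.
        new_state.update(job.result())
    return new_state

with ThreadPoolExecutor(max_workers=3) as pool:
    state = {"y": 0.0, "vy": 2.0, "enemy_alert": False, "jump": False}
    for _ in range(3):
        state = run_frame(pool, state, dt=0.5)
print(state)
```

Note that the frames themselves still run one after another: frame N+1 cannot start until frame N is known, which is the "not planned" problem, while the work inside one frame is what gets spread across threads.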
Allowing the CPU to do the actual CPU resource allocation is an interesting, in fact possibly good, idea, but it is harder to implement in a multi-core, multi-CPU situation, and it's the server farms, where CPU density is vital, that will benefit most from these.
This problem of resource allocation could be treated in a similar way to cache coherency on SMP systems. AMD implemented a 'snoop bus' on the old K7 MP chipset. Instead of being used just to ensure cache coherency, this dedicated-bus idea could also be used to inform the other CPU how busy its partner is. When one CPU is trying to allocate a thread and the other CPU has the resources to process it, the thread could be transferred to the second CPU over a high-speed bus similar to, but much faster than, HyperTransport.
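As a toy model of that idea, assuming each CPU can simply read how busy its partner is (the `Cpu` class and its methods are invented for this sketch and have nothing to do with the real chipset):

```python
from collections import deque

class Cpu:
    """Toy CPU with a run queue and a link to its partner (hypothetical model)."""
    def __init__(self, name):
        self.name = name
        self.queue = deque()
        self.partner = None

    def load(self):
        # What the dedicated bus would report to the partner: how busy this CPU is.
        return len(self.queue)

    def allocate(self, thread):
        # Hand the thread over if the partner is less busy, otherwise keep it here.
        if self.partner is not None and self.partner.load() < self.load():
            target = self.partner
        else:
            target = self
        target.queue.append(thread)
        return target.name

cpu0, cpu1 = Cpu("cpu0"), Cpu("cpu1")
cpu0.partner, cpu1.partner = cpu1, cpu0

# Every new thread arrives at cpu0, which decides where it should run.
placements = [cpu0.allocate(f"thread{i}") for i in range(4)]
print(placements)
```

Here queue length stands in for "how busy", which is a big simplification; the point is only that the decision uses information about the partner that arrives over a side channel.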
The CPU would require each process executing on it to have an ID that would tell the allocator the priority of the task and an estimate of its complexity. The complexity estimate would decide how fast the core that processes it would need to be: the game being complex, the mouse not so. All these things would complicate the implementation of both hardware and code.
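A minimal sketch of such an allocator, assuming a made-up threshold and arbitrary units for priority, complexity and core speed (none of these names or numbers come from the article):

```python
from dataclasses import dataclass

@dataclass
class Task:
    task_id: int
    name: str
    priority: int      # higher = more urgent (assumed convention)
    complexity: int    # rough estimate of the work, arbitrary units

@dataclass
class Core:
    name: str
    speed: int         # relative speed, arbitrary units
    busy: bool = False

COMPLEXITY_THRESHOLD = 50  # made-up cut-off: at or above this, the task wants a fast core

def allocate(tasks, cores):
    """Place urgent tasks first; complex tasks get the fastest free core, simple ones the slowest."""
    placement = {}
    for task in sorted(tasks, key=lambda t: -t.priority):
        free = [c for c in cores if not c.busy]
        if not free:
            break  # in a real allocator the remaining tasks would wait
        if task.complexity >= COMPLEXITY_THRESHOLD:
            core = max(free, key=lambda c: c.speed)
        else:
            core = min(free, key=lambda c: c.speed)
        core.busy = True
        placement[task.name] = core.name
    return placement

cores = [Core("fast0", 100), Core("slow0", 30), Core("slow1", 30)]
tasks = [
    Task(1, "mouse", priority=5, complexity=2),
    Task(2, "game", priority=9, complexity=90),
    Task(3, "virus scan", priority=1, complexity=40),
]
placement = allocate(tasks, cores)
print(placement)
```

With these example values the game lands on the fast core and the mouse and virus scan on the slow ones. Doing the same thing in hardware is exactly where the extra complexity would come from.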
Looking closer to today, a dual core will bring the benefits of SMP to the masses! Hyperthreading's a start, but a true dual is great to use.
I have an SMP system, and one thing's for sure: you never want to go back, unless you are a completely obsessive gamer and the difference between 70 and 75 fps is crucial.
What's SMP and why have a dual:
http://www.ocforums.com/showthread.php?t=78677