• Welcome to Overclockers Forums! Join us to reply in threads, receive reduced ads, and to customize your site experience!

To be included in Catalyst 5.11: 'Miracle' OpenGL hack

Overclockers is supported by our readers. When you click a link to make a purchase, we may earn a commission. Learn More.
WOW. that's a HUGE increase just from a 'little hack'.

and ATI just NOW discovered this? sounds fishy. Sorta like they were holdin' back all along, but that's just conspiracy theory talkin' there.

I wonder if NVIDIA can pull off some sorta 'miracle hack' like this
 
actually if read carefully
it states that the hack changes the way the gpu accesses
the onboard memory chips
it does not state that it is exclusive to the 1800 series
video cards
imagine what a 9700 pro can do with the hack
 
Raja Koduri, one of the key engineers at ATI, and his team working on programmable memory controllers developed a little tweak that can result in a significant performance increase.

reading it carefully would suggest this only apply to cards with probrammable memory controllers.......the x18xx's. maybe that's reading too carefully? who knows, but we'll find out soon.
 
vulcanman09 said:
actually if read carefully
it states that the hack changes the way the gpu accesses
the onboard memory chips
it does not state that it is exclusive to the 1800 series
video cards
imagine what a 9700 pro can do with the hack

Actually if read more carefully, it is exclusive to cards with programmable memory controller.
 
could they possibly just be utilizing the extra 256 of memory ?

I mean thats what its there for isnt it ?

They have to make a driver for it optimize the new cards, seems like a normal course of action not a hack or some kind of UBER discovery....
 
Rattle said:
could they possibly just be utilizing the extra 256 of memory ?

I mean thats what its there for isnt it ?

They have to make a driver for it optimize the new cards, seems like a normal course of action not a hack or some kind of UBER discovery....

Well, since you are too lazy to read the article :rolleyes:

Our new MC has a view of all the requests for all the clients over time. The "longer" the time view, the greater the latency the clients see but the higher the BW is (due to more efficient requests re-ordering). The MC also looks at the DRAM activity and settings, and since it can "look" into the future for all clients, it can be told different algorithms and parameters to help it decide how to best make use of the available BW.

As well, the MC gets direct feedback from all clients as to their "urgency" level (which refers to different things for different client, but, simplifying, tells the MC how well they are doing and how much they need their data back), and adjusts things dynamically (following programmed algorithms) to deal with this. Get feedback from the DRAM interface to see how well it's doing too.

We are able to download new parameters and new programs to tell the MC how to service the requests, which clients's urgency is more important, basically how to arbitrate between over 50 clients to the dram requests. The amount of programming available is very high, and it will take us some time to tune things.

In fact, we can see that per application (or groups of applications), we might want different algorithms and parameters. We can change these all in the driver updates. The idea is that we, generally, want to maximize BW from the DRAM and maximize shader usage. If we find an app that does not do that, we can change things. You can imagine that AA, for example, changes significantly the pattern of access and the type of requests that the different clients make (for example, Z requests jump up drastically, so do rops). We need to re-tune for different configs. In this case, the OGL was just not tuning AA performance well at all. We did a simple fix (it's just a registry change) to improve this significantly. In future drivers, we will do a much more proper job.
 
Rattle said:
again its the utilization of the new tech, more ram and the 512 ring bus :rolleyes:

Yeah, but it has nothing to do with the extra memory.

It is just an increase in the efficiency of memory controller. Very similar to native comand queing, except that this is actually fast enough to gain an advantage from it.
 
Back