This 939 platform memory bandwidth, as estimated from some test data (so result is preliminary), is impressive. Its efficiency is around 86-90%, which is 15-20% (to be confirmed with more 939 test data) better than the P4 QDR dual channel counterpart.
Its effective bandwidth (not max), running at the same memory bus speed, is about 15-20% higher than that of P4 QDR dual channel and 81-89% higher than that of 754 platform or nforce2 dual channel.
Estimation and importance of 939 platform memory bandwidth
A major difference between the AMD 754 and 939 platforms is the memory bus, i.e. 64-bit memory bus for 754 vs the 128-bit memory bus for 939. Here put it some estimate (since 939 is not commonly available yet) to see the potential impact on memory bandwidth performance.
I think there is a significant advantage from the 939 128-bit memory bus and on-chip dual channel controller, it is very different from the nforce2 dual channel which has only few % memory bandwidth improvement over single channel, as shown below.
memory_bandwidth_efficiency = effective_memory_bandwidth / max_memory_bandwidth
1. In the P4 arena, the dual channel QDR efficiency is around 75% with 64-bit memory bus
max_memory_bandwidth = FSB x 4 x 8 = 32 FSB
effective_memory_bandwidth = FSB x 4 x 8 MB/s x 0.75 ~ 24 FSB
2. XP nforce2 single channel efficiency ~ 85-90%
max_memory_bandwidth = FSB x 2 x 8 = 16 FSB
effective bandwidth = 0.875 x 2 x 8 x FSB ~ 14 FSB
3. XP nforce2 dual channel effieiency ~ 90 - 95% (actually should be 45-48%, depends on how it is counted)
max_memory_bandwidth = FSB x 2 x 8 x 2 = 32 FSB
max_FSB_bandwidth = FSB x 2 x 8 = 16 FSB (FSB limits dual channel memory bandwidth)
effective bandwidth = 0.925 x 2 x 8 x FSB ~ 14.8 FSB
4. 754 hardwares have been around for a while, and we have seen its memory bandwidth being around 95%.
For 754 platform, memory bandwidth efficiency ~ 95%
max_memory_bandwidth_754 = 2 x 8 x memory_bus_frequency = 16 memory_bus_frequency
effective bandwidth = 0.95 x 2 x 8 x memory_bus_frequency = 15.2 memory_bus_frequency
E.g. from Maxvla's system screenshot (
http://www.maxvla.com/host/komusa4200b.jpg), a 754 memory benchmark (integer buffered iSSE2) shows the 754 memory efficiency being around 4574/4800 = 95%.
At 300 MHz, the max bandwidth would be 4800 MB/s for single channel, and 9600 MB/s for 128-bit bus (theoretical max).
5. For the 939 128-bit memory bus, there is a good possibility that it could be higher than 75% (the P4 QDR number) due to its direct 128-bit memory bus:
- max_memory_bandwidth_939 = 2 x 16 x memory_bus_frequency = 32 memory_bus_frequency
- At 80%, 300 MHz, the effective bandwidth would be 7680 MB/s
- At 90%, 300 MHz, the effective bandwdith would be 8640 MB/s
- At 95%, 300 MHz, the effective bandwdith would be 9120 MB/s
(ECC is not required in 939).
I think the 128-bit memory bus could be more efficient than the 64-bit QDR, hope it is close to the single channel number ~ 85-90%. This will be confirmed when actual 939 hardwares come out.
(Will see)
- Ref result 1
I have seen number on memory bandwidth for a FX51 to a A64, assuming running same bus frequency. This is an example,
FX51 - 5315 MB/s (dual channel 128-bit bus)
A64 - 2954 MB/s (64-bit bus)
bandwidth_128_bus / bandwidth_64_bus = 5315 / 2954 = 1.8
So assuming the 64_bus has 95% efficiency, then the
128_bus_efficiency would be 95% * 1.8 / 2 = 86%.
- Ref result 2
http://www.ocworkbench.com/ocwbcgi/ultimatebb.cgi?ubb=get_topic;f=29;t=000711
A64-FX (939) DDR400 dual channel - 5763.5 MB/s
A64-FX (939) DDR400 dual channel disabled - 3101 MB/s
Not clear in the result about the memory bus speed, let's assume it is 200 MHz for the math.
The 939 dual channel 128-bit memory efficiency = 5763.5 / 6400 = 90% !!!
Improvement over dual channel disabled = 5763.5 / 3101 = 1.86 (or 86%) (impressive bandwidth improvement)
Summary (preliminary numbers, may vary as more 939 test results become available):
- If further confirmed by more 939 hardwares, this
86 - 90% number on bandwidth efficiency for 939 128-bit is
15 - 20% higher than the 75% QDR of P4 (64-bit).
- At 86-90% efficiency, the effective bandwidth for the 939 128-bit memory bus would be
81 - 89% higher than that of a 754 64-bit memory bus, with assumed 95% memory efficiency.
This higher bandwidth in 939 would have significant impact on memory intensive applications such as video and image streaming, applications using spatially structured data as in scientific computation, ..., as well as 3Dmark01.
PS:
For video, image streaming, data needs to be refreshed constantly from the main memory (L3) to the on chip L2 via the memory bus (same as FSB in P4 and XP) as size of video data >> L2 size at any given time. So the high P4 dual channel memory bandwidth delivers an advantage. For the upcoming 939, I think it would even be better due to its 128-bit memory bus (w/ dual channel controller).
Let BW stands for effective memory bandwidth (not max),
DC stands for dual channel memory controller,
SC stands for single channel memory controller,
for the same bus speed (system bus, memory bus)
BW_939 > BW_P4_DC > BW_754 > BW_XP_DC > BW_XP_SC
at a ratio estimated respectively about
86-90 : 75 : 48 : 47 : 44
or
BW_939 = 27.5 - 28.8 bus (to be confirmed when 939 available)
BW_P4_DC = 24 FSB
BW_754 = 15.2 bus
BW_XP_DC = 14.8 FSB
BW_XP_SC = 14 FSB
Multiply the corresponding number and FSB in MHz will give the MB/s memory bandwidth.
E.g. FSB = 200 MHz, mem_fsb_ratio 1:1, BW_P4_DC = 24 x 200 = 4800 MB/s