Posted October 13, 2011

Had AMD gave each core its own FPU, I'm sure the benchmarks would have been a lot better. But it seems the new Bulldozer design is more about manufacturing processors on the cheap rather than delivering high performance.

AMD's market is data centers; unless you do procurement for a data center, they don't much care what you think. Their division of the FPUs is driven by their knowledge of data center workloads, and their decision is one of the reasons they were able to get 8 full cores on the chip and still be able to manufacture it.
The only problem is that, as of right now, they can't really provide the number of units needed to cover serious data center requirements; couple this with the fact that server grade motherboards which can take this processor are lacking compared to their Intel counterparts, and the fact that support overall is better on the Intel side means that businesses and large data don't really have a reason to switch to their architecture.
Also, each module has two ALUs, but only one FPU/SIMD unit. If you're doing a bunch of parallel integer tasks, then yes, this could come close to double the performance. If you're doing a bunch of floating point tasks, as "distributed computing" workloads generally are, then this won't improve performance much at all.
Another thing is that Anand's N x N queen test also shows that Bulldozer has worse branch prediction than the previous generation (branch prediction is even more important for Bulldozer because it has a longer pipeline) as well as that cache latencies are terrible (between 25 and 125% slower than Deneb or Sandybridge processors).
Oh, forgot to add the FX-8150 has a thermal design power of 125 W (for short periods it can spike above 125W, but the long-term average is capped at that level), whereas the i5-2500K and i7-2600K are both rated at just 95W. Another reason most datacenters won't go with this one as power/cooling costs are one of the largest money sinks.
Post edited October 13, 2011 by AndrewC