No it doesn't. Some high-end GPUs running OpenCL are losing against CPUs, prior to AVX2. You can't ignore this. Mainstream GPUs are much weaker too. And while HSA indeed aims to improve GPGPU efficiency, it will not be widely supported.
That's akin to running CUDA on an AMD GPU and saying it sucks at GPU-computing.
NVIDIA has decided to back away from GPGPU in the consumer market, as evident by the Kepler design sacrifices
From the desktop, yes. From HPC? Workstation? The Kepler desktop cards are essentially the same architecture as their bigboy Tesla/Quadro cards except for GPGPU focused necessities like bus width and additional vRAM, ECC, etc. The desktop GTX680s are neutered in CUDA because nVidia was losing sales on Quadro/Tesla cards and that's where their margins are higher.
And Intel isn't going to implement HSA either
No, but they've implemented openCL. Their "HSA" is a ton of little x86 cores aka MIC.
Hence HSA will be limited to a fraction of the market, also decreasing the "cross-platform" value of OpenCL.
openCL is tied only to HSA in that HSA can benefit openCL performance. You don't need an HSA-style architecture for good openCL performance. Hell, even Intel's CPUs benefit from AMD's openCL APP when it comes to throughput.
Homogeneous throughput computing, spearheaded by AVX2, is far more promising than GPGPU will ever be.
Except that it's not here and tied to only one manufacturer. So you're right, except that that scenario only works if you plan on buying Intel for everything...ever. While you might be okay with that (and if you are I'd highly suggest you look into buying yourself an Itanium PC. I hear their throughput is amazing) but I'm not. Proprietary ISAs = slow progress and no competition. Take a look at the blistering pace we've been moving at on the desktop over the years where on-die GPUs are outpacing CPU advancements 3:1 or 4:1. But you're right, that Haswell GPU should only be good at drawing pretty little triangles rather than helping with compute because utilizing that growing GPU (in die size and performance) is definitely a bad idea.
AVX2 = x86.
If you can name me the last time we saw this kind of improvement over a single generation in x86 with this "competition" between AMD and Intel then please let me know. Btw, that's not an ISA optimized benchmark but all architectural.
That's what true competition looks like and is capable of. Instead we get two manufacturers, a silly instruction set limited to a proprietary x86 and you're claiming it's the winner by default. Good luck with that.
edit - and it's not just qualcomm. The A15 is rumored to have nearly the same throughput thus should perform roughly the same as the S4. I'd rather head to an ISA that isn't tied to a specific hardware manufacturer with real competition (that's where HSA/openCL comes in as ARM is part of the HSA foundation) rather than Intel and AMD beefing up the GPU and a comparative stall on CPU performance.
Last edited: