IDK... If a single core P4 with HT can beat a single core A64 under DC projects like Folding@home. When time for dual core, one can imagine a dual cored Pentium with HT (on each core) outperforming a dual cored A64.Originally posted by: AbAbber2k
As for future products (multicore) it's hard to say since both have dualcore's on the horizon. I don't really see HT suddenly making Intel king of the hill again though.
Originally posted by: Algere
IDK... If a single core P4 with HT can beat a single core A64 under DC projects like Folding@home. When time for dual core, one can imagine a dual cored Pentium with HT (on each core) outperforming a dual cored A64.Originally posted by: AbAbber2k
As for future products (multicore) it's hard to say since both have dualcore's on the horizon. I don't really see HT suddenly making Intel king of the hill again though.
It was never a matter of price (hence why I didn't say dual cored Pentium 4 with HT), only performance/technology & who knows, Intel could eventually migrate HT'd dual cores into the mainstream. As for games, if they're designed for more than 2 threads, there would probably be demand for 3+ core processors then. IIRC according to Gabe (HL2), it would've been difficult (time restraints and cost for little perf. boost) to code for HT. If that's indication of future games + HT well...Originally posted by: MetalStorm
Originally posted by: Algere
IDK... If a single core P4 with HT can beat a single core A64 under DC projects like Folding@home. When time for dual core, one can imagine a dual cored Pentium with HT (on each core) outperforming a dual cored A64.Originally posted by: AbAbber2k
As for future products (multicore) it's hard to say since both have dualcore's on the horizon. I don't really see HT suddenly making Intel king of the hill again though.
The only Pentium chips that will use dual core and HT will be the EE chips, fancy forking out $1000 for one of those badboys? I didn't think so.
The dual cored EE chips might win at some things, I think rendering especially, but as for games, until they become multi-threaded it won't make a difference, and even when they are multi-threaded, if they're only 2 threads, then HT won't make any difference.
I think you're misunderstanding how the pipeline affects Hyperthreading. The P4 has a worse branch predictor than the Athlon and a much longer pipeline, so pipeline stalls are much more expensive on the P4 than on the Athlon. The only thing Hyperthreading does for the P4 is finding something for the CPU to do while waiting for the pipeline stall to be resolved, when it would normally be sitting there idle. So, the shorter pipeline stalls on the Athlon mean it stands to gain much less from Hyperthreading.Originally posted by: BitByBit
I'm skeptical as to whether AMD wouldn't benefit from multi-threading.
If it were true that pipeline depth affected a processor's ability to execute multiple threads simultaneously, then surely Prescott would be a far better multitasker than Northwood?
Do you have numbers to back that up?The P4 has a worse branch predictor than the Athlon
Not off the top of my head, but I have seen several benchmarks to that effect.Originally posted by: CTho9305
Do you have numbers to back that up?The P4 has a worse branch predictor than the Athlon
To be honest, I'm not sure -- I'm not that familiar with the internals of the Netburst architecture specifically, so I'm just repeating what I've been told by people in the know. I assume they meant pipeline flushing.Originally posted by: CTho9305
When you say a longer pipeline means stalls are more expensive - what exactly would cause this? Thinking about in-order execution, I'd expect that a wider, shorter pipe would be hit harder by a single cycle stall than a narrower, deeper pipe (since fewer instructions could have been committed in a given cycle anyway). Out of order execution makes perfomance analysis with stalls really complicated too. When you say stalls, do you mean pipeline flushes? That would make more sense to me.
Originally posted by: Algere
However for DC projects & the like which already benefit from the Pentium 4's logical CPUs (HT) & are programmed to take advantage of more than 2 threads, I see possibilities here.
How so?Originally posted by: ribbon13
Originally posted by: Algere
However for DC projects & the like which already benefit from the Pentium 4's logical CPUs (HT) & are programmed to take advantage of more than 2 threads, I see possibilities here.
Actually HT hinders DC...
Originally posted by: Algere
How so?
It's known you get more S@H WUs done with HT enabled than with it disabled on the Pentium 4. Lots of ppl to back that up in the DC forum.
Of course for single threaded performance you'll get a marginal or so hit in performance, but we're talkin' DC now where the aim is to crunch out as many WUs as possible. Yes one WU will be slower to finish with HT enabled than without but IDK if your forgettin' that with HT you have 2 WUs crunching @ the same time. For instance...Originally posted by: ribbon13
Originally posted by: Algere
How so?
It's known you get more S@H WUs done with HT enabled than with it disabled on the Pentium 4. Lots of ppl to back that up in the DC forum.
When you try to run two units at once each unit takes longer to finish, it adds a global delay. I've been trying to find out where I heard it exactly. I'm pissed because I know I read it on these forums, but the search engine is a piece of #%^$@%^!!11one1
The only thing Hyperthreading does for the P4 is finding something for the CPU to do while waiting for the pipeline stall to be resolved, when it would normally be sitting there idle. So, the shorter pipeline stalls on the Athlon mean it stands to gain much less from Hyperthreading.
Originally posted by: dmens
Eh? SMT wasn't introduced to overcome a design weakness in P4 (there are plenty of other features that specific address netburst glass jaws). It's been around since Willamette. And all processors will benefit from SMT. The only reason not to do it is design time and validation effort.