Originally posted by: dmens
... people getting paid more than I do run lots of simulations to figure out how to tweak the knobs.
Is that jelousy I hear?
Originally posted by: Markfw900
Well, after the 1 zillionth post on how the next generation chips MIGHT do, it's also really annoying. I have as much right t say that is annoying as you do to be annoying.
Originally posted by: Markfw900
I really think all this supposition is a waste. Wait until it comes out, and then discuss. I guess some people have nothing better to do than BS on things that are in the future....
I think conroe is using a very similar architecture compare to amd 64. short pipeline, more work done per clock, low mhz, low power consumption. This is testament to how successful a64 is. They in my opinion basically copied amd 64 by reverting back to classic Petium 3 based designs: pentium-m etc.
Originally posted by: stevty2889
Very interesting article. I don't think the FSB is going to be as much as a limiter for Conroe as the seem to believe however. While netburst chips are very bandwidth hungry, the FSB doesn't seem to have nearly as much impact on pentium-m's, and conroe should be a lot more similar to a pentium-m than to a netburst chip.
Originally posted by: pedramrezai
MrS,
As far as I can remember SSE2 was a major update for intel; at least they claimed it is a mojor instruction set and AMD was forced to include it in its products. Maybe SSE4 has the same fate.
I know that they do not need to increase HTT in order to use faster memories but it only makes sense to use faster memories with faster HTT in a 1:1 basis. I give you an example: WinRAR 2.50 has a benchmark tool an here are some benchmarks:
Athlon FX60@ 13x200=2600 (Dual DDR400) = 457 Kb/s (Neoseeker) Cache=2x1MB
Athlon FX60@ 11x250=2750 (Dual DDR500) = 530 Kb/s (Neoseeker) Cache=2x1MB
Sempron1.6Ghz@8x300=2400(Single DDR600) =605 Kb/s (My system) Cache=1x256Kb
Athlon64@12x300=2400(DualDDR600-ocz4800)=632 kb/s (Neoseeker) Cache=1x?
As you see at a certain frequency it is only mem bandwith that lets higher performance. a
single core A64@2400 with 50% more bandwith gives 38% more performance than a
dual core FX60. I think we must wait to see how future A64 will take advantage of DDR2.
Originally posted by: keysplayr2003
Originally posted by: Markfw900
I really think all this supposition is a waste. Wait until it comes out, and then discuss. I guess some people have nothing better to do than BS on things that are in the future....
Well, that's something you're just gonna have to deal with now isn't it?
Nobody grabs your hand, moves your mouse over just conroe threads and smashes your index finger with a ball pein hammer to enter the thread do they? You may be tired of the threads, but it still appears you want to see what everyone is talking about in these threads.
Originally posted by: dmens
A few nitpicks:
1. The author misintepreted 4-issue. Intel nomenclature defines "issue" as the pipeline width from the frontend to the backend, whereas others define that as scheduler to execution, which intel refers to as "dispatch". Given that definition, the claim that the FSB will affect 4-wide issue is bunk, since that is instruction fetch, barely affected by bandwidth or even latency issues.
2. I fail to see how AMD64 should be considered "next generation" compared to merom, especially when no justification is given. Of course AMD has technologies that merom lack, but the exact same can be said vice versa.
3. The author makes a comparison between merom's FSB frequency and AMD64's on-die memory controller frequency, which is kind of meaningless.
4. Large caches are not the only way to design around lower memory bandwidth. Hard/soft prefetchers do wonders on many workloads... sometimes wiping out the entire latency differential. Naturally that exposes glass jaws, but as long as the common case is fast, the processor will do ok.
5. If anything, 64-bit registers will shrink text size, since the compiler will generates less text for code which manipulates 64/128 bit values. As for larger datasets, is that really true? Since the registers and memory are still accessible at smaller granularities, programmers will not suck up more memory for the sake of it. Although I agree that programs are hogging more memory, in which case, a larger cache will help just as much as anything else.
6. The article makes no mention of FB-DIMM, which conroe/woodcrest will support.
7. The author asserts that the FSB will limit the efficiency of merom's wider issue width. True, but only if merom's buffer structures are narrow in depth... but they're not (sorry I cannot give numbers). Combine that with a high speed FSB, smarter prefetcher, software assist, etc, I believe merom's glass jaws due to memory fetch bandwidth will not be anywhere as severe as the article says.
I agree that CSI will be an equalizer, but the article definitely puts an overemphasis on the FSB, which only makes a performance difference at the high end. As mentioned above, there are many ways to design around the low fetch bandwidth.