- Mar 3, 2017
- 1,659
- 6,101
- 136
No, lol.SMT uplift exceeds ST uplift.
Zen3 had more EUs without a corresponding bump in OoO resource but SMT yield went down.Core is wider, so enabling SMT should provide a bigger gain in performance than It did to Zen4 for example.
That 35W 3050M scores around 4500 pts in 3DMark Time Spy graphics.There is discrepancy, true, but look what TDP that RTX 3050 has. Only 35W!
35W RTX 3050 has only 713MHz base and 1058MHz turbo, that's very very low.
Strix Point IGP needs 20-30% higher clocks to match It, that's only 1270-1375Mhz.
For comparison: RX 6550M(RDNA2) has 16CU and 2560MHz boost, 16MB IC and 144GB/s.
With basically ~1/2 clockspeed 102-136.5GB/s(6.4-8.53gbps) should be enough to feed both IGP and CPU in Strix Point.
P.S. If Strix Point had enough BW, then at 2.6GHz It could go even against RTX 4050 35W.
Huh.... Well, 25 % more power for 33% more cores and 25-ish % ( on average) more "IPC" is good tradeoff in my book.It kinda does, 25% socket power bump on the same platform is pretty major.
But the perf signs.
Wow you've almost nailed Turin perf.Well, 25 % more power for 33% more cores and 25-ish % ( on average) more "IPC" is good tradeoff in my book.
Venice is silly expensive.Guess we will have to wait for Zen6 for substantial ppw improvement with compete SoC/package overhaul.
That's not true.That 35W 3050M scores around 4500 pts in 3DMark Time Spy graphics.
780M scores around 2800-2900 pts.
16 CU version will have at best 3-3.2 GHz core clock, and 25% more CUs.
There is no world in which 16 CU Strix Point can achieve 35W 3050M performance.
Yep, true, its 33% I was thinking backwards about its CU count.That's not true.
Top score for RTX 3050M is 5295 pts in 3DMark Time Spy graphics. Notebookcheck.net
Your 4500 pts is only 15% less, yet boost frequency at 35W is 40% lower than at >=80W.
That doesn't make sense.
If I calculated points based on boost frequency difference, then I would end up with 5295*0.6=3177pts.
Btw, the weakest score they have for this GPU is 3281 pts.
4500 pts is clearly for higher TGP.
P.S. 16CU is 33% more than what Phoenix has.
I don't guarantee anything, I just compared the specs and made some simple calculations, but maybe I underestimate how much that 16MB IC actually helps.Yep, true, its 33% I was thinking backwards about its CU count.
I hope you are right with your calculations.
Uh...I don't guarantee anything, I just compared the specs and made some simple calculations, but maybe I underestimate how much that 16MB IC actually helps.
No, lol.
Turin-D SMT on is just a 10% socket-level perf bump over SMT off.
Zen3 had more EUs without a corresponding bump in OoO resource but SMT yield went down.
Zen4 has little in a way for raw EU count bump while having more OoO resource yet SMT yield is up.
Correct. And there will be Zen5 AM5 desktop SKUs with:Zen 5 standard would be N4 with 8 cores per CCD
Zen 5c N3 with 16 cores per CCD
Zen 6 standard N3 with 16 cores per CCD
Zen 6c N2 with 32 cores per CCD
There is no good reason to release 32C64T Zen5C, when It's not capable of high clocks at low thread count.Correct. And there will be Zen5 AM5 desktop SKUs with:
2xZen5 8C = 16C
1xZen5 8C + 1xZen5C 16C = 24C
2xZen5C 16C = 32C
Just called Lisa Su and double checked. Apparently it hasn't rippled down through the hierarchy to adroc_thurston yet.
Yeah.That's what, half (or even less) of what Bergamo gets with the same core count?
Well this is SIR, about as SMT-friendly as it gets really.Granted SMT yield is fairly dependent on which enterprise benchmarks you use
That's also true for Zen5 and every other "new core" by AMD.Zen3 was able to get great utilization out of its additional EUs, had good branch prediction improvements, etc.
Kinda the point and also the reason why you're not really supposed to run Turin-D SMT on.If somehow utilization has improved despite this, everything else being equal, that would manifest as obscene 1T gains while reducing SMT yield
Other way around.Assuming that both the "30%+ IPC" and "no way it's going to be 30% IPC" camps are valid, there's an easy explanation:
SMT uplift exceeds ST uplift.
Building a wide core and feeding a wide core are two different matters. But to the extent that Zen5 has difficulty utilizing its wider structures in ST, it should have an easier time utilizing those structures via SMT.
There is no good reason to release 32C64T Zen5C, when It's not capable of high clocks at low thread count.
It would be a very niche product.
Kinda the point and also the reason why you're not really supposed to run Turin-D SMT on.
Other way around.
Cinememe is a horrible benchmark for server parts. Full stop.But I can't quite seem to square this circle in my head with the 2x64c Turin Cinebench leak, showing only a ~15% uplift
It's not a miracle, you just cover it in generous amounts of OoO and BP and scheduling resources.
- Zen5 is so much wider than Zen 4, and
- AMD engineers pulled a small miracle and Zen5 utilizes its resources so much better than Zen4 does that SMT uplift is somewhere between halved the thirded.
It's a single-threaded core with SMT tackled on or you should at least treat it as such.How do we square this circle?
The point is what core you chose for that 32 core CPU.It's on the optimal point on the efficiency curve. Max MT performance at lowest power consumption. Best perf/watt.
It's why they set a 170W TDP on AM5. The 16C Zen4 was just temporary. The last few extra 100 MHz:es on 16C when using the full 170W are pointless and just consumes a lot of power with very little perf gain.
The long-term intention with the 170W on AM5 was of course preparation for more cores on Zen5 and later, with 24/32C. Quite obvious.
Cinememe is a horrible benchmark for server parts. Full stop.
It's a single-threaded core with SMT tackled on or you should at least treat it as such.
It just doesn't scale to 64c*2p systems.Have we seen different perf/clock uplifts in this benchmark between server and consumer parts in the past for new architectures?
Yeah it would but really depends on the workload.Lowering SMT yield won't lower MT performance
Goto China, buy a candyvan, trap a random Lenovo validation eng in said van, and maybe you'll get something out of it.We need leakz
Oh ye forgot to comment, and slightly tangential, but from Zen4C testing, it appears as if it doesn't win perf/clock against Zen 4 anywhere on the frequency curve despite what AMD said. Implementing the -C cores on a a better node might help differentiate them.Zen5C most likely will clock a lot lower than standard Zen5.
16 Zen5 vs 32 Zen5C
It would lose horribly in programs using only 16 threads.
This CPU would be good only for those who can use all of It.
So a very niche product.
GNR?and with Genoa,Genoa-X and Bergamo do crushing anything Intel has, or is going to release soon,
Channel sales hard.how is it that they STILL can't get more market share
Not made to, it's a power/area play.it appears as if it doesn't win perf/clock against Zen 4 anywhere on the frequency curve
Who tested It? Link, please.Oh ye forgot to comment, and slightly tangential, but from Zen4C testing, it appears as if it doesn't win perf/clock against Zen 4 anywhere on the frequency curve despite what AMD said. Implementing the -C cores on a a better node might help differentiate them.