Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

DisEnchantment · Sep 29, 2022

Speculate at will

DrMrLordX · May 12, 2024

Markfw said:
But Genoa is 5nm, and Turin is supposed to be 3 or 4 nm ?

Turin is N4P or N4X (not sure which tbh) which is in the same family as N5. Just better/more refined.

itsmydamnation · May 12, 2024

Mahboi said:
I'm preparing a little video (no I'm not trying to be a MLID/RGT, it's a different kind of video) about Zen 5, can we recap what we know about its internals?
- 8 wide decode
- Same or higher clocks
- SPECINT +40%
- full width AVX 512 implem

What else?

we know it isn't 8 wide decode , it does something in decode but we don't know exactly what, two fetch blocks is all that is listed. is that parallel, used for branches etc .

I think the only things we can say for certain are from this slide

Zen 5’s Leaked Slides

A YouTuber called Moore’s Law is Dead recently leaked a couple AMD slides about Zen 5. I typically find leaks uninteresting as they are impossible to verify and often don’t correspond t…

chipsandcheese.com

jarablue · May 12, 2024

Does anyone know when the 9000 series will hit the stores? Microcenter here I come!

H433x0n · May 12, 2024

DrMrLordX said:
Turin is N4P or N4X (not sure which tbh) which is in the same family as N5. Just better/more refined.

I’ve heard from people that I consider reliable that it uses N4X. I’ve got a hard time believing it since N4X was regarded as a bit of a meme.

There’s some tidbits that support it though like the increased over Zen 4 and the higher reported power consumption for desktop parts.

RnR_au · May 12, 2024

jarablue said:
Does anyone know when the 9000 series will hit the stores? Microcenter here I come!

Early Q3 is one estimate.

Thunder 57 · May 13, 2024

RnR_au said:
Early Q3 is one estimate.

What about @adroc_thurston saying it would already be available? Surely he is an insider, no?

gdansk · May 13, 2024

Thunder 57 said:
What about @adroc_thurston saying it would already be available? Surely he is an insider, no?

April but now July.

adroc_thurston · May 13, 2024

gdansk said:
April but now July.

April was the launch, not the availability window.

Thunder 57 · May 13, 2024

adroc_thurston said:
April was the launch, not the availability window.

Lol, you are just making crap up. So what was April, when they started HVM?

adroc_thurston · May 13, 2024

Thunder 57 said:
Lol, you are just making crap up

There's no need to be upset.

Thunder 57 said:
So what was April, when they started HVM?

Mobo vendor timeline.

igor_kavinski · May 13, 2024

Markfw said:
I have 352 Genoa cores myself.

On a single socket? Are you able to utilize them properly or do you need to resort to putting your workloads in VMs for better core occupancy?

igor_kavinski · May 13, 2024

StefanR5R said:
Some distributed computing enthusiasts do have Intel p+e CPUs, but even though an e core performs roughly similar to one p HT thread, these CPUs are still awkward to handle in a distributed computing node. Just recently I heard of weird issues with Windows' CPU time accounting on these CPUs. And way before that I saw several reports of performance problems of multithreaded distributed computing applications on these CPUs, which are completely to be expected and can only be worked around by restricting the application to run on cores of same type.

Not sure if the Win11 scheduler has been improved but Linux is supposedly better at dealing with hybrid cores: https://www.phoronix.com/news/Linux-6.5-Intel-Hybrid-Sched

igor_kavinski · May 13, 2024

Mahboi said:
- 8 wide decode
- Same or higher clocks
- SPECINT +40%
- full width AVX 512 implem

What else?

Special instructions to accelerate AI workloads. (Source: AMD slides)

Possibly more performant SMT (due to beefier execution resources). (Source: Hopium)

May hit 6 GHz boost. (Source: Hopium)

First time when a previous gen X3D chip may not be able to touch the nextgen vanilla chip in gaming workloads. (Source: Hopium)

DDR5-6400 will possibly be the base RAM configuration. (Source: Hopium)

RDNA3 iGPU should beat current Intel Core Lake iGPU and Ryzen 7000 series desktop iGPU. May get beaten by Arrow Lake iGPU. (Source: Hopium)

May beat Zen 4 vanilla chips in ECO mode. (Source: Hopium)

del42sa · May 13, 2024

Mahboi said:
I'm preparing a little video (no I'm not trying to be a MLID/RGT, it's a different kind of video) about Zen 5, can we recap what we know about its internals?
- 8 wide decode
- Same or higher clocks
- SPECINT +40%
- full width AVX 512 implem

What else?

you forget AI gimmick

Mahboi · May 13, 2024

I do happily forget the AI gimmick every single day, and forget it again as soon as anyone mentions it.

igor_kavinski · May 13, 2024

Markfw said:
I think Stefan was talking about 32-128 fat cores with avx-512 for DT performance

Intel can't even afford to put 16 fat cores. Power consumption would either shoot beyond 500W or the cores would be power starved if Intel limits the TDP.

StefanR5R · May 13, 2024

Mahboi said:
Now that we've smoked out the rat, I can't wait for details on Zen 5 LP.
Not just the perf, but what did they take out, power draw, etc.

soresu said:
Be interesting to see areal density too.

Apparently there will be only a small number of LP cores. [Purpose: to host background tasks in idle situations/ connected standby maybe — not to prop up Cinebench. ;-) ] Thus, areal density, while not unimportant, may not be a central design goal. For Zen 5LP, that is.

soresu said:
Given Bergamo was only a 1.33x increase in cores over Genoa and the Zen5 successor is supposed to be more like 1.5x there must be a significant difference in layout there too.

Genoa and Bergamo still have some spare room under the lid. (According to published photos, not that I'd delidded one myself.) I guess the new IOD for Turin and Turin-Dense could be a more slender rectangle than Genoa's and Bergamo's IOD, for some more "shore line" to area ratio. Both for putting the additionally needed GMI links on the chip and to facilitate their routing on the package.

The 96 core Threadripper 7995WX however is very crowded under the hood. But whether there will be a direct Zen 5 based successor to this one remains to be seen anyway.

Markfw said:
I have 352 Genoa cores myself.

igor_kavinski said:
On a single socket?

4x 64c/128t and 1x 96c/192t according to Mark's signature. All 1P I think.

igor_kavinski said:
Are you able to utilize them properly or do you need to resort to putting your workloads in VMs for better core occupancy?

In Distributed Computing, we often run n instances of single-threaded processes. This scales without problem to so many threads (on Linux; I am not up to date with Windows). Sometimes we run fewer instances of multi-threaded processes. With some of such applications, performance suffers a lot if threads of one such process end up running on different CCXs. That has been an issue with Zen 1...4 and obviously will remain with Zen 5. Hard to say what will happen with Zen 6 with its substantially changed SOCs. (Or Strix Halo already, in fact.) The problem is two-fold: Inter-thread shared data gets onto more caches than strictly needed, and inter-thread communications beyond CCX boundaries is slow and energy costly. But we don't need VMs or even containers to solve this; we can do this with helper tools, or in case of EPYCs can use a BIOS option which (ab)uses NUMA hints to coerce NUMA aware operating system into cache-aware thread scheduling. (Neither Windows' nor Linux's kernel implement a cache-aware scheduling policy. The kernel developers probably have their reasons to leave this to userspace to handle.)

EDIT: CPUs with unified last level cache would remove this inconvenience. But the price to pay would be higher latency of the last level cache, and higher chip manufacturing costs.

EDIT 2,

StefanR5R said:
Some distributed computing enthusiasts do have Intel p+e CPUs, but even though an e core performs roughly similar to one p HT thread, these CPUs are still [...troublesome...]

igor_kavinski said:
Not sure if the Win11 scheduler has been improved but Linux is supposedly better at dealing with hybrid cores: https://www.phoronix.com/news/Linux-6.5-Intel-Hybrid-Sched

When Inte'ls offer was 8c/16t + 8c/8t, even with scheduling like that (regardless if implemented in kernelspace or userspace), you are left with a large asymmetry. It is now better with 8c/16t + 16c/16t at the top end, but still not symmetric.
As an aside, recall how Intel solved this in their LGA1700 Xeon line: ark.intel.com (Though this line is not targeted to compute servers.)

StefanR5R · May 13, 2024

igor_kavinski said:
Possibly more performant SMT (due to beefier execution resources). (Source: Hopium)

What about: Possibly stagnant or even lower SMT uplift, despite beefier execution resources, due to much improved frontend; source: Hopium? :-)

Goop_reformed · May 13, 2024

StefanR5R said:
What about: Possibly stagnant or even lower SMT uplift, despite beefier execution resources, due to much improved frontend; source: Hopium? :-)

My hopium is for >30% st uplift.

igor_kavinski · May 13, 2024

Goop_reformed said:
My hopium is for >30% st uplift.

It will be hopefully higher in certain cases. Some applications/games will benefit more from the expanded resources than others that are bottlenecked elsewhere, either due to bad programming or simple limits of x86 instruction execution.

del42sa · May 13, 2024

Goop_reformed said:
My hopium is for >30% st uplift.

get copium instead https://wccftech.com/intel-royal-co...nther-cove-cpu-architecture-tackle-amd-zen-5/

Thibsie · May 13, 2024

WtfTech using Mlid as source. ROTFLMAO

adroc_thurston · May 13, 2024

Thibsie said:
WtfTech using Mlid as source. ROTFLMAO

somebody post the human_centipede.png again

Makaveli · May 13, 2024

Thibsie said:
WtfTech using Mlid as source. ROTFLMAO

They have also used comments from this thread to source articles.

Joe NYC · May 13, 2024

del42sa said:
get copium instead https://wccftech.com/intel-royal-co...nther-cove-cpu-architecture-tackle-amd-zen-5/

From August 2021

Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Golden Member

Lifer

Platinum Member

Member

Senior member

Golden Member

Platinum Member

Platinum Member

Platinum Member

Platinum Member

Platinum Member

Lifer

Lifer

Lifer

Member

Senior member

Lifer

Elite Member

Elite Member

Member

Lifer

Member

Senior member

Platinum Member

Diamond Member

Platinum Member