Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Page 425 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Markfw

Moderator Emeritus, Elite Member
May 16, 2002
25,641
14,630
136
Well, I think Stefan's original concern was w.r.t. what the perf/W will be for Zen5.

I'm suggesting that if it's for MT workloads, then E cores could have improved that, but it looks like there won’t be any on Zen5 DT.

So then the question is what perf/W improvement we can expect from the regular Zen5 cores. There's not been any leaks or even speculation about that IIRC. Or has someone found any such info, or would care to guesstimate?
The problem with e-cores is multiple. When you are using ALL cores 100% of the time, you lose a LOT of performance, and avx-512 is not even allowed, even in the early alderlake's before BIOS updates. I could go on. for Intel they may have a reason for desktop, for distributed computing, none. perf/watt then is a useless metic.

Edit: do you see e-cores in Intel server chips ? Not that I know of (could be wrong), AMD c-cores have 100% functionality, just run a little slower for more density.
 
Last edited:

StefanR5R

Elite Member
Dec 10, 2016
5,600
8,036
136
Intel's e cores are currently only available in hybrid CPUs, that is, in combination with differently performing p cores. Already for this one reason, Intel's e cores are not suitable in compute nodes, as they require homogeneous performance and features across all cores.

Some distributed computing enthusiasts do have Intel p+e CPUs, but even though an e core performs roughly similar to one p HT thread, these CPUs are still awkward to handle in a distributed computing node. Just recently I heard of weird issues with Windows' CPU time accounting on these CPUs. And way before that I saw several reports of performance problems of multithreaded distributed computing applications on these CPUs, which are completely to be expected and can only be worked around by restricting the application to run on cores of same type. [EDIT: I wrote this post before seeing @Markfw's post which already points this out.]

Sometime soon, pure e core server CPUs should become available from Intel. But these many-small-cores CPUs are not targeted to HPC. They could be useful in some sub-niches of the HPC niche but won't be as flexible as pure p core CPUs.

Somewhat similar, Zen 4 dense is targeted to cloud hyperscalers and edge computing at this time, and certainly not to HPC. It remains to be seen whether or not Zen 5 dense's targets will extend further, but AFAICT classic HPC will still be served by Zen 5 non-dense.

On the topic of "efficiency": Let's not forget what Intel's e cores are primarily efficient at: In die area. Would have been nice if Intel had called them "ae cores", for "area efficient cores". In #10,842 I chose the terms area optimized, performance density optimized, and performance efficiency optimized. They are correlated to a degree but not the same. For FP performance density and efficiency, you don't go for cores with cut-down FP execution units. For cache-churning applications, performance and energy efficiency suffer if you go for CPUs with cut down caches…

EDIT 2: For energy efficiency at the application level, energy efficiency of the host (and rack…) is what counts. Due to base consumption by memory, cooling, and so on, very low power CPU cores are typically not putting you into the energy efficiency sweet spot of compute nodes, even in the lucky case that your application is able to scale to a large number of cores with negligible overhead.
 
Last edited:

Mahboi

Senior member
Apr 4, 2024
522
836
91
Now that we've smoked out the rat, I can't wait for details on Zen 5 LP.
Not just the perf, but what did they take out, power draw, etc.
Someone already mentioned that with the split AVX 512 implementation they had in Raphael, they can keep AVX 512 including in the LP cores. But will they?
I want a full breakdown of the thing, see what was taken off, etc. I love low power stuff.
 

soresu

Platinum Member
Dec 19, 2014
2,721
1,921
136
Now that we've smoked out the rat, I can't wait for details on Zen 5 LP.
Not just the perf, but what did they take out, power draw, etc.
Someone already mentioned that with the split AVX 512 implementation they had in Raphael, they can keep AVX 512 including in the LP cores. But will they?
I want a full breakdown of the thing, see what was taken off, etc. I love low power stuff.
Be interesting to see areal density too.

Given Bergamo was only a 1.33x increase in cores over Genoa and the Zen5 successor is supposed to be more like 1.5x there must be a significant difference in layout there too.
 

Mahboi

Senior member
Apr 4, 2024
522
836
91
I'm preparing a little video (no I'm not trying to be a MLID/RGT, it's a different kind of video) about Zen 5, can we recap what we know about its internals?
- 8 wide decode
- Same or higher clocks
- SPECINT +40%
- full width AVX 512 implem

What else?
 

itsmydamnation

Platinum Member
Feb 6, 2011
2,804
3,267
136
I'm preparing a little video (no I'm not trying to be a MLID/RGT, it's a different kind of video) about Zen 5, can we recap what we know about its internals?
- 8 wide decode
- Same or higher clocks
- SPECINT +40%
- full width AVX 512 implem

What else?
we know it isn't 8 wide decode , it does something in decode but we don't know exactly what, two fetch blocks is all that is listed. is that parallel, used for branches etc .

I think the only things we can say for certain are from this slide
 

H433x0n

Senior member
Mar 15, 2023
926
1,013
96
Turin is N4P or N4X (not sure which tbh) which is in the same family as N5. Just better/more refined.
I’ve heard from people that I consider reliable that it uses N4X. I’ve got a hard time believing it since N4X was regarded as a bit of a meme.

There’s some tidbits that support it though like the increased over Zen 4 and the higher reported power consumption for desktop parts.
 
Jul 27, 2020
16,824
10,773
106
Some distributed computing enthusiasts do have Intel p+e CPUs, but even though an e core performs roughly similar to one p HT thread, these CPUs are still awkward to handle in a distributed computing node. Just recently I heard of weird issues with Windows' CPU time accounting on these CPUs. And way before that I saw several reports of performance problems of multithreaded distributed computing applications on these CPUs, which are completely to be expected and can only be worked around by restricting the application to run on cores of same type.
Not sure if the Win11 scheduler has been improved but Linux is supposedly better at dealing with hybrid cores: https://www.phoronix.com/news/Linux-6.5-Intel-Hybrid-Sched
 
Jul 27, 2020
16,824
10,773
106
- 8 wide decode
- Same or higher clocks
- SPECINT +40%
- full width AVX 512 implem

What else?
Special instructions to accelerate AI workloads. (Source: AMD slides)

Possibly more performant SMT (due to beefier execution resources). (Source: Hopium)

May hit 6 GHz boost. (Source: Hopium)

First time when a previous gen X3D chip may not be able to touch the nextgen vanilla chip in gaming workloads. (Source: Hopium)

DDR5-6400 will possibly be the base RAM configuration. (Source: Hopium)

RDNA3 iGPU should beat current Intel Core Lake iGPU and Ryzen 7000 series desktop iGPU. May get beaten by Arrow Lake iGPU. (Source: Hopium)

May beat Zen 4 vanilla chips in ECO mode. (Source: Hopium)
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |