Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Page 222 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.
Jul 27, 2020
17,159
11,023
106
My impression is that AMD engineers have designed the c cores so that even though each core has lower IPC than a fat core, when they are working in tandem, they work well better together to deliver an almost similar MT uplift at lower power consumption which is just amazing engineering. The only downside is some ST workload ending up on a c core which would bring performance down for it so the Windows scheduler needs to be careful about preventing that.
 

StefanR5R

Elite Member
Dec 10, 2016
5,633
8,107
136
How can a core with halved L3 have the same IPC? Has L3 suddenly become irrelevant?
Level 3 cache is not part of the core. It's part of the core complex.
In mobile parts, AMD has provided less level 3 cache compared to desktop and server parts before "dense" a.k.a. "cloud-native" cores were introduced.

In post #5,522, it was not stated what amount of cache the 8 core comparison part should have.

even though each core has lower IPC than a fat core,
But they have the same IPC in the Zen 4 generation (if the amount of cache is the same, if memory interface is the same, etc.). *Maybe* they won't have the same floating point IPC in the Zen 5 generation anymore, but I guess integer IPC is still the same.

But the inter core communication is somehow better between the c cores.
That'd be a property of the core complex again, not of the cores.
 
Jul 27, 2020
17,159
11,023
106
But they have the same IPC in the Zen 4 generation
So the only difference is that the c cores can't be clocked higher and they are more power optimized? That raises an interesting question. Does the fat core need more circuitry to sustain higher frequency? Or are the higher frequency transistors taking up more space (maybe spread further apart from each other to keep heat down)?
 

StefanR5R

Elite Member
Dec 10, 2016
5,633
8,107
136
So the only difference is that the c cores can't be clocked higher and they are more power optimized?
The Zen 4 "dense" cores are only area optimized (and the major repercussion of it is their lower f_max), not power optimized.

(If you recall spectacular perf/W of Bergamo, even better than Genoa in fully scalable compute-bound workloads, then that's because Bergamo operates even nearer the most power efficient f-V spot then Genoa and has got even more cores on top of the same IOD and RAM foundation as Genoa. Bergamo must work nearer this sweet spot than Genoa simply because Bergamo's power budget per core is lower than Genoa's.)

I don't know whether or not Zen 5 "dense" cores will have more optimization work applied. It was claimed here in this thread that they will continue to be a density play first and foremost.
 

HurleyBird

Platinum Member
Apr 22, 2003
2,697
1,293
136
(If you recall spectacular perf/W of Bergamo, even better than Genoa in fully scalable compute-bound workloads, then that's because Bergamo operates even nearer the most power efficient f-V spot then Genoa and has got even more cores on top of the same IOD and RAM foundation as Genoa. Bergamo must work nearer this sweet spot than Genoa simply because Bergamo's power budget per core is lower than Genoa's.)

I believe that frequency and voltage being equal, the dense cores do consume less power, but I think that's all based on transistor count, distances, and maybe process, rather than anything architectural.
 

maddie

Diamond Member
Jul 18, 2010
4,783
4,759
136
Level 3 cache is not part of the core. It's part of the core complex.
In mobile parts, AMD has provided less level 3 cache compared to desktop and server parts before "dense" a.k.a. "cloud-native" cores were introduced.

In post #5,522, it was not stated what amount of cache the 8 core comparison part should have.
A bit pedantic, but whatever.

But they have the same IPC in the Zen 4 generation (if the amount of cache is the same, if memory interface is the same, etc.). *Maybe* they won't have the same floating point IPC in the Zen 5 generation anymore, but I guess integer IPC is still the same.
In the Zen4 gen, the Zen4c have 1/2 L3 as full Zen4, and AMD still claims same IPC, with no disclaimers as you have inserted, which makes no sense. How you claim (if the cache amount is the same), when it can't be, as they're 2 unique designs, with many, but not all elements shared.
 

adroc_thurston

Platinum Member
Jul 2, 2023
2,813
4,121
96
not power optimized.
Yes they are, literally lower Cac.
even better than Genoa in fully scalable compute-bound workloads, then that's because Bergamo operates even nearer the most power efficient f-V spot then Genoa and has got even more cores on top of the same IOD and RAM foundation as Genoa.
No they both operate about their optimum V/f spots.
 

Saylick

Diamond Member
Sep 10, 2012
3,269
6,752
136
This is something I've struggled to understand. How can a core with halved L3 have the same IPC? Has L3 suddenly become irrelevant?
It’s only going to be a few % lower. Not enough to undo the IPC gains. But in mobile applications the standard Z5 core gets half the normal L3 anyways so I bet Z5c is actually comparable.
 

Gideon

Golden Member
Nov 27, 2007
1,687
3,837
136
It’s only going to be a few % lower. Not enough to undo the IPC gains. But in mobile applications the standard Z5 core gets half the normal L3 anyways so I bet Z5c is actually comparable.
What is the rumored total L3, still 16MB?

Zen5c should support 16 core CCX, so is it shared between 12 cores or partitioned to a 4x "big" core CCX and a 8x"small" core one?

If the latter, it better have at least 16 + 8 = 24MB total L3.
Otherwise it's a de facto L3 decrease per CCX to Zen2 levels (which will hurt in games)
 

FlameTail

Platinum Member
Dec 15, 2021
2,634
1,461
106

TESKATLIPOKA

Platinum Member
May 1, 2020
2,381
2,879
136
This is something I've struggled to understand. How can a core with halved L3 have the same IPC? Has L3 suddenly become irrelevant?
In the Zen4 gen, the Zen4c have 1/2 L3 as full Zen4, and AMD still claims same IPC, with no disclaimers as you have inserted, which makes no sense. How you claim (if the cache amount is the same), when it can't be, as they're 2 unique designs, with many, but not all elements shared.
Because most likely that IPC is measured at 1T, so It doesn't matter If you use Zen4(5) or Zen4(5)c core, you still have the whole L3 for that single core.

BTW, I question If L3 is really partitioned between standard and dense cores.

This is PHX2 and L3 doesn't look like It's separated physically.
I think It's not separated at all. Just the amount is 16MB for 12 cores.
 
Last edited:

maddie

Diamond Member
Jul 18, 2010
4,783
4,759
136
Because most likely that IPC is measured at 1T, so It doesn't matter If you use Zen4(5) or Zen4(5)c core, you still have the whole L3 for that single core.

BTW, I question If L3 is really partitioned between standard and dense cores.

This is PHX2 and L3 doesn't look like It's separated physically.
I think It's not separated at all. Just the amount is 16MB for 12 cores.
Ok, that makes a lot of sense.
 
Reactions: TESKATLIPOKA

BorisTheBlade82

Senior member
May 1, 2020
667
1,022
136
IMHO, as long as the power budget permits having higher all core frequencies than Zen5c is able to achieve, then the MT performance of 4 Zen5 + 8 Zen5c should be identical to 12 Zen5. I would assume that to be the case for 35w Package Power and below.
I still hope that they might improve the efficiency of the small cores in the next gen which could lead to the B.l package being faster at low wattages than a full blown 12 Zen5 setup.

Regarding a unified L3 I would say that there is a strong incentive for AMD: They could save big on area if they could make both clusters use the same 16/24 Mbyte of L3 instead of having to duplicate them - especially as cache gets more and more costly with each node.
 

StefanR5R

Elite Member
Dec 10, 2016
5,633
8,107
136
In the Zen4 gen, the Zen4c have 1/2 L3 as full Zen4, and AMD still claims same IPC, with no disclaimers as you have inserted, which makes no sense. How you claim (if the cache amount is the same), when it can't be, as they're 2 unique designs, with many, but not all elements shared.
While the physical design differs, the microarchitecture — i.e. frontend, execution units and so forth, and the resulting execution width and instruction latencies — is exactly the same (from what is publicly known, and there are no indications to the contrary). And I should rather have said: "But they have the same IPC in the Zen 4 generation (if the amount of cache is the same, if memory interface is the same, or if the present amount of cache is sufficient for the considered workload)."
 

StefanR5R

Elite Member
Dec 10, 2016
5,633
8,107
136
The Zen 4 "dense" cores are only area optimized (and the major repercussion of it is their lower f_max), not power optimized.
Yes they are, literally lower Cac.
OK, what I claimed was incorrect; not sure why I said so. (Zen 4c's different physical design does shave off power consumption additionally, beyond what just a lower average operating frequency brings.) Kai Troester, Zen 4 lead architect, phrased it this way: "What we did to create Zen 4c is take the Zen 4 functional design, which keeps IPC and features identical [...] We targeted a design frequency optimized for cloud servers, and with that lower frequency we were able to significantly reduce the area of the core. And that smaller area led to tremendous increase in power efficiencies, allowing us to provide 33 % more cores in the same power envelope at high frequency and IPC." (Source: Hot Chips 2023 CPU 1 session video) — Is there something more detailed published anywhere?
 

moinmoin

Diamond Member
Jun 1, 2017
4,988
7,758
136
So will Strix Boind have an LLC ?

Edit: Typo. Correction: "Strix Point"
LLC stands for Last Level Cache. So if a chip has no L4$, L3$ is the LLC. If no L3$, L2$ is it, and so on. So since Strix Point definitely has cache the answer is yes? I'm not sure what you actually wanted to know. Maybe if LLC is shared between CPU and iGPU? (That be a no.)
 

qmech

Member
Jan 29, 2022
82
179
66
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |