Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Page 28 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Kepler_L2

Senior member
Sep 6, 2020
383
1,481
106
Not very likely I think, diminishing returns for DT especially with faster DDR5.

I also would like to see a 12 core CCD on N4P. But...the CCD is going to be large.
View attachment 77369
Totally guessing based on Mike's statement's

12C per L3

L2 1MiB --> 1.5MiB
L3 4MiB/Slice --> 3MiB/Slice = 36MiB/CCD

GMI SerDes PHY replaced with a lower power PHY with small beach head.
A ring with quadrants would probably be suitable to cut latency across 12 cores. Additionally the faster interconnect should help when snooping the other CCX.

Even with this, CCD seems a bit too big for AMD, ~100mm2 is quite big. Unless, they manage to use the perf gains from N4P and claw back some density tradeoffs to reach at least ~90mm2.
On the other hand N4/5 supply is plenty in 2024, 180k+ wpm.
Additionally, the IOD will be totally new. RDNA3+ and AIE for DT as well.

Too much unknowns but interesting for me to ponder about this, as usual.
RDNA3+ in GNR? Are you sure about that?
 

Joe NYC

Platinum Member
Jun 26, 2021
2,073
2,587
106
As far as the Mi300 layout, another way to think of it is that the picture (diagram) is intentionally incorrect about the 1 mystery chiplet between the other 2 chiplets/

The simplest solution to 9 compute chips is 6 GPGPU chips + 3 CPU CCDs, each with 8 cores of Zen 4. Equals 24 cores. Majority of internals of Zen4 CCD could be preserved.

It seems that Tom from MLID thinks that this is the case - and he has been most on the money.
 
Last edited:
Reactions: Tlh97 and Vattila

DisEnchantment

Golden Member
Mar 3, 2017
1,623
5,894
136
Going by Raphael having RDNA2 in its IOD it doesn't seem like a huge stretch.
It would make sense to do so and I was only suggesting it mostly from ease of software maintainability not for performance per se, because in 2024 RDNA2 would be 4 years old.
From the amount of transistor budget, RDNA3 CUs would be not worth it but the Radiance engine and Multimedia in RDNA3 would be much better than RDNA2.

On top of that, I believe the necessity of an AI engine on chip to run Win 12 (supposedly) features would mean AMD might think about using a new node for IOD to integrate the XDNA.
Or they could stack N5 XDNA on N6 IOD, in which case they need a new IOD. Or, they could implement XDNA on N6. AMD has been keen to support MS requirement on their chips like Pluton for instance.
 

Tigerick

Senior member
Apr 1, 2022
679
559
106
ExecuFix mentioned max cTDP of Turin at 50% more than Genoa.
I suppose we could guess a similar increase in core count assuming they didn't regress in per core TDP on a newer node. Between 128 and 144 Cores . (33% - 50% more cores)
400W max cTDP/4.2W per core for Genoa(9564), 280W max cTDP/4.4W per core for Milan(7763)/Rome(7H12), ignoring IOD for this math

I have been thinking about max TDP for a while, but I would like to calculate how much power per chiplet to have clearer picture. Both Genoa and Bergamo chiplets have max default TDP of 360W, let's assume Turin and Bergamo 5c with max cores as rumored (that would be 192 Zen 5 cores and 256 Zen 5c cores) have max TDP of 600W per socket. Below is my calculation:

RaphaelGenoaBergamoTurinBergamo 5c
CPU ArchitectureZen 4Zen 4Zen 4cZen 5Zen 5c
TSMC - Power EfficiencyN5N5N5N3E - 34%N3E - 34%
Cores Per Chiplet8816816
Max CCD21281612
L3 Cache Per Chiplet32MB32MB32MB32MB32MB
Max L3 Caches64MB384MB256MB512MB384MB
TDP65W360W360W600W600W
- IOD power (120W)240W240W480W480W
Power per chiplet30W20W30W30W40W
Power per Core3.75W2.5W1.88W3.75W3.33W

  • Both Bergamo and Bergamo 5c's L3 caches would be smaller than Genoa/Turin, you can't have both more core counts and bigger L3 cache at the same time.
  • I am expecting some clock regression and higher IPC from Zen 5 architecture. What do you think?
 
Last edited:

MadRat

Lifer
Oct 14, 1999
11,910
239
106
If workloads get switched dynamically, can you really go by a uniform chiplet number? As chiplets heat up the load increases. But when workloads get shifted around arrays of chiplets like the timing on a V8 motor, its going to be a sliding rule range depending on how warm the entire array is overall. Even the L2 space seems to be moving towards dynamic addressing, where as long as the thread stays in the same mini-cluster, it has access to the same L2 although the thread may jump processors.

This is why its not really relevant to load one core and to multiply the number to compare with dynamic loads that real users will see. There are too many variables you cannot directly measure.
 

Geddagod

Golden Member
Dec 28, 2021
1,159
1,035
106
I have been thinking about max TDP for a while, but I would like to calculate how much power per chiplet to have clearer picture. Both Genoa and Bergamo chiplets have max default TDP of 360W, let's assume Turin and Bergamo 5c with max cores as rumored (that would be 192 Zen 5 cores and 256 Zen 5c cores) have max TDP of 600W per socket. Below is my calculation:

RaphaelGenoaBergamoTurinBergamo 5c
CPU ArchitectureZen 4Zen 4Zen 4cZen 5Zen 5c
TSMC - Power EfficiencyN5N5N5N4P - 22%N3E - 34%
Cores Per Chiplet88161632
Max CCD2128128
L3 Cache Per Chiplet32MB32MB32MB64MB64MB
Max L3 Caches64MB384MB256MB768MB512MB
TDP65W360W360W600W600W
Power per chiplet30W30W45W50W75W
Power per Core3.75W3.75W2.81W3.13W2.34W

  • Both Bergamo and Bergamo 5c could only have 8 chiplets per socket cause they are having bigger die size due to double core counts. OTOH, their L3 caches would be smaller than Genoa/Turin, you can't have both more core counts and bigger L3 cache at the same time.
  • With power efficiency improvements, I believe Turin would fit in double core counts with same socket. I also expecting some clock regression and higher IPC from Zen 5 architecture. What do you think?
These calculations are a bit scuffed cuz the IO die isn't a part of the calculations, as far as I see here
 

Tigerick

Senior member
Apr 1, 2022
679
559
106
These calculations are a bit scuffed cuz the IO die isn't a part of the calculations, as far as I see here
For simple round figures, I omit the IOD's power. And with 8-12 chiplets per socket, the IOD's power figure is negligible. You could deduct around 2 watts per chiplet if you want. What I am interested is does the power increase warrants doubling core counts in upcoming Zen 5/5c chiplet?
 

DisEnchantment

Golden Member
Mar 3, 2017
1,623
5,894
136
The rumors say? 🙂 This is all out in the open with extreme detail for months..



View attachment 52061


View attachment 52062
I have been thinking about max TDP for a while, but I would like to calculate how much power per chiplet to have clearer picture. Both Genoa and Bergamo chiplets have max default TDP of 360W, let's assume Turin and Bergamo 5c with max cores as rumored (that would be 192 Zen 5 cores and 256 Zen 5c cores) have max TDP of 600W per socket. Below is my calculation:

RaphaelGenoaBergamoTurinBergamo 5c
CPU ArchitectureZen 4Zen 4Zen 4cZen 5Zen 5c
TSMC - Power EfficiencyN5N5N5N4P - 22%N3E - 34%
Cores Per Chiplet88161632
Max CCD2128128
L3 Cache Per Chiplet32MB32MB32MB64MB64MB
Max L3 Caches64MB384MB256MB768MB512MB
TDP65W360W360W600W600W
Power per chiplet30W30W45W50W75W
Power per Core3.75W3.75W2.81W3.13W2.34W

  • Both Bergamo and Bergamo 5c could only have 8 chiplets per socket cause they are having bigger die size due to double core counts. OTOH, their L3 caches would be smaller than Genoa/Turin, you can't have both more core counts and bigger L3 cache at the same time.
  • With power efficiency improvements, I believe Turin would fit in double core counts with same socket. I also expecting some clock regression and higher IPC from Zen 5 architecture. What do you think?
From Genoa thermal design guide, Max IOD power is 120W around. Which gives about 2.91W/Core Max.

From Execufix figure, Turin max TDP is 600W, assuming IOD remains unchanged at 120W that's 5W/Core.

Unless AMD goes full retard we can assume they will not use 1.7x more power staying at the same core count.

So excluding IOD power of 120W,
Turin 96C@480W=5W/Core or 1.71x that of Genoa.
Turin 128C@480W=3.75W/Core or 1.3x that of Genoa
Turin 144C@480W=3.33W/Core or 1.14x that of Genoa
Turin 160C@480W=3W/Core or 1x that of Genoa.

Turin should boost core count significantly. Either that or Turin is a fail of epic proportions considering N4P (assuming Turin is not using N3E) have a significant efficiency gain over N5.
 

BorisTheBlade82

Senior member
May 1, 2020
667
1,022
136
For simple round figures, I omit the IOD's power. And with 8-12 chiplets per socket, the IOD's power figure is negligible. You could deduct around 2 watts per chiplet if you want. What I am interested is does the power increase warrants doubling core counts in upcoming Zen 5/5c chiplet?
I'd say yes. Because of the nature of the V/f curve, half the power per core nets you quite a lot more than half the performance. With the right architecture, you basically get the more, the lower you go.
 

Tigerick

Senior member
Apr 1, 2022
679
559
106
Thanks for the IOD's TDP numbers, I am surprised by the 120W figures, guess 12-channel DDR5 support needs more power. I have updated the table with TDP after deducting IOD's power. Guess what, the new numbers are essentially double. That's mean even with double core counts, each core still getting same amount of power..... And I am more confident with AMD's planning of Turin family and to some extent, Granite Ridge...

AMD could potentially double the cores for upcoming desktop CPU, with total TDP of around 90W (40W + 40W + 10W for IOD). As for successor of Dragon Range, AMD could just use one chiplet to serve 16-core Zen 5 with TDP of 50W.
 

Geddagod

Golden Member
Dec 28, 2021
1,159
1,035
106
I'd say yes. Because of the nature of the V/f curve, half the power per core nets you quite a lot more than half the performance. With the right architecture, you basically get the more, the lower you go.
I believe AMD's recent architectures scale even better than Intel's in that regard as well. They don't continue scaling as well into higher power levels as Intel, but they have way better scaling at the low power levels.
 
Reactions: Tlh97 and Joe NYC

Geddagod

Golden Member
Dec 28, 2021
1,159
1,035
106
Thanks for the IOD's TDP numbers, I am surprised by the 120W figures, guess 12-channel DDR5 support needs more power. I have updated the table with TDP after deducting IOD's power. Guess what, the new numbers are essentially double. That's mean even with double core counts, each core still getting same amount of power..... And I am more confident with AMD's planning of Turin family and to some extent, Granite Ridge...

AMD could potentially double the cores for upcoming desktop CPU, with total TDP of around 90W (40W + 40W + 10W for IOD). As for successor of Dragon Range, AMD could just use one chiplet to serve 16-core Zen 5 with TDP of 50W.
I just find a 16 core Zen 5 chiplet extremely hard to swallow.
The core size increase from Zen 2 to Zen 3 was ~35%, and the core logic itself (without L2 and L3) was a ~40% increase.
Zen 3 was also a "grounds up architecture" much like Zen 5 is looking to be.
Now let's imagine Zen 5.
Let's assume Zen 5 scales similar to Zen 3.
Zen 4's 3.69 mm^2 x 1.3 = 4.80, x 0.95 because of 4nm, 4.56 mm^2 per core.
4.56 x 16 = 72.96 mm^2 for the cores alone. This isn't counting stuff like power banks on the core, or even the L3 for example. Essentially, just the cores and L2 of 16 zen 5 cores are the same size as the entire zen4 CCD. The L3 on zen 4 is an additional 25mm^2 too. All things considered, a 16 core zen 5 CCD would be ~115mm^2 imo. That's a >60% increase in the size of each CCD. I don't think AMD do that.
On 3nm, it looks a lot better, but I don't think 3nm on zen 5 is going to be standard anyway except maybe for Zen 5C or servers.
 
Reactions: Tlh97 and Exist50

Kocicak

Senior member
Jan 17, 2019
982
973
136
I just find a 16 core Zen 5 chiplet extremely hard to swallow. ... That's a >60% increase in the size of each CCD. I don't think AMD do that.
We have had 8 core chiplets for a while now, the core count increase is in order. 12 is also a nice number.

BTW not all cores on the chiplet need to be the same size! You could easily have 8 large and a bunch of small ones on one chiplet.

You could have one chiplet with 12 large cores and a second sort of chiplet with say 24-36 small cores. You could build very interesting CPUs for desktop PCs with combining just these two chiplet sorts in two chiplet CPU.
 
Last edited:

BorisTheBlade82

Senior member
May 1, 2020
667
1,022
136
I just find a 16 core Zen 5 chiplet extremely hard to swallow.
The core size increase from Zen 2 to Zen 3 was ~35%, and the core logic itself (without L2 and L3) was a ~40% increase.
Zen 3 was also a "grounds up architecture" much like Zen 5 is looking to be.
Now let's imagine Zen 5.
Let's assume Zen 5 scales similar to Zen 3.
Zen 4's 3.69 mm^2 x 1.3 = 4.80, x 0.95 because of 4nm, 4.56 mm^2 per core.
4.56 x 16 = 72.96 mm^2 for the cores alone. This isn't counting stuff like power banks on the core, or even the L3 for example. Essentially, just the cores and L2 of 16 zen 5 cores are the same size as the entire zen4 CCD. The L3 on zen 4 is an additional 25mm^2 too. All things considered, a 16 core zen 5 CCD would be ~115mm^2 imo. That's a >60% increase in the size of each CCD. I don't think AMD do that.
On 3nm, it looks a lot better, but I don't think 3nm on zen 5 is going to be standard anyway except maybe for Zen 5C or servers.
I am thinking more in the line of 8c Zen5 CCD + VCache and 16c CCD, 2*8 CCX Zen5c or Zen4c.
 
Reactions: Tlh97 and Geddagod

Geddagod

Golden Member
Dec 28, 2021
1,159
1,035
106
In my humble opinion, it's gonna be 8C or 12C 4nm Zen 5 CCD, 8C or 12C 4nm Zen 5 CCD + Vcache, 16C 3nm Zen 5c.
But Zen 5 is so far out still, this is just complete speculation
Plus AMD has gotten so good at preventing leaks (cough zen 4 cough)
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |