Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Page 49 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Kaluan

Senior member
Jan 4, 2022
500
1,071
96
I wonder if Strix Point will have only 1 monolithic die (up to 4C+8c/16CU), or split into 2 like PHX. Same for Grey Hawk refresh.

Those, along with PHX2 and Mendocino may fit into fanless designs.

RDNA3+ is allegedly implementing the scalar arithmetic logic units planned for RDNA4 (Kepler said), along with reworks of whatever got busted in the current design (my guess), so likely will clock much higher/will be more efficient and have less workarounds/bottlenecks in feeding data through the graphics pipeline.
 
Last edited:
Reactions: Tlh97 and Joe NYC

Anhiel

Member
May 12, 2022
64
18
41
At that size it makes no sense to use chiplets. They'll need another chiplet design for lower end products. I'm guessing that work will go along with the embedded lineup.

Anyhow, the more interesting question is whether the IO die for Strix Halo is made of two chiplets each with 20CU and 128-bit memory controller/connections.

Obviously, the number of different chiplets has increased for the Zen5 generation. That's gonna cost a lot and with a larger and complex organization/package production overhead.
I just hope pricing won't be too bad for Strix Halo SKUs or it won't sell much better than Apple's M-MAX, Ultras etc. Apple still has a bandwidth advantage here.

And since it seems MTL might have 2 Crestmont cores in their SOC, Intel has stepped up their idle power usage advantage. In these new leaks Zen5c/d cores still don't seem to offer any real power saving advantage beyond lowering CLK. Well, the 3% savings I get could as well be a rounding error here. So it seems they only offer an area advantage. Good enough for mobile and high core density. But without going to smaller process node I've doubts about their competitiveness against Intel's e-cores and ARM.
 

BorisTheBlade82

Senior member
May 1, 2020
667
1,022
136
I wonder if Strix Point will have only 1 monolithic die (up to 4C+8c/16CU), or split into 2 like PHX. Same for Grey Hawk refresh.
How come, you think that PHX is not monolithic? Might you have confused that with Dragon Range (Raphael, but for mobile)? And did you mean Hawk Point, which basically is to Phoenix Point what Lucienne was to Renoir?
 

Tigerick

Senior member
Apr 1, 2022
683
565
106
Guys, I have compiled the upcoming mobile APU from AMD next year. Man, AMD is really targeting full range of notebook segments and price points. There are six new models, I am trying to list main specs and launching dates. Again, this is based on leaks from RGT and MLID, and some speculations and corrections from mine. So any insights please let me know, I will update the table accordingly.

NameModelLaunch DateNodeCPU coresL3 CacheMemory LPDDR5xMemory BWGPUALUICAIE (TOPS)TDP
EscherR3 80502025N4P2xZen5 + 4xZen5c12 MB64-bit 853368 GB/sRDNA3+ 4 CU 256SP512NA?15W
STXR7 8050Q3 2024N4P4xZen5 + 8xZen5c24 MB128-bit 8533136 GB/sRDNA3+ 8CU 512SP1024NA2015-45W
STX Halo/ SarlakR9 8050Q4 2024N4P + N4P8xZen5 + 8xZen5c40 MB256-bit 8533272 GB/sRDNA3+ 20CU 1280SP256032MB4055W+
6xZen5 + 8xZen5c32 MB192-bit 8533204 GB/sRDNA3+ 16CU
1024SP
2048??
Fire RangeR7&9 8055Q3 2024N4P x 316xZen564 MB128-bitRDNA3+ 2CU 128SP256NA2055W+
 
Last edited:
Reactions: Joe NYC

soresu

Platinum Member
Dec 19, 2014
2,813
2,018
136
Guys, I have compiled the upcoming mobile APU from AMD next year. Man, AMD is really targeting full range of notebook segments and price points. There are six new models, I am trying to list main specs and launching dates. Again, this is based on leaks from RGT and MLID, and some speculations and corrections from mine. So any insights please let me know, I will update the table accordingly.

NameModelLaunch DateNodeCPU coresL3 CacheMemory LPDDR5xMemory BWGPUALUICTDP
PHX2R5
7040
Q3 2023N4 115mm22xZen4 + 2xZen4c4+4=8MB64-bit 750060 GB/sRDNA3 4CU 256SP256NA15W
STX3R5 8050Q3 2024N4P2xZen5 + 4xZen5c12 MB64-bit 853368 GB/sRDNA3+ 4CU 256SP512NA15W
PHX+
Hawk Point
R7&9 8040Q1 2024N4P8xZen416 MB128-bit 8533136 GB/sRDNA3+ 12CU 768SP1536NA15-45W
STX2R7 8050Q3 2024N4P4xZen5 + 8xZen5c24 MB128-bit 8533136 GB/sRDNA3+ 8CU 512SP1024NA15-45W
STX1 HaloR9 8050Q3 2024N4P + N4P8xZen5 + 8xZen5c32 MB256-bit 8533272 GB/sRDNA3+ 20CU 1280SP256032MB25-120W
6xZen5 + 8xZen5c28 MB128-bit 8533136 GB/sRDNA3+ 10CU 640SP128032MB
Fire RangeR7&9 8055Q3 2024N4P x 316xZen564 MB128-bitRDNA3+ 2CU 128SP256NA55W+
I have my doubts about a 20 CU APU needing 120W TDP when there is a 12 CU APU in that table that only draws a max of 45W.

Unless Zen5 needs a dramatic increase in power of course 🤔
 
Last edited:

Abwx

Lifer
Apr 2, 2011
11,103
3,780
136
I have my doubts

I have my doubts about a 20 CU APU needing 120W TDP when there is a 12 CU APU in that table that only draws a max of 45W.

Unless Zen5 needs a dramatic increase in power of course 🤔
If CPU + GPU use 45W with 12CUs then a 20CUs APU will use at most 70W, and that s at same node and uarch.
 
Reactions: Tlh97 and soresu

moinmoin

Diamond Member
Jun 1, 2017
4,989
7,758
136
Needs...

Through all the fanless configuration talk here before I kept scratching my head since with my 6c6t 4500U the experience is that it runs at 3-4W whole system most of the time, with a TDP limit of 9W making only a difference in longer running heavy compute tasks (especially iGPU). So from that pov I don't see AMD not having a product for low TDP use, but no manufacturer using existing chips and limiting them to such low TDPs.

Instead the default TDP set seems to be on a rise in laptops as well now.
 

Anhiel

Member
May 12, 2022
64
18
41
I have my doubts about a 20 CU APU needing 120W TDP when there is a 12 CU APU in that table that only draws a max of 45W.

Unless Zen5 needs a dramatic increase in power of course 🤔

If CPU + GPU use 45W with 12CUs then a 20CUs APU will use at most 70W, and that s at same node and uarch.

The iGPU should consume around 37.5W. The uncertainty here is the RDNA4 modification.
Supposely, the cores are clocked similar to Ryzen 9 7940HS: with a boost of 5.2GHz and all-core ~4.9GHz then the 16 cores should consume around 2x40W. So 120W looks legit. (compared to 7950X we can see here how horribly bad TDP gets with higher clocks from here on out)
 
Reactions: lightmanek

Saylick

Diamond Member
Sep 10, 2012
3,271
6,755
136
lol at the literal sketch of a ladder on top of that AMD slide.

Ultimately, it like AMD is moving from a ring bus to a mesh of some sort? That way, data from one core doesn't have to traverse all the way around the ends of the loop to get to a core that is directly across from it.
 

Joe NYC

Platinum Member
Jun 26, 2021
2,153
2,735
106
And here we go again:


Is there any news from his own sources of just what is already know him plus his projections?

2 takeaways for me:
- the new bus, that can scale to 16 unified cores in a single CCD.
- larger L2 without latency penalty being possible in future cores

Two of those combined lead me to believe that AMD plans on dropping L3 entirely from future generations of processors. Unknown if it will be Zen 5 or Zen 6.
 

Abwx

Lifer
Apr 2, 2011
11,103
3,780
136
The iGPU should consume around 37.5W. The uncertainty here is the RDNA4 modification.
Supposely, the cores are clocked similar to Ryzen 9 7940HS: with a boost of 5.2GHz and all-core ~4.9GHz then the 16 cores should consume around 2x40W. So 120W looks legit. (compared to 7950X we can see here how horribly bad TDP gets with higher clocks from here on out)

At 45W the CPU and uncore use 10W, 35W is left for the GPU, if you increase the CU count by 67% from 12 to 20 CUs then GPU power will increase accordingly to 35 x 1.67 = 58W.
 

Kepler_L2

Senior member
Sep 6, 2020
425
1,734
106
Guys, I have compiled the upcoming mobile APU from AMD next year. Man, AMD is really targeting full range of notebook segments and price points. There are six new models, I am trying to list main specs and launching dates. Again, this is based on leaks from RGT and MLID, and some speculations and corrections from mine. So any insights please let me know, I will update the table accordingly.

NameModelLaunch DateNodeCPU coresL3 CacheMemory LPDDR5xMemory BWGPUALUICTDP
PHX2R5
7040
Q3 2023N4 115mm22xZen4 + 2xZen4c4+4=8MB64-bit 750060 GB/sRDNA3 4CU 256SP256NA15W
STX3R5 8050Q3 2024N4P2xZen5 + 4xZen5c12 MB64-bit 853368 GB/sRDNA3+ 4CU 256SP512NA15W
PHX+
Hawk Point
R7&9 8040Q1 2024N4P8xZen416 MB128-bit 8533136 GB/sRDNA3+ 12CU 768SP1536NA15-45W
STX2R7 8050Q3 2024N4P4xZen5 + 8xZen5c24 MB128-bit 8533136 GB/sRDNA3+ 8CU 512SP1024NA15-45W
STX1 HaloR9 8050Q3 2024N4P + N4P8xZen5 + 8xZen5c32 MB256-bit 8533272 GB/sRDNA3+ 20CU 1280SP256032MB25-120W
6xZen5 + 8xZen5c28 MB128-bit 8533136 GB/sRDNA3+ 10CU 640SP128032MB
Fire RangeR7&9 8055Q3 2024N4P x 316xZen564 MB128-bitRDNA3+ 2CU 128SP256NA55W+
AFAIK Strix Point (4+8, 16 CU, 4nm monolithic) is STX1, it was originally 8+4 and 3nm but got redefined due to TSMC issues. Strix Halo is (at least internally) called SAR(Sarlak) and STX3 has been cancelled entirely.
 
Reactions: Tigerick

itsmydamnation

Platinum Member
Feb 6, 2011
2,821
3,313
136
2 takeaways for me:
- the new bus, that can scale to 16 unified cores in a single CCD.
- larger L2 without latency penalty being possible in future cores

Two of those combined lead me to believe that AMD plans on dropping L3 entirely from future generations of processors. Unknown if it will be Zen 5 or Zen 6.
how do you do cache coherency .

Why would you drop L3 when you could do what IBM has done.

You people make 0 sense all the damn time!
 

Joe NYC

Platinum Member
Jun 26, 2021
2,153
2,735
106
how do you do cache coherency .

Why would you drop L3 when you could do what IBM has done.

You people make 0 sense all the damn time!

I meant drop L3 from main CCD die. (sorry, about not spelling it out).

More cores (up to 16) crowding the CCD and L2 size decreasing the need for L3 would seem like a good way to push L3 out of the CCD and maybe only into the V-Cache or some other level of cache, such as system level cache.

BTW, those large L2s, if they acted together the same as what IBM outlined would work even better then having L3 present on the die.
 
Reactions: Tlh97

Anhiel

Member
May 12, 2022
64
18
41
At 45W the CPU and uncore use 10W, 35W is left for the GPU, if you increase the CU count by 67% from 12 to 20 CUs then GPU power will increase accordingly to 35 x 1.67 = 58W.
Dunno which SKU you refer to but I'm pretty sure your information is wrong for many reasons.
One we are looking at a N5 to N4 transition, two Radeon 780M N5 12CU has 15W TPD. These are the facts. Look up techpowerup.
Assuming the minor changes from RDNA4 improvement and assuming the basic 16CU eats up the little N4 30% power savings.. this gives 2.5x for 40CU, hence, my 37.5W.
What's been left out here is the 7? core AI engine we don't know anything about. I suppose it can be dropped from ordinary usage scenario.
 

Anhiel

Member
May 12, 2022
64
18
41
That cache "leak" above has so many errors especially when it contradicts the server source referred to.
I meant drop L3 from main CCD die. (sorry, about not spelling it out).

More cores (up to 16) crowding the CCD and L2 size decreasing the need for L3 would seem like a good way to push L3 out of the CCD and maybe only into the V-Cache or some other level of cache, such as system level cache.

BTW, those large L2s, if they acted together the same as what IBM outlined would work even better then having L3 present on the die.
Doesn't make a lot of sense at this "early" point in time, not to mention the voltage sensitivity of having L3 as a V-cache die on top.
The recent burn out CPUs are prove of the current unsafe design/control.

I don't think L2 has grown significant as suggested for the simple reason that the die size has to be kept at the same size as before as much as possible despite node shrinks that mostly used to add other stuff (5-wide expansion) or squeeze the c/d variants. Not to mention SRAM doesn't shrink well.

The interesting rumor bit is that L2 is somehow unified. I can only guess it's somewhat similar to IBM's Z15 virtual cache or moving toward that kind of solution. I'm guessing it's more like keeping core-bound data at closer L2 cells while larger data at more distant & area sharing cells. It's still conjecture at this point. Hence, depending on how you view it the L2 has grown even if it's not.
 

Tigerick

Senior member
Apr 1, 2022
683
565
106
I have my doubts about a 20 CU APU needing 120W TDP when there is a 12 CU APU in that table that only draws a max of 45W.

Unless Zen5 needs a dramatic increase in power of course 🤔
Yeah, 120W does seem high, even 16-core Zen4 7945HX only requires 75W. Besides high clocks, reasons that I can think of are additional 128-bit memory bus (that would require 32-bit x 4) and huge amount of caches (FYI, M2 Pro has total L2+L3 cache of 60MB, STX Halo has 64MB total cache excluding L2 cache of each CPU cores, I would assume at least 16MB L2 caches, so total cache of Halo would be at least 80MB). That's why I don't believe of 96MB rumors unless Zen5/5c has 2MB of L2 cache each...
 
Last edited:
Reactions: Tlh97
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |