Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Page 149 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

yuri69

Senior member
Jul 16, 2013
396
641
136
"2 basic block fetch"? 10-15% IPC goal?

The high-level indeed looks like Fire Storm. However, the main question are the OoO sizes - the good 'ol Fire Storm features a 600+ "ROB" structure. Is Zen similarly fat?
 

Saylick

Diamond Member
Sep 10, 2012
3,217
6,586
136
Hmm, Have I misunderstood it. I always thought i'ts two 256-bit loads and one 256 bit store:

I got it from this slide:

Moving on, the load/store units within each CPU core have also been given a buffer enlargement. The load queue is 22% deeper, now storing 88 loads. And according to AMD, they’ve made some unspecified changes to reduce port conflicts with their L1 data cache. Otherwise the load/store throughput remains unchanged at 3 loads and 2 stores per cycle.
 

DisEnchantment

Golden Member
Mar 3, 2017
1,623
5,894
136
  • The same 12-way 48KB L1 cache as Colden Cove (hopefully without the latency penalty)
  • 8-wide dispatch (+2 vs Alder Lake and Zen 4)
  • 6 ALUs (+1 vs Alder lake +2 vs Zen 4)
  • 4 loads / 2 stores per cycle (vs 3/2 for Golden cove, 2 /1 for Zen 4)
    • - if I'm reading this right, these are 512bit (64 byte) ? That's a massive uplift from Zen 4 if true (4x the throughput in ideal AVX-512 scenarios)
The biggest unknown for me is how do they plan to feed the beast? There are no mentions of any decoder changes, surely it would be an absurd bottleneck, if not changed?
It has unified schedulers too, big difference from Z4. Curious why they are not submitting compiler patches yet for such a big change.

Some hard facts compared to usual twitter rhetoric.
Going by AMD's usually sandbagging of CPU perf gains, I would say 20% IPC more or less. Not 30%+ for sure. If it takes a clock speed hit then the gains are going to be incremental not groundbreaking as what it is made it to be. Although I would have too say the gains are low for such a massive change in architecture compared to the Z2 --> Z3 evolution.
 

DisEnchantment

Golden Member
Mar 3, 2017
1,623
5,894
136
What is interesting though is the native 16C CCX. Clark had this in mind during the 2021 interview with AT looks like.
Also Z6 has a 32C CCX
The new low power core option would be interesting if it keeps the same internals and just cut stuff here and there and fit that in a a tight power envelope.
 

adroc_thurston

Platinum Member
Jul 2, 2023
2,508
3,661
96

Joe NYC

Platinum Member
Jun 26, 2021
2,073
2,587
106

What is your take on Zen 6 architecture, of CCDs stacked on IO die and CCDs connected with silicon bridges.

Seems like if they are already stacked on top of a single IOD, there would not be much of a reason to have silicon bridge between them.

Unless, there could be multiple IO dies for additional of memory channels.
 

adroc_thurston

Platinum Member
Jul 2, 2023
2,508
3,661
96
What is your take on Zen 6 architecture
Good fun if morbidly expensive in server.
CCDs stacked on IO die
No.
Just no.
They're stack on AID in Venice.
IOD is a different thingy.p
there would not be much of a reason to have silicon bridge between them
You chain AIDs just like you do it in Navi4c.
Unless, there could be multiple IO dies for additional of memory channels.
Yes Venice looks like Granite Rapids on crack cocaine.
 

Joe NYC

Platinum Member
Jun 26, 2021
2,073
2,587
106
Good fun if morbidly expensive in server.

No.
Just no.
They're stack on AID in Venice.
IOD is a different thingy.p

You chain AIDs just like you do it in Navi4c.

Yes Venice looks like Granite Rapids on crack cocaine.

First, I forgot to say that what I posted about Zen 6 was in the latest MLID video.

But CCDs stacked on top of AIDs in Venice sounds very promising. It seems that AMD is really going all in on the MALL. in all future products.
 
Reactions: Kaluan

branch_suggestion

Senior member
Aug 4, 2023
201
435
96
Good fun if morbidly expensive in server.

No.
Just no.
They're stack on AID in Venice.
IOD is a different thingy.p

You chain AIDs just like you do it in Navi4c.

Yes Venice looks like Granite Rapids on crack cocaine.
I guess I'll come out and say what standard Venice probably looks like now.
6 AID's, each with 4 2.5D stacked 8 core CCD's, each AID connected to each other with silicon bridges.
And finally 6 IO dies, connected to the outside of each AID with fanouts, each IO die with 2 memory channels and a bunch of PCIE/CXL lanes.
192 cores, huge fully unified MALL, along with so much room for all sorts of uncore accel and other server stuff.
Venice gets rid of EPYC's last weaknesses, for a huge fee, of course. Many will probably stick with Turin initially but Venice will become a huge stick of doom over time.
 

inf64

Diamond Member
Mar 11, 2011
3,706
4,050
136
So MLID seems to think that Zen 5 will bring Zen 4-like IPC , even though the changes are way more massive than Zen 2 -> Zen 3. He also talks about "fake leakers" who claim massive IPC increases for Zen 5, but he himself leaked that RWC will have up to 26% IPC increase (turns out it is 0-1%) and Arrow Lake will be up to 40% (while we have projections from intel that put it at 5-10% total ST jump vs Raptor Lake). The guy is something special.
 

Joe NYC

Platinum Member
Jun 26, 2021
2,073
2,587
106
They have to do that to keep that perf gravy train rolling, especially in server.
HBM is no solution so MALL it is.

Well, Mi300 seems to have both, but is in a different class as far as cost.

I wonder how much performance increase there could be for a relatively modest sized caches in the MALL.

Or is it a scenario that there is a small chunk of memory that is being contested for by number of CCDs, and that small chunk of memory (being served from MALL) will then relieve a big bottleneck?
 

DisEnchantment

Golden Member
Mar 3, 2017
1,623
5,894
136
I guess I'll come out and say what standard Venice probably looks like now.
6 AID's, each with 4 2.5D stacked 8 core CCD's, each AID connected to each other with silicon bridges.
And finally 6 IO dies, connected to the outside of each AID with fanouts, each IO die with 2 memory channels and a bunch of PCIE/CXL lanes.
192 cores, huge fully unified MALL, along with so much room for all sorts of uncore accel and other server stuff.
Venice gets rid of EPYC's last weaknesses, for a huge fee, of course. Many will probably stick with Turin initially but Venice will become a huge stick of doom over time.
So what do you think of the Zen 5 leaks?

By the way, it looks like an external presentation. Not an internal presentation like what MLID is saying. Why would someone need an NDA for an internal document.
They just need access rights to the correct repos/sharepoint/team site/Confluence/etc to access the documents.

Probably AMD made this presentation to some CSP/OEM/SI etc.
 

adroc_thurston

Platinum Member
Jul 2, 2023
2,508
3,661
96
I wonder how much performance increase there could be for a relatively modest sized caches in the MALL.

Or is it a scenario that there is a small chunk of memory that is being contested for by number of CCDs, and that small chunk of memory (being served from MALL) will then relieve a big bottleneck?
Yeah it's basically a bandwidth ramp to keep 192/256 chungus cores functional.
Probably AMD made this presentation to some CSP/OEM/SI etc.
Yeah, some time ago.
This had no socket perf numbers for Turin tho.
 
Reactions: Tlh97 and Joe NYC
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |