Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Page 447 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Mopetar

Diamond Member
Jan 31, 2011
7,961
6,312
136
Clockspeed, yields, node? 8c could be on an older node than 16c and 32c so would be quite a bit cheaper. V-cache compatibility.

Those are just off of the top of my head.

The biggest is cost. 8C is a smaller chiplet and there are plenty of consumers who don't even need that many cores.

I think 3D V-Cache is everywhere.

Doubtful unless they've fixed a lot of the performance quirks and are moving the L3 off the base die. V-cache does nothing for most consumer workloads outside of gaming or isn't worth the loss in clock speed it currently brings. Some professional users certainly benefit, but it's not worth the added cost to make everyone pay.
 

Joe NYC

Platinum Member
Jun 26, 2021
2,157
2,738
106
Why use 2x8c instead of 1x16c, for the 16c client CPU variants of Zen6?

Assuming there will be 16c CCD of Zen6 available anyway, why not use them on client too? Also opens up for 2x16c on client CPUs, and 1x16c + 1x8c.

Because of 95% of units sold on client will have a single 8 core CCD.

The server CCDs may be optimized for different settings, and also, may not be at all compatible from client style packaging.

So it will still be the most efficient to serve the 95% with single CCD and the remaining 5% with dual 8c CCDs.
 
Reactions: Tlh97

Ajay

Lifer
Jan 8, 2001
15,783
7,995
136
Zen5 is already old news. Leaks about Zen6 starting to appear now:


View attachment 99257
View attachment 99258
LOL! Zen5 isn't even released yet.

Here's another idea. Start an New Zen6 thread. I'd start it, but I don't know a damn thing.
 

StefanR5R

Elite Member
Dec 10, 2016
5,633
8,107
136
Going to 16 cores on a single ccx would reduce core to core latency,
and energy consumption of intra-CCX traffic is lower than that of inter-CCX traffic too. But the workloads in which this matters are rarely seen on client computing devices.

but they will probably need upgrade their ring interconnect to either a 4x4 tile interconnect or something better.
which, in turn, is bound to increase the cache's power consumption, or cache performance will regress (for some access patterns).
 
Jul 28, 2023
176
614
96
The thread has cooled down a little.
So what are everyone's final guesses on the perf increase now that Computex is around the corner? I think pretty much everything that could leak already did at this point, and we're not getting any more info until the announcement.

My guess is +21% IPC, +200MHz for top desktop chip, take it or leave it. Zen 3 reloaded basically.

edit: SIR 2017.
 
Last edited:

Goop_reformed

Member
Sep 23, 2023
191
259
96
The thread has cooled down a little.
So what are everyone's final guesses on the perf increase now that Computex is around the corner? I think pretty much everything that could leak already did at this point, and we're not getting any more info until the announcement.

My guess is +21% IPC, +200MHz for top desktop chip, take it or leave it. Zen 3 reloaded basically.
25%+ st uplift. Higher in synthetic, lower in real world usage. 20% for multi.
 
Reactions: spursindonesia

Abwx

Lifer
Apr 2, 2011
11,103
3,780
136
The thread has cooled down a little.
So what are everyone's final guesses on the perf increase now that Computex is around the corner? I think pretty much everything that could leak already did at this point, and we're not getting any more info until the announcement.

My guess is +21% IPC, +200MHz for top desktop chip, take it or leave it. Zen 3 reloaded basically.

edit: SIR 2017.
Not easy to draw conclusive estimations out of half baked GB numbers with an ES
or from the Blender bench in unknown conditions.

I will stand with my 22.47% minimal average IPC extracted out of nebulous statistics based on said early numbers and from the alleged uarch evolution.
 

Philste

Member
Oct 13, 2023
95
194
66
No matter if IPC is the right word or not:

ZEN5: 22-23% IPC, 5.8 GHz for top CPU in Desktop stack

Lion Cove: 17-18% IPC, 5.5GHz max

So my guess is ZEN5 has 10% better singlethread than top ARL and arrived a quarter earlier (or more). ARL will be competitive in multi tho and maybe even more efficient in multi.

Gaming wise, I think both new products will fail to beat their predecessors. ZEN5 because X3D is just too strong, ARL because of horrible Cache characteristics.
 

adroc_thurston

Platinum Member
Jul 2, 2023
2,818
4,155
96

Joe NYC

Platinum Member
Jun 26, 2021
2,157
2,738
106
and energy consumption of intra-CCX traffic is lower than that of inter-CCX traffic too. But the workloads in which this matters are rarely seen on client computing devices.

Yeah, I am really curious about the use case where this matter, and I can't think of a good one.

The real issue with 2 CCDs (in client) is thread jumping from 1st CCD to 2nd CCD, while all it's cached data is on 1st CCD.

It seems to me that this case can be mittigated by some sort of algorithm that would have awareness of content of both CCD caches, and being able to copy content of L3s between CCDs.

Also, the new packaging for Strix Halo could allow direct (and fast) CCD to CCD communication.
 

adroc_thurston

Platinum Member
Jul 2, 2023
2,818
4,155
96
Jul 28, 2023
176
614
96
Vidya too.
What effectively is a dingus quad-core won a ton of gaming laptops.

HC in general is insanely stacked this year.
Vidya benefits a lot from 3D-cache as well, so not sure if front-end improvements will be big enough to offset that and add some performance on top.
this is far bigger and meaner than Zen3.
Well, wider does not always translate to large perf improvements (*cough* A17 Pro P-cores vs Everest), and leaked memebench numbers, while good, weren't THAT good (obviously those were from ES chips that have all sorts of immature firmware and locks).
 

adroc_thurston

Platinum Member
Jul 2, 2023
2,818
4,155
96
so not sure if front-end improvements will be big enough to offset that and add some performance on top.
of course. child's play.
Well, wider does not always translate to large perf improvements
This is AMD we're talking about.
and leaked memebench numbers, while good, weren't THAT good (obviously those were from ES chips that have all sorts of immature firmware and locks).
cinememe isn't a relevant workload. basically doesn't exist as a target for uarch people.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |