> That's everything since Kaveri.

Oh so a current gen Zen 4 APU would have access to the full RAM for running ollama? That's great to learn. So it's better to wait for a large-RAM Halo than going for an Nvidia GPU laptop.
That's aperture. It slides.
Kaveri long predates Apple efforts.
Gotta give Timmy props for marketing their stuff like magic.
> Oh so a current gen Zen 4 APU would have access to the full RAM for running ollama? That's great to learn. So it's better to wait for a large-RAM Halo than going for an Nvidia GPU laptop.

Kinda the selling point, yes.
> Kinda the selling point, yes.

Sure, that's why I'm interested in Strix Halo; it seems it'll be the only contender to Macs for "medium"-sized models requiring 64-128 GB of RAM and good speeds. (It's either that or 4x4090, which is another product entirely. Also, inference speeds are quite good with the M1...3 GPUs and NPUs; 4x4090 would be overkill in that sense.)
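To put that 64-128 GB figure in context, here's a back-of-the-envelope sketch (my own arithmetic, not from the thread; the quantization level and the ~20% runtime overhead are assumptions):

```python
# Rough resident size of a quantized LLM: weights plus an assumed ~20%
# for KV cache and activations. Illustration only, not vendor numbers.

def model_footprint_gb(n_params: float, bits_per_weight: float,
                       overhead: float = 1.2) -> float:
    """Approximate memory footprint in GB."""
    weight_bytes = n_params * bits_per_weight / 8
    return weight_bytes * overhead / 1e9

# A 70B-parameter model at 4-bit quantization:
print(round(model_footprint_gb(70e9, 4), 1))   # 42.0 GB -> fits in 64 GB
# The same model unquantized at 16-bit:
print(round(model_footprint_gb(70e9, 16), 1))  # 168.0 GB -> out of reach even at 128 GB
```

Under those assumptions a 4-bit 70B model sits comfortably in 64 GB of unified memory, which is roughly the class of model such machines are aimed at.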
Bandwidth is lacking though, since it's only a 256-bit LPDDR5X setup.
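For rough context on the bandwidth point, a quick peak-bandwidth calculation (my own arithmetic; the 8000 MT/s LPDDR5X rate and the 512-bit comparison bus are assumptions for illustration):

```python
# Theoretical peak DRAM bandwidth: (bus width in bytes) x (transfers per second).
# Both configurations below are assumed examples, not confirmed specs.

def peak_bw_gbps(bus_width_bits: int, transfer_rate_mtps: int) -> float:
    """Peak bandwidth in GB/s for a given bus width and transfer rate."""
    return bus_width_bits / 8 * transfer_rate_mtps / 1000

print(peak_bw_gbps(256, 8000))  # 256.0 GB/s -- a 256-bit LPDDR5X-8000 setup
print(peak_bw_gbps(512, 6400))  # 409.6 GB/s -- a 512-bit bus at LPDDR5-6400
```

Since token generation is largely bandwidth-bound, that gap matters more than raw compute for local inference.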
> That would be interesting, especially since we have not had a consumer product use an X node.

I think this is also an interesting tidbit:

TSMC Details N4X Process for HPC: Extreme Performance at Minimum Leakage (www.anandtech.com)

> While N4X offers significant performance enhancements compared to N4 and N4P, it continues to use the same SRAM, standard I/O, and other IPs as N4P, which enables chip designers to migrate their designs to N4X easily and cost-effectively. Meanwhile, keeping in mind N4X's IP compatibility with N4P, it is logical to expect the transistor density of N4X to be more or less in line with that of N4P. Though given the focus of this technology, expect chip designers to use it to get extreme performance rather than maximum transistor density and small chip dimensions.
> In particular, N4X adds four new devices on top of the N4P device offerings, including ultra-low-voltage transistors (uLVT) for applications that need to be very efficient, and extremely-low-threshold-voltage transistors (eLVT) for applications that need to work at high clocks. For example, N4X uLVT with overdrive offers 21% lower power at the same speed when compared to N4P eLVT, whereas N4X eLVT in OD offers 6% higher speed for critical paths when compared to N4P eLVT.
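The quoted percentages translate into simple arithmetic; here's a toy example (my own; the 10 W and 5.0 GHz baselines are invented for illustration):

```python
# Toy arithmetic on the quoted N4X device figures. Baselines are made up.

def n4x_ulvt_power(n4p_elvt_power_w: float) -> float:
    """N4X uLVT with overdrive: 21% lower power than N4P eLVT at the same speed."""
    return n4p_elvt_power_w * (1 - 0.21)

def n4x_elvt_speed(n4p_elvt_clock_ghz: float) -> float:
    """N4X eLVT in OD: 6% higher speed on critical paths than N4P eLVT."""
    return n4p_elvt_clock_ghz * 1.06

print(round(n4x_ulvt_power(10.0), 2))  # 7.9 W for a block that drew 10 W on N4P eLVT
print(round(n4x_elvt_speed(5.0), 2))   # 5.3 GHz where N4P eLVT topped out at 5.0
```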
> RAM has to be temporarily dedicated to the GPU, unlike Macs? (Could be misunderstanding something?)

That's not how OS-managed memory allocation works!
> Now that AMD has separated Zen 5 Client into Classic and Dense, they can customize Classic to use eLVT for higher clocks while Dense use uLVT for higher efficiency at lower clocks.

Oohhhh, you're way smarter than the average poster here.
> Zen3 literally piled on the port count.
> There's a ton more EUs, just more specialized ones.

Zen 3 added IO, it didn't add ALUs. Is there a reason why you left that out of their quote? More IO is a wider design, but it isn't the same as more execution units. Neither of you is wrong.
> What did I just read...?

Their point is pretty simple, so idk why people are being overly dramatic.
> It makes even more sense with Adroc's remarks that the delta in battery life between Z5 Mobile and Snap Elite isn't as big.

It already made sense considering everyone knows about Zen 4c, and it would be really unwise to assume AMD aren't going to use every tool TSMC has to make Zen 5c as efficient as possible. No company would leave tech sitting on the table when it's available to them and their competitors are also using it.
> That's not how OS-managed memory allocation works!

...I'm basing this off what I think I know about how Windows allocates memory between the CPU and iGPU, but I should've asked others about this instead.
> Now that AMD has separated Zen 5 Client into Classic and Dense, they can customize Classic to use eLVT for higher clocks while Dense use uLVT for higher efficiency at lower clocks.

Zen6 takes this even further, if you can figure out the one area AMD is not entirely leading in.
> Zen3 literally piled on the port count.

But they didn't increase the PRF read/write port count, which is the fundamental limit of execution concurrency within the core. That's why I say Zen 1 through Zen 4 are fundamentally the same execution width; with the work in Zen 3 they brought average port usage closer to the peak.
> From generation to generation, Zen was larger in terms of the logic used and the number of transistors used for it. Zen 3 compared to Zen 2 is generally a redesign of the control logic and the algorithms contained in it, and an expansion of about 14%. This proves that the logic in Zen 2 was not designed optimally for the amount of resources.

Yes, that is true of any core, because it takes thousands of engineering hours per 0.1% of performance, so it is truly a function of time. You have to bank what you've got and ship at some point. This is true of any engineering exercise.
> Zen6 takes this even further, if you can figure out the one area AMD is not entirely leading in.

In what way does Zen6 take it further? And are such details already known about Zen6?
> I haven't been following this thread, so I have a quick question for the forum members. I'm looking to acquire an HP Z6 G5 7995WX workstation, but before I do I'm curious when the Zen 5 version (128 cores?) of the Threadripper chip is expected. Thanks!

Almost 2 years if not more, I'm afraid. Best bets are a used Genoa or a regular Zen 4 Threadripper.
> Almost 2 years if not more, I'm afraid. Best bets are a used Genoa or a regular Zen 4 Threadripper.

Aren't the Zen 5 EPYCs coming later this year? Since the platform doesn't change(?) from Zen 4 to Zen 5, I was hoping the next Threadripper would be released early to mid next year. I can't wait even that long anyways. Thanks!
> Zen6 takes this even further, if you can figure out the one area AMD is not entirely leading in.

Just a total dart throw, but one thing it would be nice to see AMD do is match Intel in reducing idle power consumption via the use of a low-power core. I recall there being an AMD patent a ways back where the cache of a big core was shared with a small core, or something to that effect, where in essence the workload could be passed between the cores without much penalty.

If Zen 6's IOD uses N4X, then you could make that small core pretty power-efficient by using those uLVT transistors and shutting down the compute die. Plus, the use of Infinity Link or whatever AMD calls it to connect the compute die to the IOD may allow the small-core approach to work. Given that Adroc hinted that Zen 6 desktop is more mobile-like than ever, this idea is not too far-fetched in my opinion.
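The hand-off idea can be sketched as a toy threshold scheduler (purely my own illustration; the thresholds and the little core's capacity are invented, not from the patent):

```python
# Toy big.LITTLE scheduler: light work parks on the low-power core, heavy work
# migrates to the big core. The shared cache is what would make migration
# cheap; this sketch only models the placement decision.

LITTLE_CAP = 0.3   # assumed little-core capacity as a fraction of the big core
MIGRATE_UP = 0.8   # assumed utilization threshold for moving up

def pick_core(utilization: float, current: str) -> str:
    """Return which core should run the workload next."""
    if current == "little" and utilization > MIGRATE_UP:
        return "big"      # workload outgrew the little core
    if current == "big" and utilization < LITTLE_CAP:
        return "little"   # light enough to save idle power
    return current        # stay put to avoid needless migrations

core = "little"
for load in [0.1, 0.2, 0.95, 0.9, 0.2, 0.1]:
    core = pick_core(load, core)
    print(f"load={load} -> {core}")
```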
> Aren't the Zen 5 EPYCs coming later this year? Since the platform doesn't change(?) from Zen 4 to Zen 5, I was hoping the next Threadripper would be released early to mid next year. I can't wait even that long anyways. Thanks!

If you can get one then sure, Turin is always the first choice.
> Aren't the Zen 5 EPYCs coming later this year? Since the platform doesn't change(?) from Zen 4 to Zen 5, I was hoping the next Threadripper would be released early to mid next year. I can't wait even that long anyways. Thanks!

I wouldn't be holding my breath.
> ...I recall there being an AMD patent a ways back where they had the cache of a big core be shared with a small core or something to that effect where in essence the workload could be passed between the cores without much penalty.

Edit: found the article.

AMD patents a task transition method between BIG and LITTLE processors - VideoCardz.com
> This patent reminds me a lot of A10 Fusion, where the little cores ended up being almost transparent to software, because if a workload was heavy enough it would transition from the little cores to the big cores and primarily use those instead (not sure how that was determined by the OS/hardware, but I wouldn't be surprised if it relied upon the types of instructions run (like in that patent) or the workload duration).

You forgot to mention the funniest part: A10 had the LITTLEs grow off the bigs like a cancerous tumor.
> But they didn't increase the PRF read/write port count, which is the fundamental limit of execution concurrency within the core. That's why I say Zen 1 through Zen 4 are fundamentally the same execution width; with the work in Zen 3 they brought average port usage closer to the peak.

> No, what they did do with Zen3 is to use macro-ops instead of micro-ops - they reduced PRF usage by letting macro-ops transfer data directly between execution units, so they could increase concurrency without increasing PRF throughput.

They did do what I said; that's why they had dedicated branch execution units. I'm also not 100% sure about some people's thinking on Zen 3. AMD have had mop and op fusion for a long time; I wouldn't be so quick to assume that mop and op fusion didn't exist in Zen 1/2, because they existed as far back as Bulldozer.
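The macro-op point can be illustrated with a toy count of register-file traffic (my own model, nothing like AMD's actual pipeline): fusing a compare with its dependent branch removes the intermediate flags round-trip through the PRF.

```python
# Toy PRF-traffic model. Each instruction is (op, prf_writes, prf_reads).
# With fusion enabled, an adjacent cmp+jcc pair becomes one macro-op whose
# flags pass directly between execution units instead of through the PRF.

def prf_traffic(instrs, fuse=False):
    """Total PRF accesses (writes + reads) for an instruction stream."""
    writes = reads = 0
    i = 0
    while i < len(instrs):
        op, w, r = instrs[i]
        if fuse and op == "cmp" and i + 1 < len(instrs) and instrs[i + 1][0] == "jcc":
            reads += r   # the fused op still reads its source operands
            i += 2       # consume both halves; no flags write, no flags read
            continue
        writes += w
        reads += r
        i += 1
    return writes + reads

stream = [("cmp", 1, 2), ("jcc", 0, 1)]   # cmp writes flags, jcc reads them
print(prf_traffic(stream))                # 4 PRF accesses unfused
print(prf_traffic(stream, fuse=True))     # 2 PRF accesses with fusion
```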