Discussion Zen 5 Speculation (EPYC Turin and Strix Point/Granite Ridge - Ryzen 9000)

Page 291 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Timmah!

Golden Member
Jul 24, 2010
1,430
660
136
I predict there will be predictions about IPC.

Btw, it is interesting AMD is so tight-lipped not only about performance, but also about architecture.
I know. I predicted you would be inclined to do so.
Them being so tight-lipped has to mean one of 2 things: either its so good, they dont want to spoil the surprise, or so meh, they dont want to kill the hype.
Or not. :-D
 
Reactions: lightmanek

AMDK11

Senior member
Jul 15, 2019
290
197
116
That's pretty raw cope.
Solutions and very large milestones either translate into large IPC increases or you're ARM Austin/Sofia. And you suck.

that's not how semicon pathfinding even works.
the new bits of Zen5 have like, 0 relation to previous Zens, that's the point.
They're not learnings, but novel crackpot concepts of doom.
When writing about research and teaching throughout the entire period of work on Zen 1-4, I mean that not all developed solutions were implemented, but left to subsequent generations.

A microarchitecture that will result in a huge average IPC increase is not necessarily a breakthrough or milestone for AMD. AMD has no choice but to expand the core to enable further IPC gains. At the same time, they implement new techniques.
Therefore, they covered the wider core with new techniques, which in itself is a milestone for AMD.

To achieve an average IPC gain of 30%, the core would need to be expanded (transistors) by approximately 50-60%. Nothing comes for free, and I don't think new techniques will suddenly result in huge increases in IPC without a large increase in transistor counts. ALU itself does not cost that many transistors. Simply adding 50% more ALUs does not necessarily result in 25-30% higher IPC in most workloads in the first generation of a wider core (chip area trade-offs + limited design time).

I understand you have high expectations for the Zen5's mid-range IPC gain. I think you will be disappointed. If it's +25-30% higher IPC on average, I'll be very impressed.

At the moment, I don't have such high expectations, because there has practically never been such a large average jump in x86 IPC from generation to generation. If anything, it's only because formally the previous generation had weak IPC.

Until strong evidence and verification emerges, I maintain my view that Zen 5 will offer an IPC curve ranging from +1-3% to +40-50% with an average I see as a target of +15-20%.
 
Last edited:
Reactions: Darkmont

NostaSeronx

Diamond Member
Sep 18, 2011
3,687
1,222
136
Is it impossible to design power efficient CMT cores?
It is actually pretty easy. One of the closed source CN RISC-V cores coming out soon-ish will be CMT-based. However, it is based more on the older clustered patents rather than the newer dual-core processor patents. With additional stuff like Gracemont's front-end and Zen3/Zen4's FPU, etc.


It is should be closely related to XiangShan's Nanhu: https://github.com/OpenXiangShan/XiangShan Readme section -> Architecture (image)

2x Front-ends => 2x Integer Blocks + 2x Floating Blocks
ST-mode => All resources to single-thread
MT-mode => Split resources to multi-thread
 
Last edited:
Reactions: igor_kavinski

adroc_thurston

Platinum Member
Jul 2, 2023
2,501
3,650
96
The A17 Pro isn't that small... there's a reason that N3 was 15% of TSMC's revenue in their last quarter. Probably in the low 100 range.
It's 15% total for all the iphone/mac/whatever N3 revenue.
When writing about research and teaching throughout the entire period of work on Zen 1-4, I mean that not all developed solutions were implemented, but left to subsequent generations.
Most are.
That's how semis R&D works.
A microarchitecture that will result in a huge average IPC increase is not necessarily a breakthrough or milestone for AMD
Yeah it is.
AMD has no choice but to expand the core to enable further IPC gains.
Bloat in vacuum doesn't do much, see Intel.
and I don't think new techniques will suddenly result in huge increases in IPC without a large increase in transistor counts.
Jesus that's like every Zen since Zen1.
Nothing's gonna change.
I understand you have high expectations for the Zen5's mid-range IPC gain
They're not expectations, no.
because there has practically never been such a large average jump in x86 IPC from generation to generation
x86 land sucked at IPC bumps.
Look somewhere else.
Until strong evidence and verification emerges,
Turin perf numbers are kinda public domain knowledge.
 
Reactions: Darkmont

AMDK11

Senior member
Jul 15, 2019
290
197
116
It's 15% total for all the iphone/mac/whatever N3 revenue.

Most are.
That's how semis R&D works.

Yeah it is.

Bloat in vacuum doesn't do much, see Intel.

Jesus that's like every Zen since Zen1.
Nothing's gonna change.

They're not expectations, no.

x86 land sucked at IPC bumps.
Look somewhere else.

Turin perf numbers are kinda public domain knowledge.

What Epyc Turin(Zen5) tests are you referring to and what do these tests show?
 
Last edited:
Reactions: Markfw

naukkis

Senior member
Jun 5, 2002
722
610
136
It is actually pretty easy. One of the closed source CN RISC-V cores coming out soon-ish will be CMT-based.
I don't know if there's coming clustered - split register file - risc-v design but it's sure that risc-v even considered to split registers in ISA level. They choose to use non-split register model but still design their ISA clustering friendly. And it's pretty obvious that very wide high clocking - best performing - cpu designs someday will be somehow clustered designs. From old studies it's found that clustering will lower IPC compared to equal wide non-clustered design so when clustered designs come it probably will be right away about twice as wide design compared to non-clustered.

But as even risc-v doesn't support hardware clustering at ISA level clustering support needs to be done at compiler-level - so move to clustered designs can pretty much only come from some quite big player. Anyone heard what are Jim Keller latest cpu visions?

Old hp clustering studies which are still relevant - very relevant when going to very wide cpu designs. https://www.hpl.hp.com/techreports/98/HPL-98-204.pdf
 
Feb 11, 2024
58
13
41
So felt like upgrading my mini pc. 9650hk? Was looking at 8600g, then looked at am5 motherboards and they had so many with bent pins working in only single channel. I guess it's easy to damage? Suggestions for best value/power usage home media box.
 

naukkis

Senior member
Jun 5, 2002
722
610
136
aka stuff that's never gonna happen.
you're welcome.

So basic clustering example - split 32 physical registers to 4 8 register groups. Make compiler to produce 4 different code paths withing thread using those 8 register sets. Heck - risc-v even have compressed instruction format for 8-register use cases, they just need to invent some hint for hardware to expand that 4-way clustering. Simplest thing to come mind is just pack two of those 16bit instructions in 32 or 64 bit instructions.
 

adroc_thurston

Platinum Member
Jul 2, 2023
2,501
3,650
96
So basic clustering example - split 32 physical registers to 4 8 register groups. Make compiler to produce 4 different code paths withing thread using those 8 register sets. Heck - risc-v even have compressed instruction format for 8-register use cases, they just need to invent some hint for hardware to expand that 4-way clustering.
that's not happening.
 

adroc_thurston

Platinum Member
Jul 2, 2023
2,501
3,650
96
Any particular reason to not chase that quite low hanging fruit to path for very wide cpu-core designs?
Because whacko ISA design gimmicks are so 1990.
They never worked nor they will ever work.
Because you don't want to put things in your ISA that require recompilation to benefit from them.
Bingo, stuff should work ootb or it's useless.
 
Reactions: Darkmont

JustViewing

Member
Aug 17, 2022
139
239
76
So basic clustering example - split 32 physical registers to 4 8 register groups. Make compiler to produce 4 different code paths withing thread using those 8 register sets. Heck - risc-v even have compressed instruction format for 8-register use cases, they just need to invent some hint for hardware to expand that 4-way clustering. Simplest thing to come mind is just pack two of those 16bit instructions in 32 or 64 bit instructions.
Isn't that what nvidia gpus do, MIMD?
 

naukkis

Senior member
Jun 5, 2002
722
610
136
again, leave whack ISA clownage in the 90s.
We're far past that.

ARM and x86 are adding instructions for every possible side-case. Risc-v is better but even there is million proposals to add very hardware-specific instructions. And ISA does not have to support clustering - compiler can cluster every ISA as it wants( only practical with enough hardware registers) and Risc-V so far have not added any clustering-specific instructions - only have kept path for clustering open for not implementing instructions that makes clustering difficult.
 

adroc_thurston

Platinum Member
Jul 2, 2023
2,501
3,650
96
ARM and x86 are adding instructions for every possible side-case
No?
No lol.
Risc-v is better but even there is million proposals to add very hardware-specific instructions
That's worse.
It's a mess. A very bad mess.
And ISA does not have to support clustering - compiler can cluster every ISA as it wants( only practical with enough hardware registers) and Risc-V so far have not added any clustering-specific instructions - only have kept path for clustering open for not implementing instructions that makes clustering difficult
Please leave schizo esoterica in 1990s.
It's not coming back. Ever.
 
Reactions: Nothingness

naukkis

Senior member
Jun 5, 2002
722
610
136
Please leave schizo esoterica in 1990s.
It's not coming back. Ever.

It's already here. Hardware tries to split code to concurrently executable clusters - why shouldn't ISA make that as easy as possible for hardware? Epic failures happen for making things too complicated - I think that they didn't even publish any hardware with clustered execution.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |