Question Zen 6 Speculation Thread

Page 105 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Joe NYC

Platinum Member
Jun 26, 2021
2,988
4,367
106
Here is a dumb question about the "backside power delivery"
- what is the back side?
- why not call it motherboard side and heatsink side?
- why not call it silicon transistor side vs. wire side (before BSPD)
- what happens to this terminology in a flip chip? They say the "Up" and "Down" are flipped, without ever explaining what is "Up" and what is "Down"

Here comes the real question:
- if the "backside power delivery" means power on the side away from the motherboard, how does this power get there? It would make sense for the power to be on the motherboard side, but then, I am guessing, multiple layers of metal connections, if on heatsink side, would present some thermal resistance...

Edit: the context to it is how this can work with V-Cache
 
Last edited:

Hitman928

Diamond Member
Apr 15, 2012
6,615
12,131
136
Here is a dumb question about the "backside power delivery"
- what is the back side?
- why not call it motherboard side and heatsink side?
- why not call it silicon transistor side vs. wire side (before BSPD)
- what happens to this terminology in a flip chip? They say the "Up" and "Down" are flipped, without ever explaining what is "Up" and what is "Down"

Here comes the real question:
- if the "backside power delivery" means power on the side away from the motherboard, how does this power get there? It would make sense for the power to be on the motherboard side, but then, I am guessing, multiple layers of metal connections, if on heatsink side, would present some thermal resistance...

Edit: the context to it is how this can work with V-Cache

- what is the back side?

Backside is the side opposite the metal layers when looking at a vertical cross-section. Front side has the metal layers and is traditionally where the pads are.

- why not call it motherboard side and heatsink side?

Because you can have flip chip and non flip chip designs, so this wouldn’t help you know what’s what in the die.

- why not call it silicon transistor side vs. wire side (before BSPD)

Too cumbersome.

- what happens to this terminology in a flip chip? They say the "Up" and "Down" are flipped, without ever explaining what is "Up" and what is "Down"

Nothing happens. Backside is still backside, it’s just now flip chip orientation so the backside is now the “top” for the purpose of this discussion.

- if the "backside power delivery" means power on the side away from the motherboard, how does this power get there?

You either use TSVs to go through the logic die to the BSPDN die and then back to the backside of the logic die, or the logic die is no longer flip chip and sits “on top” of the BSPDN die.

-Edit: the context to it is how this can work with V-Cache

Not sure if AMD is really planning to use BSPDN in the near future, but maybe the V-cache die and BSPDN die are one die in this scenario. Or you get into cache filled interposers or something. Not sure, haven’t really thought about it.
 

511

Golden Member
Jul 12, 2024
1,899
1,705
106
Here is a dumb question about the "backside power delivery"
- what is the back side?
- why not call it motherboard side and heatsink side?
- why not call it silicon transistor side vs. wire side (before BSPD)
- what happens to this terminology in a flip chip? They say the "Up" and "Down" are flipped, without ever explaining what is "Up" and what is "Down"

Here comes the real question:
- if the "backside power delivery" means power on the side away from the motherboard, how does this power get there? It would make sense for the power to be on the motherboard side, but then, I am guessing, multiple layers of metal connections, if on heatsink side, would present some thermal resistance...

Edit: the context to it is how this can work with V-Cache
This video explains it nicely
 

Mopetar

Diamond Member
Jan 31, 2011
8,350
7,420
136
That s still too high, AMD never used the other processes at this point of the curves, rather when it reach +2Q, wich amount to one year from now plus the necessary time to have enough inventory, so a total of roughly 6 quarters, wich lead us to early Q4 2026 at best.

There's no reason they couldn't. Their chiplets are smaller than Apple's A-series SoCs and AMD sells a lot of six core CPUs so even if the defect density is still high they're not all that bad off since the likelihood of dead silicon is fairly low.

TSMC has to sell the capacity to someone and they can negotiate special pricing until defect density hits some predefined target if a company like AMD would otherwise be hesitant to buy in.

I assume that due to how L3 and V-Cache are connected, the V-Cache will have to be 48MB*x as well.

No it wouldn’t. The additional v-cache just increases the associativity of the cache, so it doesn't need to match the size at all and only needs to be a multiple of the size of a set.

While there's theoretically nothing stopping them from designing a cache that functions differently, it would require a lot of added complexity as the cache needs to be able to decode for multiple modes. There's also not a lot of benefit to making an L3 with larger blocks/lines than the L2 and L1 cache support. It does turn one (or maybe two) misses on highly sequential data access into hits, but the added latency to fill and access larger cache lines may not be worth the trade off as a lot of software is already optimized around L1/L2 cache sizes.

Maybe if AVX became a lot more popular it would be worth doing something like that so you can load a massive vector into L3 cache, but existing code bases wouldn't benefit all that much. Servers likely would just want more sets anyway so they don't benefit from the design change either.
 

inquiss

Senior member
Oct 13, 2010
399
562
136
They can announce it during launch to be available in X months, how about that? And start pre-orders immediately. Then people have certainty.

Also Apple prepares tens of millions of iPhones for launch - somehow they can do it, and it's far more effort than put 1000 chips into trays and off you go.
On your first point, they could do. People already expect it so I doubt it Osbornes that much. Depends if other products are using the packaging though, if server products use that more next gen then they might not know when they'll release the desktop version because you'd want to sell as much as possible in server, if packaging is the bottleneck.

On the second point. Apple don't by having lots of capacity and stocking for launch. If they have 3D stacked cache on their chips they'd be limited in production volumes and, even if that were not the case, would have to launch later than they do. That simple really.
 
Reactions: Thibsie

basix

Member
Oct 4, 2024
106
221
76
I assume that due to how L3 and V-Cache are connected, the V-Cache will have to be 48MB*x as well.
I don't know if required but at least we hope for that 144MByte L3$ sounds great.

When I look at the Zen 5 CCD and assuming that the 12C Die gets a little bit smaller (e.g. 60mm2 instead of 70mm2, best case), it would be a nice fit to use N4C for the 96 MByte cache chiplet. Memory density should fit really well.
So we have a roughly ~1.2x smaller Die at a ~1.25x more expensive node (according to early AMD estimates of N7 and N5) or in other words roughly the same cost for the cache Die. Not too bad for 1.5x memory capacity.
The N2 CCD is probably a bit more expensive despite the smaller Die area.

But even if it should stay at ~70mm2 it is probably worthwile to go for 96MByte instead of only 64MByte. The cost difference overall ist not that huge (N2 CCD + X3D Cache-Die + IOD). And the additional area could get used for some deep trench capacitors to boost CPU clock rates (see Graphcore IPU, 2nd Link). And we keep the 12MByte/Core L3$ ratio from all prior V-Cache CPUs (SW tuning/compatibility).


 

Makaveli

Diamond Member
Feb 8, 2002
4,916
1,504
136
I don't know if required but at least we hope for that 144MByte L3$ sounds great.

When I look at the Zen 5 CCD and assuming that the 12C Die gets a little bit smaller (e.g. 60mm2 instead of 70mm2, best case), it would be a nice fit to use N4C for the 96 MByte cache chiplet. Memory density should fit really well.
So we have a roughly ~1.2x smaller Die at a ~1.25x more expensive node (according to early AMD estimates of N7 and N5) or in other words roughly the same cost for the cache Die. Not too bad for 1.5x memory capacity.
The N2 CCD is probably a bit more expensive despite the smaller Die area.

But even if it should stay at ~70mm2 it is probably worthwile to go for 96MByte instead of only 64MByte. The cost difference overall ist not that huge (N2 CCD + X3D Cache-Die + IOD). And the additional area could get used for some deep trench capacitors to boost CPU clock rates (see Graphcore IPU, 2nd Link). And we keep the 12MByte/Core L3$ ratio from all prior V-Cache CPUs (SW tuning/compatibility).


I can see this with the smaller cores and more room to move the cache up from 64 to 96.
 
Reactions: basix

Doug S

Diamond Member
Feb 8, 2020
3,120
5,362
136
On the second point. Apple don't by having lots of capacity and stocking for launch. If they have 3D stacked cache on their chips they'd be limited in production volumes and, even if that were not the case, would have to launch later than they do. That simple really.

That's not really a problem for Apple. They aren't likely to stack cache on iPhone SoCs, or base level Apple Silicon. It would probably be something they did only for Max dies, and not all of them. Those are lower volume parts and the products containing them aren't released on a set schedule so the production volume limitations and having to launch a bit later wouldn't be an issue.
 

Jan Olšan

Senior member
Jan 12, 2017
520
1,026
136
A14 is too late for Zen7, A16 is most likely since it's a half node from N2.
Is it? It feels more like a parallel technology. It is coming later than N2, but parallel to N2P and it's apparently a backside-power-delivery enabled version of N2P.

Apparently TSMC is branching between processes with backside power delivery and processes without.
N2P -> A14 is the non-BPD line and A16 -> (yet unnanounced process that will be A14+BPD) is the second one with BPD.

Actually I wonder what will AMD use, will they go for backside power delivery? If yes, then that rules A14 out. Well, of course they can use both for different products.

(BTW, the recent TSMC slides also put N2X production only into 2027, so potentially that contradicts MLID info about Zen 6 using N2X, I'd expect it to be on plain N2 instead. Actually that kind of goes without saying - if AMD was one of the first clients taping out a chip - for Genoa - then that chip can't be using N2X).
 
Last edited:

inquiss

Senior member
Oct 13, 2010
399
562
136
That's not really a problem for Apple. They aren't likely to stack cache on iPhone SoCs, or base level Apple Silicon. It would probably be something they did only for Max dies, and not all of them. Those are lower volume parts and the products containing them aren't released on a set schedule so the production volume limitations and having to launch a bit later wouldn't be an issue.
So I think what you're saying is, yes, if Apple added 3D vcache to a part it would launch later. So I think you're agreeing that if AMD launched vcache first it would just be a delay of everything else. Fairly sure you're agreeing with me, otherwise I'm missing the point of your post. If I am by all means let me know what I've missed.
 

RTX

Member
Nov 5, 2020
164
118
116
- if the "backside power delivery" means power on the side away from the motherboard, how does this power get there?

You either use TSVs to go through the logic die to the BSPDN die and then back to the backside of the logic die, or the logic die is no longer flip chip and sits “on top” of the BSPDN die.

-Edit: the context to it is how this can work with V-Cache

Not sure if AMD is really planning to use BSPDN in the near future, but maybe the V-cache die and BSPDN die are one die in this scenario. Or you get into cache filled interposers or something. Not sure, haven’t really thought about it.
There's also Vertical Power Delivery ( separate from backside power ) moves the pmics under the cpu/gpu dies.
 

Win2012R2

Senior member
Dec 5, 2024
893
852
96
If they have 3D stacked cache on their chips they'd be limited in production volumes and, even if that were not the case, would have to launch later than they do. That simple really.
No: if Apple had 3D stacked cache on their chips then they would make sure they got enough capacity to sell sell sell. That's what Apple would do.

What's harder, slower and more expensive to make - N2 fab or create enough capacity for 3D cache integration?
 

OneEng2

Senior member
Sep 19, 2022
512
742
106
Is it? It feels more like a parallel technology. It is coming later than N2, but parallel to N2P and it's apparently a backside-power-delivery enabled version of N2P.

Apparently TSMC is branching between processes with backside power delivery and processes without.
N2P -> A14 is the non-BPD line and A16 -> (yet unnanounced process that will be A14+BPD) is the second one with BPD.

Actually I wonder what will AMD use, will they go for backside power delivery? If yes, then that rules A14 out. Well, of course they can use both for different products.

(BTW, the recent TSMC slides also put N2X production only into 2027, so potentially that contradicts MLID info about Zen 6 using N2X, I'd expect it to be on plain N2 instead. Actually that kind of goes without saying - if AMD was one of the first clients taping out a chip - for Genoa - then that chip can't be using N2X).
From what I have been able to read and discuss, BSPDN isn't a slam dunk for everything; however, it works especially well for some things.

I like that TSMC intends to make their libraries compatible between N2 and A16 w/ BSPDN. This will make it easy for design houses to test their designs on either process to determine which process works best for their design. To date, Intel does not have anything like this that I am aware of.
 

branch_suggestion

Senior member
Aug 4, 2023
647
1,366
96

inquiss

Senior member
Oct 13, 2010
399
562
136
No: if Apple had 3D stacked cache on their chips then they would make sure they got enough capacity to sell sell sell. That's what Apple would do.

What's harder, slower and more expensive to make - N2 fab or create enough capacity for 3D cache integration?
I don't think you understand some basics here. Adding 3D vcache takes time. From wafer start to product takes longer. You can make a chip without vcache in x time or you can make a chip worth 3D vcache in x+y time. Y is non zero. Capacity or not capacity they would have to launch later.
 

Win2012R2

Senior member
Dec 5, 2024
893
852
96
Capacity or not capacity they would have to launch later.
No, it does NOT have to launch later - they can first accumulate non-3D chips to actually have stock for launch, and some 3D ones that will be super popular and out of stock quickly, we are talking months here max, not years, so it's totally doable and should have been done.

The problem is the SoIC capacity for AMD
Is the problem with SoIC that it is God level tech that can not be replicated because all knowledge how it was done is now totally lost?

If AMD makes commitment, and it can't be too expensive, this ain't N2 fabs, then everything will be done with bells on. That's the problem - AMD.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |