Considering how GDDR6 will get cheaper in the coming months/years through sheer manufacturing volume, I would not put my hopes high for HBM2 in any consumer products, unless...
... one of the companies is extremely desperate to remain relevant in the consumer space for one reason or another.
Mmmmmm .... I am not specifically talking about HBM for consumer devices. Of course CDNA is not a consumer product, and the current Instinct series uses HBM2.
Having said that, it remains to be seen how expensive HBM2e is going to be. Right now there is excess DRAM capacity even without the P2 fab coming online.
HBM2 wafer availability has increased many times over what was available two years ago, because numerous fab lines have come online, like the gigantic fabs I mentioned in the sources above.
Additionally, for the current CDNA GPUs, TSMC can do the HBM2 integration from the KGSD dies with CoWoS in-house if needed.
Compare this to V64, where they needed to get the KGSDs from Hynix, the GPU die from GloFo, and the interposer from UMC, then send them all to SPIL for integration.
As regards HBM2e vs DDR5 in the context of X3D and CDNA2, it is going to be an interesting comparison, because DDR5 is more complex than DDR4: there is an onboard PMIC, ECC on reads and writes, plus double the channels, so it will be costlier to manufacture.
In this context of X3D with HBM2e, for example, a 4 x 1-Hi stack (16Gb per die, 4096 bits wide as 4x1024) would not even need TSVs between stacked dies, and the base die could already be the interposer. For this specific case of 8GB of on-chip memory, it is basically four single DRAM dies, and everything else is already there. If they can hit 3.8 Gbps (SK Hynix's advertised max speed) or 4.1 Gbps (Samsung's advertised max speed), it could be a mind-boggling ~2 TB/s.
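The ~2 TB/s figure checks out as a simple back-of-the-envelope calculation (my own sketch, just bus width times per-pin speed, not anything from a datasheet):

```python
def peak_bandwidth_gb_s(bus_width_bits: int, pin_speed_gbps: float) -> float:
    """Peak bandwidth in GB/s: bus width (bits) x per-pin data rate (Gbps) / 8 bits per byte."""
    return bus_width_bits * pin_speed_gbps / 8

# 4 stacks x 1024-bit interface each = 4096-bit bus
print(peak_bandwidth_gb_s(4096, 3.8))  # SK Hynix figure: 1945.6 GB/s, ~1.9 TB/s
print(peak_bandwidth_gb_s(4096, 4.1))  # Samsung figure:  2099.2 GB/s, ~2.1 TB/s
```

So even the lower SK Hynix speed bin lands just shy of 2 TB/s on a 4096-bit bus.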
Furthermore, to take this concept even further, a special SoC on N5 could have enough DRAM for both the GPU and the system, and enough density to pack in a GPU to boot. E.g. the XSX SoC plus RAM on N5 would be around 300mm2, which is manageable... getting a bit ahead of myself here.