What would a 4-thread CCX look like in a Nosta APU?
Either two cluster-based multithreading modules with two clusters/cores each, or a single cluster-based multithreading module with four clusters/cores.
What's the advantage of a smaller CU?
Lower power consumption and higher possible frequencies, at the same performance or with a negligible loss. The small CU also has better performance per mm² than the standard CU.
A4-9120C => 600 MHz
22FDX w/ SmCU => 1.2 GHz
Then there is added boost on top of that.
22FDX successor of A6-9220C => 1.2 GHz that boosts up to 1.7 GHz
22FDX successor of A4-9120C => 1.2 GHz that boosts up to 1.4 GHz
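To put those clocks side by side, here is a quick sketch of the ratios. The numbers are just the ones quoted above; the variable names are mine and nothing here is measured data.

```python
# Clock figures quoted in the post, in MHz (speculative, not measured).
A4_9120C_MHZ = 600
BASE_22FDX_MHZ = 1200
BOOST_22FDX_MHZ = {
    "A6-9220C successor": 1700,
    "A4-9120C successor": 1400,
}

def clock_ratio(new_mhz: float, old_mhz: float) -> float:
    """Return how many times faster new_mhz is than old_mhz."""
    return new_mhz / old_mhz

# Base clock gain of the 22FDX small-CU part over the A4-9120C.
base_gain = clock_ratio(BASE_22FDX_MHZ, A4_9120C_MHZ)

# Extra gain from boost, relative to the 1.2 GHz base clock.
boost_gain = {
    name: clock_ratio(mhz, BASE_22FDX_MHZ)
    for name, mhz in BOOST_22FDX_MHZ.items()
}

if __name__ == "__main__":
    print(f"base gain: {base_gain:.2f}x")
    for name, gain in boost_gain.items():
        print(f"{name}: boost is {gain:.2f}x of base")
```

So the base clock alone is the 2x step, and boost adds roughly another 17% to 42% on top, depending on the part.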
DDR4-3600, with the DDR5 maximum being 50% higher than that.
DDR4-3600 maximum for DDR4 and DDR5-5400 maximum for DDR5.
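The memory arithmetic, as a small sketch. The 50% uplift is the figure from the post; the peak-bandwidth part assumes a 64-bit (8-byte) data bus per channel, which is my assumption for illustration and not something stated above.

```python
# Transfer rates in MT/s. The DDR4 figure and the 50% uplift come from the post.
DDR4_MAX_MTS = 3600
DDR5_UPLIFT_PERCENT = 50

# Assumption for illustration: 64 data bits (8 bytes) moved per transfer.
BYTES_PER_TRANSFER = 8

def ddr5_max_mts(ddr4_mts: int, uplift_percent: int) -> int:
    """Apply a percentage uplift to the DDR4 transfer rate."""
    return ddr4_mts * (100 + uplift_percent) // 100

def peak_bandwidth_gbs(mts: int, bytes_per_transfer: int = BYTES_PER_TRANSFER) -> float:
    """Theoretical peak bandwidth in GB/s for one channel."""
    return mts * bytes_per_transfer / 1000

ddr5_mts = ddr5_max_mts(DDR4_MAX_MTS, DDR5_UPLIFT_PERCENT)

if __name__ == "__main__":
    print(f"DDR4-{DDR4_MAX_MTS}: {peak_bandwidth_gbs(DDR4_MAX_MTS):.1f} GB/s per channel")
    print(f"DDR5-{ddr5_mts}: {peak_bandwidth_gbs(ddr5_mts):.1f} GB/s per channel")
```

Under that bus-width assumption, the step from DDR4-3600 to DDR5-5400 is the same 50% in theoretical peak bandwidth per channel.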
We'll have to assume that 3 small CUs are better than 6 standard CUs, specifically for a 22FDX processor.
Combined, the following give the 2x increase:
- 22FDX transistors
- New compute unit
- New circuits/cells for logic + file + cache
- IVR + AVFS improvements from FDSOI
The boost on top of that comes from extras such as forward/reverse body biasing (FBB/RBB).
Could a CU be used for FPU calculations? They had a whole HSA concept 5+ years ago. I think they should get back to it now that Zen is so well along and can go on the back burner a bit.
Yes, but the instruction goes through decode twice.
CPU core: ALU + AGU + FPU + FCU
FCU => Forwarder to Compute Unit
The FCU's job is to generate Graphics Core Next instructions from x86 FPU instructions. The implication is that these x86 FPU ops would use a separate prefix from standard FPU ops.
In the more modern designs, however, the FCU is integrated into the CPU front-end and the GCN back-end (TMUs/graphics L1D/RBE) is integrated into the CPU back-end.
We are unlikely to get either option.