AMD Aldebaran might be a dual-die MCM GPU
AMD MI200 has only just appeared in Linux patches, but given the sheer volume of code, we are still finding new details hidden within it. A patch from February 24th appears to confirm that Aldebaran will be a dual-die GPU. This was discovered by Coelacanth’s Dream. The patch explicitly refers to alde_die_0 and alde_die_1, indicating that the chip has two dies. In this case, “Alde” refers to Aldebaran, the GPU behind the upcoming Instinct MI200 compute accelerator, not Intel Alder Lake.
AMD Aldebaran in Linux patch, Source: Freedesktop
Aldebaran will compete with Intel Xe-HP(C) and NVIDIA Hopper as the first multi-chip module graphics processors or, more precisely, compute accelerators. The same patches have already indicated that the MI200 will use an updated variant of High-Bandwidth Memory called HBM2E. This means that AMD does not need to increase the number of stacks and can instead add more DRAM layers to each stack to increase capacity.
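The capacity argument comes down to simple multiplication: number of stacks, times DRAM dies per stack, times density per die. The sketch below illustrates this. The four-stack figure follows from a 4096-bit bus at 1024 bits per HBM2 stack, while the die counts and the 8 Gbit density are assumptions chosen for illustration, not confirmed MI200 specifications.

```python
def hbm_capacity_gb(stacks: int, dies_per_stack: int, die_density_gbit: int) -> float:
    """Total HBM capacity in GB: stacks x dies per stack x density per die."""
    # Divide by 8 to convert gigabits to gigabytes.
    return stacks * dies_per_stack * die_density_gbit / 8


# Assumed baseline: 4 stacks, 8 dies per stack, 8 Gbit per die.
baseline_gb = hbm_capacity_gb(stacks=4, dies_per_stack=8, die_density_gbit=8)

# Hypothetical taller stack: same 4 stacks, 12 dies per stack.
taller_gb = hbm_capacity_gb(stacks=4, dies_per_stack=12, die_density_gbit=8)

print(f"Baseline: {baseline_gb:.0f} GB")
print(f"Taller stacks: {taller_gb:.0f} GB")
```

With these assumed values the baseline works out to 32 GB, and the taller configuration reaches 48 GB without adding a single stack to the package.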
The multi-chip module will require new chiplets to be added to the package, such as an I/O die. Its functions could theoretically be integrated into each GPU die (as with first-gen EPYC), but that seems rather unlikely.
Monolithic vs MCM design, Source: VideoCardz
AMD continues to state that the CDNA2-based GPU will be manufactured using an “Advanced Node”. This likely means that the chiplets will use different nodes, such as 7nm and 5nm, within the same package. MCM GPUs such as Aldebaran will shape the future of graphics accelerators. It may take a while to see MCM in gaming GPUs, though.
AMD CDNA Roadmap, Source: AMD
AMD Instinct Accelerators

| Accelerator Name | AMD Radeon Instinct MI60 | AMD Instinct MI100 | AMD Instinct MI200 |
|---|---|---|---|
| Architecture | 7nm GCN5 | 7nm CDNA1 (GFX908) | CDNA2 (GFX90A) |
| GPU | Vega 20 | Arcturus | Aldebaran (MCM) |
| GPU Clock Speed | 1800 MHz | ~1500 MHz | TBC |
| FP16 Compute | 29.5 TFLOPs | 185 TFLOPs | TBC |
| FP32 Compute | 14.7 TFLOPs | 23.1 TFLOPs | TBC |
| FP64 Compute | 7.4 TFLOPs | 11.5 TFLOPs | TBC |
| VRAM | 32 GB HBM2 | 32 GB HBM2 | HBM2E |
| Memory Clock | 1000 MHz | 1200 MHz | TBC |
| Memory Bus | 4096-bit | 4096-bit | TBC |
| Memory Bandwidth | 1 TB/s | 1.23 TB/s | TBC |
| Form Factor | Dual Slot, Full Length | Dual Slot, Full Length | OAM |
| Cooling | Passive Cooling | Passive Cooling | TBC |
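The memory bandwidth figures in the table can be reproduced from the bus width and memory clock. The sketch below assumes that HBM2 transfers data twice per clock cycle (double data rate) and uses decimal terabytes. The function name and parameters are illustrative only.

```python
def hbm_bandwidth_tb_s(bus_width_bits: int,
                       memory_clock_mhz: float,
                       transfers_per_clock: int = 2) -> float:
    """Peak memory bandwidth in TB/s (decimal), assuming double data rate."""
    bytes_per_transfer = bus_width_bits / 8
    transfers_per_second = memory_clock_mhz * 1e6 * transfers_per_clock
    return bytes_per_transfer * transfers_per_second / 1e12


# Bus width and memory clock taken from the table above.
mi60_bw = hbm_bandwidth_tb_s(bus_width_bits=4096, memory_clock_mhz=1000)
mi100_bw = hbm_bandwidth_tb_s(bus_width_bits=4096, memory_clock_mhz=1200)

print(f"MI60:  {mi60_bw:.2f} TB/s")
print(f"MI100: {mi100_bw:.2f} TB/s")
```

Under these assumptions the MI60 comes out at about 1.02 TB/s and the MI100 at about 1.23 TB/s, in line with the listed values. The same formula can be applied to the MI200 once its memory clock and bus width are confirmed.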