AMD shows off its MI300 at CES 2023
The next generation data-center processor has been confirmed to feature up to 24 Zen4 cores.
AMD Instinct MI300 presentation, Source: AMD
AMD wrapped up the CES 2023 keynote by introducing new details on its ‘world’s first data center integrated CPU + GPU’. This concept has been proven useful for the consumer market, by combining CPU and GPU cores on a single die, reducing the footprint of such integration and increasing efficiency.
AMD MI300 will take things to a new level, by using 3D die stacking technology and combining the CPU/GPU cores with high-speed memory. The MI300 is confirmed to feature up to 128GB of HBM3 memory, which can actually allow the CPU to be used without external memory.
AMD Instinct MI300 presentation, Source: AMD
In AMD CEO own words, AMD’s new Instinct combines up to 24 EPYC Zen4 cores and CDNA3 compute architecture within nine 5nm chiplets placed on top of four 6nm chiplets. This technology will be the first time AMD is using 3D die stacking. Dr. Lisa Su confirmed that the chip is already in the labs and has showcased one of the chips during the keynote. The processor is officially coming in the second half of this year.
AMD Instinct MI300 presentation, Source: AMD
The MI300 ‘APU’ is a successor to MI250 based on CDNA2 architecture, which was released last year. AMD is not providing any performance figures for its CDNA3 based chip yet, therefore comparisons are not possible. But this will be undoubtedly a massive change to AMD EPYC/Instinct product stack regardless.
AMD Instinct Accelerators Specifications | ||||
---|---|---|---|---|
VideoCardz | AMD Radeon Instinct MI60 | AMD Instinct MI100 | AMD Instinct MI250X | AMD Instinct MI300 |
Launch Year | 2018 | 2020 | 2022 | 2023 |
Fabrication Nodes | 7nm | 7nm | 6nm MCM | 5nm + 6nm 3D Die Stacking |
CPU | – | – | – | up to 24 Zen4 cores |
GPU | Vega 20 GCN5 (GFX906) | Arcturus CDNA1 (GFX908) | Aldebaran CDNA2 (GFX90A) | CDNA3 (GFX940) |
Base Chiplets | – | – | – | up to 2 |
Compute Tiles | 1 | 1 | 2 | up to 8 |
Compute Units | 64 | 120 | 220 | TBC |
GPU Clock Speed | 1800 MHz | ~1500 MHz | ~1700 MHz | TBC |
FP16 Compute | 29.5 TFLOPS | 185 TFLOPS | 383 TFLOPS | TBC |
FP32 Compute | 14.7 TFLOPS | 23.1 TFLOPS | 47.9 TFLOPS | TBC |
FP64 Compute | 7.4 TFLOPS | 11.5 TFLOPS | 47.9 TFLOPS | TBC |
VRAM | 32 GB HBM2 | 32 GB HBM2 | 128 GB HBM2e | 128GB HBM3 |
Memory Clock | 2.0 Gbps | 2.4 Gbps | 3.2 Gbps | TBC |
Memory Bus | 4096-bit | 4096-bit | 8192-bit | up to 8192-bit |
Memory Bandwidth | 1 TB/s | 1.23 TB/s | 3.2 TB/s | TBC |
Form Factor | Dual Slot, Full Length | Dual Slot, Full Length | OAM | OAM |
TDP | 300W | 300W | 560W | up to 600W+ |
Source: AMD