AMD Instinct MI200 to feature 128GB per GPU

Published: Jul 5th 2021, 20:28 GMT

AMD Mi-Next GPUs to feature 128GB per GPU

Pawsey Supercomputing Centre is to build the Setonix supercomputer featuring AMD Milan CPUs and Mi-Next GPUs.

At ISC 2021, Ugo Varetto, Pawsey’s CTO, revealed that Pawsey’s supercomputer, named Setonix, will feature more than 200,000 AMD Milan CPU cores and more than 750 AMD Mi-Next GPUs. The announcement would not be that notable were it not for Varetto’s disclosure that each of those GPUs will feature 128GB of memory.

The supercomputer, acquired thanks to 70 million dollars in funding, will improve Pawsey’s work in data ingestion, data visualization, data lifecycle management, and data sharing, or, as HPCWire put it, data work. Operating on large data sets requires a large memory pool, which in combination with GPU acceleration can greatly improve the efficiency of the system.

Pawsey’s SC Setonix supercomputer, Source: Ugo Varetto

AMD Mi-Next, namely the Instinct MI200, will undoubtedly be the largest GPU AMD has made so far. The GPU, known as Aldebaran, is to feature a Multi-Chip-Module (MCM) design. According to Pawsey’s announcement, this GPU will feature 128GB of HBM2 memory, four times as much as its predecessor, the 32GB MI100.

Just a few days ago, Locuza created a block diagram of the MI200 GPU. It shows how eight stacks of HBM2e memory would be attached to the two GPU dies:

AMD Aldebaran GPU Block Diagram, Source: @Locuza_

AMD has not yet confirmed the specifications of its MI200 accelerator, but according to leaks the full Aldebaran GPU features 128 Compute Units. The number of active CUs on the MI200 specifically has not been confirmed.

AMD Instinct Accelerators
| Accelerator Name | AMD Radeon Instinct MI60 | AMD Instinct MI100 | AMD Instinct MI200 |
|---|---|---|---|
| Architecture | 7nm GCN5 | 7nm CDNA1 (GFX908) | CDNA2 (GFX90A) |
| GPU | Vega 20 | Arcturus | Aldebaran (MCM) |
| GPU Cores | 4096 | 7680 | TBC |
| GPU Clock Speed | 1800 MHz | ~1500 MHz | TBC |
| FP16 Compute | 29.5 TFLOPs | 185 TFLOPs | TBC |
| FP32 Compute | 14.7 TFLOPs | 23.1 TFLOPs | TBC |
| FP64 Compute | 7.4 TFLOPs | 11.5 TFLOPs | TBC |
| Memory Clock | 1000 MHz | 1200 MHz | TBC |
| Memory Bus | 4096-bit bus | 4096-bit bus | TBC |
| Memory Bandwidth | 1 TB/s | 1.23 TB/s | TBC |
| Form Factor | Dual Slot, Full Length | Dual Slot, Full Length | OAM |
| Cooling | Passive Cooling | Passive Cooling | TBC |
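
The bandwidth and FP32 figures for the MI60 and MI100 can be roughly reproduced from the other rows of the table. The sketch below is only an illustration: the function names are hypothetical, and it assumes double-data-rate memory (two transfers per clock) and one fused multiply-add (two FLOPs) per core per clock, neither of which is stated in the article. It says nothing about the MI200, whose figures remain TBC.

```python
def peak_bandwidth_gbs(bus_width_bits, memory_clock_mhz, transfers_per_clock=2):
    """Peak memory bandwidth in GB/s.

    Assumption (not from the article): the memory is double data rate,
    so it performs two transfers per clock cycle.
    """
    transfers_per_second = memory_clock_mhz * 1e6 * transfers_per_clock
    bytes_per_transfer = bus_width_bits / 8
    return bytes_per_transfer * transfers_per_second / 1e9


def peak_fp32_tflops(gpu_cores, gpu_clock_mhz, flops_per_core_per_clock=2):
    """Peak FP32 throughput in TFLOPs.

    Assumption (not from the article): each core retires one fused
    multiply-add, counted as two FLOPs, per clock cycle.
    """
    return gpu_cores * gpu_clock_mhz * 1e6 * flops_per_core_per_clock / 1e12


# Values taken from the MI60 and MI100 columns of the table above.
mi60_bandwidth = peak_bandwidth_gbs(4096, 1000)    # about 1 TB/s
mi100_bandwidth = peak_bandwidth_gbs(4096, 1200)   # about 1.23 TB/s
mi60_fp32 = peak_fp32_tflops(4096, 1800)           # about 14.7 TFLOPs
mi100_fp32 = peak_fp32_tflops(7680, 1500)          # close to the listed 23.1 TFLOPs

print(f"MI60:  {mi60_bandwidth:.1f} GB/s, {mi60_fp32:.2f} FP32 TFLOPs")
print(f"MI100: {mi100_bandwidth:.1f} GB/s, {mi100_fp32:.2f} FP32 TFLOPs")
```

The MI100 FP32 result comes out slightly under the listed 23.1 TFLOPs because the table gives its clock only approximately (~1500 MHz).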

Source: HPCWire via Reddit
