Intel claims Ponte Vecchio HPC GPU is up to 2.5x faster than NVIDIA A100

Published: 22nd Aug 2022, 12:59 GMT   Comments

2-stack Ponte Vecchio GPU is faster than NVIDIA A100 according to Intel

At HotChip34 Intel is disclosing more details on its Ponte Vecchio Xe-HPC GPU. 

The company’s first data-center General Purpose GPU is built using 47 chiplets combining multiple architectures and nodes. It is by far the most sophisticated GPU Intel has ever built, but the architecture that has been pushed back numerous times.

Ponte Vecchio Memory & Throughput, Source: Intel HotChips34

Details disclosed at HotChips34 by Hong Jiang, Intel Fellow & Chief GPU Compute Architect, include the maximum theoretical throughput based on single-precision and double precision compute power for the 2-stack Ponte Vecchio. Tere are also figures for compute workloads accelerated by XMX cores, which are part of the Xe-HPC architecture.

Ponte Vecchio features Intel 7, TSMC N7 and N5 processes. It is built using Foveros and EMIB (multi-die interconnect bridge) 2.5D packaging technology. A single Ponte Vecchio features 128 Xe-Cores, 128 Ray Tracing Units and 64 MB and 408 MB of L1 and L2 caches respectively. This GPU is also equipped with up to 128 GB of HBM2e memory and supports industry-latest PCIe Gen5 interface.

Ponte Vecchio in DPC++ with SYCL & CUDA, Source: Intel HotChips34

With Data Parallel C++ (DPC++) Intel is claiming its Ponte Vecchio GPU is 1.4x to 2.5 times faster in some workloads. The company is also disclosing compute figures for ExaSMR OpenMC (Monte Carlo particle transport code) where Intel GPU offers twice the performance and for NekRS (Navier Stokes solver) it’s 1.3 to 1.7x faster.

Ponte Vecchio in ExaSMR & miniBUDE, Source: Intel HotChips34

This is not the first time Intel has been sharing performance figures for Ponte Vecchio. The launch of this new HPC GPU, however, is long overdue. Ponte Vecchio was meant to debut with Aurora Supercomputer alongside Sapphire Rapids Xeon CPUs, the US first exascale supercomputer. However, this title already belongs to Frontier equipped with AMD 3rd Gen EPYC CPUs and AMD Instinct MI250X GPUs (peak performance of 1.6 Exaflop).

2022-2023 HPC GPUs
VideoCardz.comNVIDIA H100 SXMAMD Instinct MI250X OAMIntel Ponte Vecchio OAMIntel Rialto Bridge OAM
Picture
GPUGH100Aldebaran (MCM)Ponte Vecchio (MCM)Rialto Bridge (MCM)
Transistors80B58.2B100BTBC
Die Size814 mm²2x ~790 mm²2x 640 mm²TBC
ArchitectureHopperCDNA2Xe-HPCXe-HPC
Fabrication NodeTSMC N4TSMC N6Intel 7, TSMC N5/N7Intel 4 (?)
GPU Clusters132 (SMs)220 (CUs)128 Xe-Cores160 Xe-Cores
L2 Cache50MB32MB408 MBTBC
Tensor/Matrix Cores5282x 440128160
Memory Bus5120-bit8192-bit8192-bit8192-bit (?)
Memory Size80 GB HBM3128GB HBM2e128GB HBM2eHBM3
TDP700W560W~600W~800W
Interface/Form FactorSXM5/PCIe Gen5OAM/PCIe Gen5OAM/PCIe Gen5OAM V2
Launch Year2022202120222023

Source: Wccftech




Comment Policy
  1. Comments must be written in English.
  2. Comments deemed to be spam or solely promotional in nature will be deleted. Including a link to relevant content is permitted, but comments should be relevant to the post topic. Discussions about politics are not allowed on this website.
  3. Comments and usernames containing language or concepts that could be deemed offensive will be deleted. Note this may include abusive, threatening, pornographic, offensive, misleading, or libelous language.
  4. Comments complaining about the article subject or its source will be removed.
  5. A failure to comply with these rules will result in a warning and, in extreme cases, a ban. Please also note that comments that attack or harass an individual directly will result in a ban without warning.
  6. VideoCardz Moderating Team reserves the right to edit or delete any comments submitted to the site without notice.
  7. If you have any questions about the commenting policy, please let us know through the Contact Page.
Hide Comment Policy
Comments