AMD Navi31 flagship RDNA3 GPU could hit 92 TFLOPS of FP32 performance, four times more than Navi21

Published: 30th Apr 2022, 13:45 GMT   Comments

Please note that this post is tagged as a rumor.

Update:

Greymon55 clarified that the 92 TFLOPS claim was referring to the old, rumored 15360 Stream Processor configuration. He has since provided an update now claiming that the core count is 12288 Stream Processors. This means that the card with 3 GHz clock would feature 73 TFLOPS, not 92 as he claimed.

AMD Radeon RX 7900XT could potentially feature 3 GHz GPU clock

AMD leaker Greymon55 provides an update to half a year rumors on Navi 31 GPU.

The leaker now claims that the GPU might hit 92 TFLOPS of FP32 performance, which is 28% more than the previous claim of 75 TFLOPS. This new number is also four times as high as RDNA2 based Navi 21 GPU (RX 6900XT) model.

Greymon55 earlier mentioned that there is a major change to the specs that were shared for more than a year, but has not explained what those changes are. However, should AMD keep 60 WorkGroup Processor (WGP) and 15360 Stream Processor configuration, such single-precision compute performance would require 2995 MHz GPU clock. However, it is very likely that similarly to NVIDIA and their Ada architecture, AMD will be introducing a matrix of different 32-bit instructions that result in higher floating point performance.

Furthermore, contrary to previous rumors, AMD might still keep Compute Units for its RDNA3 (GFX11) architecture. This is suggested by the recent changes to AMD GPU driver discovered by @Kepler_L2:

AMDGPU driver,Source: Freedesktop

AMD Radeon RX 7000 series are now expected to launch by the end of this year. The Navi 31 GPU should be among the first to be announced by AMD, most likely for RX 7900 XT class graphics card, the new flagship consumer SKU.

Next-gen Flagship GPU Comparison (RUMORED)
VideoCardz.comGeForce RTX 4090 – ClassRadeon RX 7900 – ClassArc B9XX – Class
Fabrication NodeTSMC N5TSMC N5/N6TBC
ArchitectureNVIDIA AdaAMD RDNA3Intel Xe2-HPG “Battlemage”
GPU PackageMonolithicMulti-Chip-Module (MCM)Multi-Chip-Module (MCM)
Estimated GPU Size~600mm²~800mm²TBC
Graphics Dies12 GCD + 4 MCD + 1 IOD4 Tiles
GPU Mega Clusters12 Graphics Processing Clusters (GPC)2×3 Shader Engines10 Render Slices
GPU Super Clusters72 Texture Processing Clusters (TPC)2×30 RDNA Workgroups (WGP)40 Xe-Cores
GPU Clusters144 Streaming Multiprocessors (SM)120 Compute Units1280 Xe Vector Engines
FP32 Cores18432 CUDA Cores15360 Stream Processors20480 FP32 ALUs
GPU Clock~ 2.7 GHz~ 3.0 GHzTBC
Memory Type24 GB GDDR6XTBC GB GDDR6TBC
Memory & Bus21 Gbps 384-bitTBC Gbps 256-bitTBC
Cache96MB (L2 Cache)256 or 512MB Infinity CacheTBC
Power Consumption600W (TDP)TBCTBC
Release DateQ3/Q4 2022Q3/Q4 20222023-2024
Rumored Performance~ 100 TFLOPs FP32 *~ 92 TFLOPs FP32TBC

* Updated as Greymon55 suggests FP32 performance of AD102 GPU could reach 100 TFLOPS.

Source: @Greymon_55, @Kepler_L2




Comment Policy
  1. Comments must be written in English.
  2. Comments deemed to be spam or solely promotional in nature will be deleted. Including a link to relevant content is permitted, but comments should be relevant to the post topic. Discussions about politics are not allowed on this website.
  3. Comments and usernames containing language or concepts that could be deemed offensive will be deleted. Note this may include abusive, threatening, pornographic, offensive, misleading, or libelous language.
  4. Comments complaining about the article subject or its source will be removed.
  5. A failure to comply with these rules will result in a warning and, in extreme cases, a ban. Please also note that comments that attack or harass an individual directly will result in a ban without warning.
  6. VideoCardz Moderating Team reserves the right to edit or delete any comments submitted to the site without notice.
  7. If you have any questions about the commenting policy, please let us know through the Contact Page.
Hide Comment Policy
Comments