NVIDIA confirms Ada 102/103/104 GPU specs, AD104 has more transistors than GA102

Published: Sep 23rd 2022, 07:01 GMT   Comments

NVIDIA Ada GPUs have significantly higher ROP count

NVIDIA is clarifying the specs for RTX 40 series.

The company released full info on the die sizes and transistor counts on AD102, AD103 and AD104 GPUs. All three are to launch in the following weeks. NVIDIA already provided important figures for AD102 GPU, the flagship processor intended for RTX 4090 graphics card, but the details on AD104 and AD103 were still missing. Ryan Smith from AnandTech reports on the exact figures:

  • AD102: 608 mm² die, 76.3B transistors
  • AD103: 378.6 mm² die, 45.9B transistors
  • AD104: 294.5 mm² die, 35.8B transistors

What this means is that all three xtor density higher than 121M per square mm (it is actually identical for AD103 and AD104). Furthermore, AD104 with 35.8B transistors means it has 7.5B transistors more than Ampere GA102 GPU flagship (28.3B). To put that into perspective, GA102 is more than twice as large as AD104.

ArchitectureAda LovelaceAda LovelaceAda Lovelace
Process NodeTSMC 4N (5nm)TSMC 4N (5nm)TSMC 4N (5nm)
Die Size608 mm²378.6 mm²294.5 mm²
Transistor Density125.5M121.1M121.1M
Streaming Multiprocessors1448060
CUDA Cores18432102407680
Tensor Cores576320240
RT Cores1448060
L2 Cache96MB64MB48MB
SKURTX 4090RTX 4080 16GBRTX 4080 12GB

NVIDIA Ada GPUs have a much higher count of Render Output Unit (ROP) than the predecessor, going up to 192 ROPs for AD102. The AD103 GPU has just as many ROPs as GA102 (112), while AD104 had 80. Higher ROP count should improve rasterization performance.

NVIDIA has introduced some changes to the architecture, such as removal of NVLink, as explained, to make room for other logical blocks. But at the same time, the L2 cache has significantly increased. NVIDIA has now confirmed the exact size for each SKU: AD102 96MB, AD103 64MB and AD104 48MB. It is confirmed that both RTX 4080 models have fully unlocked L2 cache on respective GPUs, so 4080 16GB has 64MB while 4080 12GB comes with 48MB.

Furthermore, HKEPC reports that NVIDIA also clarified what TSMC 4N really means, which is not to be confused with N4. This process is a die shrink of TSMC 5N process, but it is still a 5nm architecture. The only problem with this ‘clarification’ is that NVIDIA themselves provide wrong information on 4nm process, as shown below (slide from this week’s Editors Day).


Source: Ryan Smith (AnandTech), HKEPC

Comment Policy
  1. Comments must be written in English and should not exceed 1000 characters.
  2. Comments deemed to be spam or solely promotional in nature will be deleted. Including a link to relevant content is permitted, but comments should be relevant to the post topic. Discussions about politics are not allowed on this website.
  3. Comments and usernames containing language or concepts that could be deemed offensive will be deleted.
  4. Comments complaining about the post subject or its source will be removed.
  5. A failure to comply with these rules will result in a warning and, in extreme cases, a ban. In addition, please note that comments that attack or harass an individual directly will result in a ban without warning.
  6. VideoCardz has never been sponsored by AMD, Intel, or NVIDIA. Users claiming otherwise will be banned.
  7. VideoCardz Moderating Team reserves the right to edit or delete any comments submitted to the site without notice.
  8. If you have any questions about the commenting policy, please let us know through the Contact Page.
Hide Comment Policy