It appears I’ve just discovered first Polaris 11 device in CompuBench database.
Polaris 11 “67FF” with 16 Compute Units
So far Polaris 11 was quite a mystery. The smaller Polaris chip that was shown competing with GTX 950 back in January, was not in the spotlight two months later at Capsaicin event. This silicon is expected to replace Pitcairn-based products with more power efficient solution.
AMD Polaris 11, Device ID 67FF:C8 codenamed “Goose”
The word ‘SKU’ in the post title is intentional, as we know that Polaris 11 will launch in many different configurations.
It’s important, because CompuBench shows active Compute Units, so for instance GTX 970 is shown with 13 active units. Therefore, it does not mean that full Polaris 11 silicon has 16 CUs, but this particular SKU does.
Polaris 11 OpenCL device information (CompuBench) / information according to Khronos
If 16 Compute Units are correct, and Polaris GCN 4.0 architecture still has 64 Stream Processors per CU, then Polaris 11 67FF SKU should feature 1024 unified cores.
You might wonder why it’s only 1024 SP, but let me remind you that new FinFET architectures have much higher transistor density per mm2. For example, Pascal GP100 has twice as many transistors as Maxwell GM200. Plus, AMD’s Polaris is using even smaller fabrication node than Pascal (14nm vs 16nm). In other words, those unified cores (CUDAs/Stream Processors) will be much more efficient than its predecessors.
To illustrate what is known to this day about Polaris and Vega, I made this table.
Please note, specs of Polaris 10 67DF and Polaris 67FF are not necessarily showing full silicons.
AMD Polaris / Vega GPU Specifications | |||
---|---|---|---|
April 8th, 2016 | AMD Vega 10 | AMD Polaris 10 (67DF) | AMD Polaris 11 (67FF) |
GPU | Vega 10 / Greenland | Polaris 10 / Ellesmere | Polaris 11 / Baffin |
Positioning | Enthusiast | High-end | Mid-range |
Fabrication Process | 14nm FinFET | 14nm FinFET | 14nm FinFET |
Compute Units | 64 | 36+ | 16+ |
Stream Processors | 4096 | 2304+ | 1024+ |
Computing Power | ~8.2 TFLOPs | ~3.7 TFLOPs | ~ 2.0 TFLOPs |
Core clock | ~1000 MHz | ~800 MHz | ~1000 MHz |
Effective Memory Clock | ~2000 MHz | ~6000 MHz | ~7000 MHz |
Memory Bus | 4096-bit | 256-bit | 128-bit |
Memory | 16GB HBM2 | 8GB GDDR5 | 4GB GDDR5 |
Bandwidth | 1024 GB/s | 192 GB/s | 112 GB/s |
Launch Date | Q4’16 – Q1’17 | Q2’16 | Q2’16 |
AMD Polaris 11 vs GeForce GTX 950 OpenCL performance
This is the first benchmark of Polaris 11 at CompuBench, so I find it unreasonable to compare those results to anything. Still, just to give you some idea on how it performs at this stage, here’s a comparison between Polaris11 and GTX 950 in OpenCL benchmark:
CES2016 Polaris 11 demonstration
Back in January, when AMD announced Polaris architecture, were were told that Radeon card based on Polaris 11 can offer stable 60 frames per second in Star Wars Battlefront, while the whole system drew just 86W from power socket. Meanwhile system with GeForce GTX 950 delivering the same framerate required 140W.
Here’s a video from PCPerspective showing this demo: