GPU Comparison
RADEON
AMD Radeon Pro Vega 64
GPU NAME
Vega 10
VRAM
16 GB
BOOST CLOCK
1350 MHz
TDP
250 W
BUS WIDTH
2048 bit
ARCHITECTURE
GCN 5.0
PROCESS
14 nm
LAUNCH DATE
2017
VS
NVIDIA
NVIDIA RTX A4000
GPU NAME
GA104
VRAM
16 GB
BOOST CLOCK
1560 MHz
TDP
140 W
BUS WIDTH
256 bit
ARCHITECTURE
Ampere
PROCESS
8 nm
LAUNCH DATE
2021
PERFORMANCE BENCHMARKS
Geekbench: Metal, OpenCL, Vulkan
3DMark: Steel Nomad (DX12)
PassMark: DirectX 9, DirectX 10, DirectX 11, DirectX 12, G2D, G3D, GPU Compute
DETAILED SPECIFICATIONS
SPECIFICATION
Pro Vega 64
RTX A4000
Core Specs
Shading Units
4,096
6,144
+50.0%
TMUs
256
192
-25.0%
ROPs
64
96
+50.0%
Compute Units
64
—
SM Count
—
48
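The percentage column in this table appears to be the relative change of the RTX A4000 value against the Pro Vega 64 value. A minimal sketch of that arithmetic, assuming the Pro Vega 64 is the baseline (the helper name `percent_diff` is illustrative, not taken from the source):

```python
def percent_diff(baseline: float, other: float) -> float:
    """Relative change of `other` versus `baseline`, in percent."""
    return (other - baseline) / baseline * 100.0


# Values copied from the rows above: (Pro Vega 64, RTX A4000).
rows = {
    "Shading Units": (4096, 6144),
    "TMUs": (256, 192),
    "ROPs": (64, 96),
}

for name, (vega, a4000) in rows.items():
    # The "+" format flag keeps the sign visible, as in the table.
    print(f"{name}: {percent_diff(vega, a4000):+.1f}%")
```

The same function gives -44.0% for the TDP row (250 W versus 140 W).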
Clocks
Base Clock
1250 MHz
735 MHz
Boost Clock
1350 MHz
1560 MHz
Memory Clock
786 MHz (1572 Mbps effective)
1750 MHz (14 Gbps effective)
Memory
Memory Size
16 GB
16 GB
VRAM (MB)
16,384
16,384
0.0%
Memory Type
HBM2
GDDR6
Memory Bus
2048 bit
256 bit
Bandwidth
402.4 GB/s
448.0 GB/s
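The bandwidth figures follow from the bus width and the effective per-pin data rate: bits per transfer times transfers per second, divided by 8 to convert bits to bytes. A short sketch that reproduces both values from the rows above (the function name is hypothetical):

```python
def bandwidth_gb_s(bus_width_bits: int, effective_rate_gbps: float) -> float:
    """Peak memory bandwidth in GB/s.

    bus_width_bits:      memory bus width in bits
    effective_rate_gbps: effective data rate per pin in Gbit/s
    """
    return bus_width_bits * effective_rate_gbps / 8.0


# Pro Vega 64: 2048-bit HBM2 at 1572 Mbps (1.572 Gbps) effective.
vega_bw = bandwidth_gb_s(2048, 1.572)
# RTX A4000: 256-bit GDDR6 at 14 Gbps effective.
a4000_bw = bandwidth_gb_s(256, 14.0)

print(f"Pro Vega 64: {vega_bw:.1f} GB/s")
print(f"RTX A4000:   {a4000_bw:.1f} GB/s")
```

This shows why the much wider HBM2 bus does not translate into higher bandwidth here: the GDDR6 per-pin rate is roughly nine times higher.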
Cache
L1 Cache
16 KB (per CU)
128 KB (per SM)
L2 Cache
4 MB
4 MB
Performance
Pixel Rate
86.40 GPixel/s
149.8 GPixel/s
Texture Rate
345.6 GTexel/s
299.5 GTexel/s
FP32 (TFLOPS)
11.06 TFLOPS
19.17 TFLOPS
FP64 (GFLOPS)
691.2 GFLOPS (1:16)
299.5 GFLOPS (1:64)
FP16 (TFLOPS)
22.12 TFLOPS (2:1)
19.17 TFLOPS (1:1)
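The theoretical rates in this section can be derived from the unit counts and the boost clock listed earlier. The sketch below assumes the usual conventions: one pixel per ROP per clock, one texel per TMU per clock, and two FP32 operations (one fused multiply-add) per shader per clock. The `Gpu` class and its field names are illustrative only.

```python
from dataclasses import dataclass


@dataclass
class Gpu:
    name: str
    shaders: int
    tmus: int
    rops: int
    boost_mhz: float
    fp64_divisor: int  # FP64 runs at 1/fp64_divisor of the FP32 rate

    def pixel_rate_gpixel_s(self) -> float:
        return self.rops * self.boost_mhz / 1000.0

    def texture_rate_gtexel_s(self) -> float:
        return self.tmus * self.boost_mhz / 1000.0

    def fp32_tflops(self) -> float:
        # 2 operations per shader per clock (fused multiply-add).
        return self.shaders * 2 * self.boost_mhz / 1e6

    def fp64_gflops(self) -> float:
        return self.fp32_tflops() * 1000.0 / self.fp64_divisor


vega = Gpu("Pro Vega 64", shaders=4096, tmus=256, rops=64,
           boost_mhz=1350, fp64_divisor=16)
a4000 = Gpu("RTX A4000", shaders=6144, tmus=192, rops=96,
            boost_mhz=1560, fp64_divisor=64)

for gpu in (vega, a4000):
    print(f"{gpu.name}: "
          f"{gpu.pixel_rate_gpixel_s():.1f} GPixel/s, "
          f"{gpu.texture_rate_gtexel_s():.1f} GTexel/s, "
          f"{gpu.fp32_tflops():.2f} TFLOPS FP32, "
          f"{gpu.fp64_gflops():.1f} GFLOPS FP64")
```

These are peak figures at boost clock, so they say nothing about sustained performance in a real workload.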
AI/RT
RT Cores
—
48
Tensor Cores
—
192
Power
TDP
250 W
140 W
TDP (W)
250
140
-44.0%
Suggested PSU
—
300 W
Power Connectors
None
1x 6-pin
Architecture
Architecture
GCN 5.0
Ampere
GPU Name
Vega 10
GA104
Generation
Radeon Pro Mac (Vega Series)
Workstation Ampere (Ax000)
Process Size
14 nm
8 nm
Transistors
12,500 million
17,400 million
Die Size
495 mm²
392 mm²
Foundry
GlobalFoundries
Samsung
Density
25.3M / mm²
44.4M / mm²
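The density row is simply the transistor count divided by the die area. A brief check against the figures above (the function name is an assumption for illustration):

```python
def density_m_per_mm2(transistors_millions: float, die_area_mm2: float) -> float:
    """Transistor density in millions of transistors per square millimetre."""
    return transistors_millions / die_area_mm2


# Vega 10: 12,500 million transistors on a 495 mm² die.
vega_density = density_m_per_mm2(12_500, 495)
# GA104: 17,400 million transistors on a 392 mm² die.
ga104_density = density_m_per_mm2(17_400, 392)

print(f"Vega 10: {vega_density:.1f}M / mm²")
print(f"GA104:   {ga104_density:.1f}M / mm²")
```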
API Support
DirectX
12 (12_1)
12 Ultimate (12_2)
OpenGL
4.6
4.6
Vulkan
1.3
1.4
OpenCL
2.1
3.0
CUDA
—
8.6
Shader Model
6.7
6.8
Physical
Slot Width
IGP
Single-slot
Length
—
241 mm (9.5 inches)
Height
—
112 mm (4.4 inches)
Outputs
Portable Device Dependent
4x DisplayPort 1.4a
Bus Interface
PCIe 3.0 x16
PCIe 4.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
—
Quadro Turing
Successor
—
Workstation Ada