GPU Comparison
SPECIFICATION | AMD Radeon Pro 555X | NVIDIA RTX A4000
GPU | Polaris 21 | GA104
VRAM | 4 GB | 16 GB
Clock Speed | 907 MHz | 1560 MHz (boost)
TDP | 75 W | 140 W
Bus Width | 128 bit | 256 bit
Architecture | GCN 4.0 | Ampere
Process | 14 nm | 8 nm
Launch Date | 2018 | 2021
PERFORMANCE BENCHMARKS
Geekbench: Metal, OpenCL, Vulkan
3DMark: Steel Nomad (DX12)
PassMark: DirectX 9, DirectX 10, DirectX 11, DirectX 12, G2D, G3D, GPU Compute
DETAILED SPECIFICATIONS
SPECIFICATION | Pro 555X | RTX A4000 | DIFFERENCE
Core Specs
Shading Units | 768 | 6,144 | +700.0%
TMUs | 48 | 192 | +300.0%
ROPs | 16 | 96 | +500.0%
Compute Units | 12 | n/a
SM Count | n/a | 48
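The percentage figures listed alongside the unit counts can be reproduced with a small calculation. This is a minimal sketch, assuming each percentage is the RTX A4000 value measured relative to the Pro 555X value; the helper name `percent_diff` is illustrative and does not come from the source page.

```python
def percent_diff(baseline: float, other: float) -> float:
    """Relative change of `other` versus `baseline`, in percent."""
    return (other - baseline) / baseline * 100.0


# Unit counts taken from the table: (Pro 555X, RTX A4000).
core_specs = {
    "Shading Units": (768, 6144),
    "TMUs": (48, 192),
    "ROPs": (16, 96),
}

for name, (pro_555x, rtx_a4000) in core_specs.items():
    # Assumption: the Pro 555X is the baseline for every row.
    print(f"{name}: {percent_diff(pro_555x, rtx_a4000):+.1f}%")
```

Under the same assumption, the function also matches the memory size (+300.0%) and TDP (+86.7%) figures elsewhere in the table.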
Clocks
Base Clock | n/a | 735 MHz
Boost Clock | n/a | 1560 MHz
GPU Clock | 907 MHz | n/a
Memory Clock | 1470 MHz (5.9 Gbps effective) | 1750 MHz (14 Gbps effective)
Memory
Memory Size | 4 GB (4,096 MB) | 16 GB (16,384 MB) | +300.0%
Memory Type | GDDR5 | GDDR6
Memory Bus | 128 bit | 256 bit
Bandwidth | 94.08 GB/s | 448.0 GB/s
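The bandwidth figures follow from the bus width and the effective per-pin data rate. The sketch below assumes the listed GDDR5 clock is multiplied by 4 and the GDDR6 clock by 8 to get the effective rate; those multipliers are inferred from the table's own "effective" values, and `bandwidth_gb_s` is a hypothetical helper name.

```python
def bandwidth_gb_s(bus_width_bits: int, effective_gbps: float) -> float:
    """Peak bandwidth in GB/s: per-pin rate times bus width, over 8 bits per byte."""
    return bus_width_bits * effective_gbps / 8


# Pro 555X: GDDR5 at 1470 MHz, assumed 4 transfers per listed clock (5.88 Gbps).
pro_555x = bandwidth_gb_s(128, 1470 * 4 / 1000)

# RTX A4000: GDDR6 at 1750 MHz, assumed 8 transfers per listed clock (14 Gbps).
rtx_a4000 = bandwidth_gb_s(256, 1750 * 8 / 1000)

print(f"Pro 555X:  {pro_555x:.2f} GB/s")
print(f"RTX A4000: {rtx_a4000:.1f} GB/s")
```

The RTX A4000 has twice the bus width and roughly 2.4 times the per-pin rate, which together account for its nearly fivefold bandwidth advantage.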
Cache
L1 Cache | 16 KB (per CU) | 128 KB (per SM)
L2 Cache | 1024 KB | 4 MB
Performance
Pixel Rate | 14.51 GPixel/s | 149.8 GPixel/s
Texture Rate | 43.54 GTexel/s | 299.5 GTexel/s
FP32 | 1.393 TFLOPS | 19.17 TFLOPS
FP64 | 87.07 GFLOPS (1:16) | 299.5 GFLOPS (1:64)
FP16 | 1.393 TFLOPS (1:1) | 19.17 TFLOPS (1:1)
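These theoretical rates can be derived from the unit counts and clocks listed earlier. The sketch below assumes one pixel per ROP per clock, one texel per TMU per clock, and two FP32 operations (one fused multiply-add) per shading unit per clock. It uses the 907 MHz GPU clock for the Pro 555X and the 1560 MHz boost clock for the RTX A4000. The function names are illustrative only.

```python
def pixel_rate(rops: int, clock_mhz: float) -> float:
    """Peak fill rate in GPixel/s, assuming one pixel per ROP per clock."""
    return rops * clock_mhz / 1000


def texture_rate(tmus: int, clock_mhz: float) -> float:
    """Peak texture rate in GTexel/s, assuming one texel per TMU per clock."""
    return tmus * clock_mhz / 1000


def fp32_gflops(shaders: int, clock_mhz: float) -> float:
    """Peak FP32 throughput in GFLOPS, assuming 2 FLOPs (one FMA) per shader per clock."""
    return shaders * 2 * clock_mhz / 1000


# (shading units, TMUs, ROPs, clock in MHz, FP64 ratio denominator) from the table.
cards = {
    "Pro 555X": (768, 48, 16, 907, 16),
    "RTX A4000": (6144, 192, 96, 1560, 64),
}

for name, (shaders, tmus, rops, clock, fp64_div) in cards.items():
    fp32 = fp32_gflops(shaders, clock)
    print(name)
    print(f"  Pixel rate:   {pixel_rate(rops, clock):.2f} GPixel/s")
    print(f"  Texture rate: {texture_rate(tmus, clock):.2f} GTexel/s")
    print(f"  FP32:         {fp32 / 1000:.3f} TFLOPS")
    print(f"  FP64:         {fp32 / fp64_div:.2f} GFLOPS")
```

These are peak figures under the stated assumptions; sustained throughput in real workloads depends on clocks actually reached, memory bandwidth, and occupancy.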
AI/RT
RT Cores | n/a | 48
Tensor Cores | n/a | 192
Power
TDP | 75 W | 140 W | +86.7%
Suggested PSU | n/a | 300 W
Power Connectors | None | 1x 6-pin
Architecture
Architecture | GCN 4.0 | Ampere
GPU Name | Polaris 21 | GA104
Generation | Radeon Pro Mac (500X Series) | Workstation Ampere (Ax000)
Process Size | 14 nm | 8 nm
Transistors | 3,000 million | 17,400 million
Die Size | 123 mm² | 392 mm²
Foundry | GlobalFoundries | Samsung
Density | 24.4M / mm² | 44.4M / mm²
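The density figures are the transistor count divided by the die area. A minimal sketch using the values above; `density_m_per_mm2` is a hypothetical helper name.

```python
def density_m_per_mm2(transistors_millions: float, die_area_mm2: float) -> float:
    """Transistor density in millions of transistors per square millimetre."""
    return transistors_millions / die_area_mm2


# (transistors in millions, die size in mm^2) from the table.
dies = {
    "Polaris 21 (14 nm)": (3_000, 123),
    "GA104 (8 nm)": (17_400, 392),
}

for name, (transistors, area) in dies.items():
    print(f"{name}: {density_m_per_mm2(transistors, area):.1f}M / mm²")
```

By these numbers GA104 packs about 1.8 times as many transistors per unit area as Polaris 21.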
API Support
DirectX | 12 (12_0) | 12 Ultimate (12_2)
OpenGL | 4.6 | 4.6
Vulkan | 1.3 | 1.4
OpenCL | 2.1 | 3.0
CUDA | n/a | 8.6
Shader Model | 6.7 | 6.8
Physical
Slot Width | IGP | Single-slot
Length | n/a | 241 mm (9.5 inches)
Height | n/a | 112 mm (4.4 inches)
Outputs | Portable Device Dependent | 4x DisplayPort 1.4a
Bus Interface | PCIe 3.0 x8 | PCIe 4.0 x16
Other
Production | End-of-life | End-of-life
Predecessor | n/a | Quadro Turing
Successor | n/a | Workstation Ada