GPU Comparison
RADEON
AMD Radeon Pro Vega 64
CORE STATE
Vega 10
VRAM
16 GB
CLOCK SPEED
1350 MHz
TDP
250 W
BUS WIDTH
2048 bit
ARCHITECTURE
GCN 5.0
PROCESS
14 nm
LAUNCH DATE
2017
VS
GEFORCE
H200 NVL
CORE STATE
GH100
VRAM
141 GB
CLOCK SPEED
1785 MHz
TDP
600 W
BUS WIDTH
6144 bit
ARCHITECTURE
Hopper
PROCESS
5 nm
LAUNCH DATE
2024
PERFORMANCE BENCHMARKS
geekbench_metal
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
Pro Vega 64
H200 NVL
Core Specs
Shading Units
4,096
16,896
+312.5%
Shaders
4,096
16,896
+312.5%
TMUs
256
528
+106.3%
ROPs
64
24
-62.5%
Compute Units
64
—
SM Count
—
132
Clocks
Base Clock
1250 MHz
1365 MHz
Boost Clock
1350 MHz
1785 MHz
Memory Clock
786 MHz
1572 Mbps effective
1593 MHz
6.4 Gbps effective
Memory
Memory Size
16 GB
141 GB
VRAM (MB)
16,384
144,384
+781.3%
Memory Type
HBM2
HBM3e
Memory Bus
2048 bit
6144 bit
Bandwidth
402.4 GB/s
4.89 TB/s
Cache
L1 Cache
16 KB (per CU)
256 KB (per SM)
L2 Cache
4 MB
50 MB
Performance
Pixel Rate
86.40 GPixel/s
42.84 GPixel/s
Texture Rate
345.6 GTexel/s
942.5 GTexel/s
FP32 (TFLOPS)
11.06 TFLOPS
60.32 TFLOPS
FP64 (TFLOPS)
691.2 GFLOPS (1:16)
30.16 TFLOPS (1:2)
FP16 (TFLOPS)
22.12 TFLOPS (2:1)
241.3 TFLOPS (4:1)
AI/RT
Tensor Cores
—
528
Power
TDP
250 W
600 W
TDP (W)
250
600
+140.0%
Suggested PSU
—
1000 W
Power Connectors
None
8-pin EPS
Architecture
Architecture
GCN 5.0
Hopper
GPU Name
Vega 10
GH100
Generation
Radeon Pro Mac
(Vega Series)
Server Hopper
(Hxx)
Process Size
14 nm
5 nm
Transistors
12,500 million
80,000 million
Die Size
495 mm²
814 mm²
Foundry
GlobalFoundries
TSMC
Density
25.3M / mm²
98.3M / mm²
API Support
DirectX
12 (12_1)
—
OpenGL
4.6
—
Vulkan
1.3
—
OpenCL
2.1
3.0
CUDA
—
9.0
Shader Model
6.7
—
Physical
Slot Width
IGP
Dual-slot
Length
—
267 mm
10.5 inches
Height
—
111 mm
4.4 inches
Outputs
Portable Device Dependent
No outputs
Bus Interface
PCIe 3.0 x16
PCIe 5.0 x16
Other
Production
End-of-life
Active
Predecessor
—
Server Ada
Successor
—
Server Blackwell