GPU Comparison
GEFORCE
NVIDIA A10M
CORE STATE
GA102
VRAM
20 GB
CLOCK SPEED
1635 MHz
TDP
150 W
BUS WIDTH
320 bit
ARCHITECTURE
Ampere
PROCESS
8 nm
LAUNCH DATE
—
VS
GEFORCE
H200 NVL
CORE STATE
GH100
VRAM
141 GB
CLOCK SPEED
1785 MHz
TDP
600 W
BUS WIDTH
6144 bit
ARCHITECTURE
Hopper
PROCESS
5 nm
LAUNCH DATE
2024
PERFORMANCE BENCHMARKS
geekbench_opencl
DETAILED SPECIFICATIONS
SPECIFICATION
A10M
H200 NVL
Core Specs
Shading Units
7,168
16,896
+135.7%
Shaders
7,168
16,896
+135.7%
TMUs
224
528
+135.7%
ROPs
80
24
-70.0%
SM Count
56
132
+135.7%
Clocks
Base Clock
975 MHz
1365 MHz
Boost Clock
1635 MHz
1785 MHz
Memory Clock
1563 MHz
12.5 Gbps effective
1593 MHz
6.4 Gbps effective
Memory
Memory Size
20 GB
141 GB
VRAM (MB)
20,480
144,384
+605.0%
Memory Type
GDDR6
HBM3e
Memory Bus
320 bit
6144 bit
Bandwidth
500.2 GB/s
4.89 TB/s
Cache
L1 Cache
128 KB (per SM)
256 KB (per SM)
L2 Cache
6 MB
50 MB
Performance
Pixel Rate
130.8 GPixel/s
42.84 GPixel/s
Texture Rate
366.2 GTexel/s
942.5 GTexel/s
FP32 (TFLOPS)
23.44 TFLOPS
60.32 TFLOPS
FP64 (TFLOPS)
732.5 GFLOPS (1:32)
30.16 TFLOPS (1:2)
FP16 (TFLOPS)
23.44 TFLOPS (1:1)
241.3 TFLOPS (4:1)
AI/RT
RT Cores
56
—
Tensor Cores
224
528
+135.7%
Power
TDP
150 W
600 W
TDP (W)
150
600
+300.0%
Suggested PSU
450 W
1000 W
Power Connectors
8-pin EPS
8-pin EPS
Architecture
Architecture
Ampere
Hopper
GPU Name
GA102
GH100
Generation
Server Ampere
(Axx)
Server Hopper
(Hxx)
Process Size
8 nm
5 nm
Transistors
28,300 million
80,000 million
Die Size
628 mm²
814 mm²
Foundry
Samsung
TSMC
Density
45.1M / mm²
98.3M / mm²
API Support
DirectX
12 Ultimate (12_2)
—
OpenGL
4.6
—
Vulkan
1.4
—
OpenCL
3.0
3.0
CUDA
8.6
9.0
Shader Model
6.8
—
Physical
Slot Width
Single-slot
Dual-slot
Length
267 mm
10.5 inches
267 mm
10.5 inches
Height
112 mm
4.4 inches
111 mm
4.4 inches
Outputs
No outputs
No outputs
Bus Interface
PCIe 4.0 x16
PCIe 5.0 x16
Other
Production
End-of-life
Active
Predecessor
Tesla Turing
Server Ada
Successor
Server Ada
Server Blackwell