GPU Comparison
GEFORCE
NVIDIA CMP 70HX
CORE STATE
GA104
VRAM
8 GB
CLOCK SPEED
1395 MHz
TDP
—
BUS WIDTH
256 bit
ARCHITECTURE
Ampere
PROCESS
8 nm
LAUNCH DATE
—
VS
GEFORCE
L40
CORE STATE
AD102
VRAM
48 GB
CLOCK SPEED
2490 MHz
TDP
300 W
BUS WIDTH
384 bit
ARCHITECTURE
Ada Lovelace
PROCESS
5 nm
LAUNCH DATE
2022
PERFORMANCE BENCHMARKS
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
CMP 70HX
L40
Core Specs
Shading Units
3,840
18,176
+373.3%
Shaders
3,840
18,176
+373.3%
TMUs
120
568
+373.3%
ROPs
64
192
+200.0%
SM Count
30
142
+373.3%
Clocks
Base Clock
1365 MHz
735 MHz
Boost Clock
1395 MHz
2490 MHz
Memory Clock
1188 MHz
19 Gbps effective
2250 MHz
18 Gbps effective
Memory
Memory Size
8 GB
48 GB
VRAM (MB)
8,192
49,152
+500.0%
Memory Type
GDDR6X
GDDR6
Memory Bus
256 bit
384 bit
Bandwidth
608.3 GB/s
864.0 GB/s
Cache
L1 Cache
128 KB (per SM)
128 KB (per SM)
L2 Cache
4 MB
96 MB
Performance
Pixel Rate
89.28 GPixel/s
478.1 GPixel/s
Texture Rate
167.4 GTexel/s
1,414.3 GTexel/s
FP32 (TFLOPS)
10.71 TFLOPS
90.52 TFLOPS
FP64 (TFLOPS)
167.4 GFLOPS (1:64)
1,414.3 GFLOPS (1:64)
FP16 (TFLOPS)
10.71 TFLOPS (1:1)
90.52 TFLOPS (1:1)
AI/RT
RT Cores
30
142
+373.3%
Tensor Cores
120
568
+373.3%
Power
TDP
—
300 W
TDP (W)
—
300
Suggested PSU
200 W
700 W
Power Connectors
1x 12-pin
1x 16-pin
Architecture
Architecture
Ampere
Ada Lovelace
GPU Name
GA104
AD102
Generation
Mining GPUs
Server Ada
(Lxx)
Process Size
8 nm
5 nm
Transistors
17,400 million
76,300 million
Die Size
392 mm²
609 mm²
Foundry
Samsung
TSMC
Density
44.4M / mm²
125.3M / mm²
API Support
DirectX
12 Ultimate (12_2)
12 Ultimate (12_2)
OpenGL
4.6
4.6
Vulkan
1.4
1.4
OpenCL
3.0
3.0
CUDA
8.6
8.9
Shader Model
6.8
6.8
Physical
Slot Width
Dual-slot
Dual-slot
Length
267 mm
10.5 inches
267 mm
10.5 inches
Height
112 mm
4.4 inches
111 mm
4.4 inches
Outputs
No outputs
4x DisplayPort 1.4a
Bus Interface
PCIe 1.0 x4
PCIe 4.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
—
Server Ampere
Successor
—
Server Hopper