GPU Comparison
GEFORCE
NVIDIA CMP 40HX
CORE STATE
TU106
VRAM
8 GB
CLOCK SPEED
1650 MHz
TDP
185 W
BUS WIDTH
256 bit
ARCHITECTURE
Turing
PROCESS
12 nm
LAUNCH DATE
2021
VS
GEFORCE
L20
CORE STATE
AD102
VRAM
48 GB
CLOCK SPEED
2520 MHz
TDP
275 W
BUS WIDTH
384 bit
ARCHITECTURE
Ada Lovelace
PROCESS
5 nm
LAUNCH DATE
2023
PERFORMANCE BENCHMARKS
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
CMP 40HX
L20
Core Specs
Shading Units
2,304
11,776
+411.1%
Shaders
2,304
11,776
+411.1%
TMUs
144
368
+155.6%
ROPs
64
128
+100.0%
SM Count
36
92
+155.6%
Clocks
Base Clock
1470 MHz
1440 MHz
Boost Clock
1650 MHz
2520 MHz
Memory Clock
1750 MHz
14 Gbps effective
2250 MHz
18 Gbps effective
Memory
Memory Size
8 GB
48 GB
VRAM (MB)
8,192
49,152
+500.0%
Memory Type
GDDR6
GDDR6
Memory Bus
256 bit
384 bit
Bandwidth
448.0 GB/s
864.0 GB/s
Cache
L1 Cache
64 KB (per SM)
128 KB (per SM)
L2 Cache
4 MB
96 MB
Performance
Pixel Rate
105.6 GPixel/s
322.6 GPixel/s
Texture Rate
237.6 GTexel/s
927.4 GTexel/s
FP32 (TFLOPS)
7.603 TFLOPS
59.35 TFLOPS
FP64 (TFLOPS)
237.6 GFLOPS (1:32)
927.4 GFLOPS (1:64)
FP16 (TFLOPS)
15.21 TFLOPS (2:1)
59.35 TFLOPS (1:1)
AI/RT
RT Cores
36
92
+155.6%
Tensor Cores
288
368
+27.8%
Power
TDP
185 W
275 W
TDP (W)
185
275
+48.6%
Suggested PSU
450 W
600 W
Power Connectors
1x 8-pin
1x 16-pin
Architecture
Architecture
Turing
Ada Lovelace
GPU Name
TU106
AD102
Generation
Mining GPUs
Server Ada
(Lxx)
Process Size
12 nm
5 nm
Transistors
10,800 million
76,300 million
Die Size
445 mm²
609 mm²
Foundry
TSMC
TSMC
Density
24.3M / mm²
125.3M / mm²
API Support
DirectX
12 Ultimate (12_2)
12 Ultimate (12_2)
OpenGL
4.6
4.6
Vulkan
1.4
1.4
OpenCL
3.0
3.0
CUDA
7.5
8.9
Shader Model
6.8
6.8
Physical
Slot Width
Dual-slot
Dual-slot
Length
229 mm
9 inches
267 mm
10.5 inches
Height
111 mm
4.4 inches
111 mm
4.4 inches
Outputs
No outputs
4x DisplayPort 1.4a
Bus Interface
PCIe 1.0 x4
PCIe 4.0 x16
Other
Launch Price
699 USD
—
Production
End-of-life
Active
Predecessor
—
Server Ampere
Successor
—
Server Hopper