GPU Comparison
NVIDIA CMP 40HX
GPU NAME: TU106
VRAM: 8 GB
CLOCK SPEED: 1650 MHz (boost)
TDP: 185 W
BUS WIDTH: 256 bit
ARCHITECTURE: Turing
PROCESS: 12 nm
LAUNCH DATE: 2021
VS
NVIDIA P104-100
GPU NAME: GP104
VRAM: 4 GB
CLOCK SPEED: 1733 MHz (boost)
TDP: —
BUS WIDTH: 256 bit
ARCHITECTURE: Pascal
PROCESS: 16 nm
LAUNCH DATE: 2017
PERFORMANCE BENCHMARKS
Benchmarks: Geekbench OpenCL, Geekbench Vulkan, 3DMark Steel Nomad (DX12)
DETAILED SPECIFICATIONS
SPECIFICATION | CMP 40HX | P104-100 | DIFFERENCE (P104-100 vs CMP 40HX)
Core Specs
Shading Units | 2,304 | 1,920 | -16.7%
TMUs | 144 | 120 | -16.7%
ROPs | 64 | 64 | 0.0%
SM Count | 36 | 15 | -58.3%
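The percentage column appears to express the P104-100 value relative to the CMP 40HX value. A minimal sketch of that arithmetic follows; the function name `percent_difference` and the `core_specs` dictionary are illustrative, not taken from the source.

```python
def percent_difference(baseline: float, other: float) -> float:
    """Relative change of `other` versus `baseline`, in percent."""
    return (other - baseline) / baseline * 100.0


# (CMP 40HX, P104-100) pairs copied from the Core Specs rows.
core_specs = {
    "Shading Units": (2304, 1920),
    "TMUs": (144, 120),
    "ROPs": (64, 64),
    "SM Count": (36, 15),
}

for name, (cmp_40hx, p104_100) in core_specs.items():
    print(f"{name}: {percent_difference(cmp_40hx, p104_100):+.1f}%")
```

Rounded to one decimal place, the results agree with the percentages listed for these four rows.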
Clocks
Base Clock | 1470 MHz | 1607 MHz
Boost Clock | 1650 MHz | 1733 MHz
Memory Clock | 1750 MHz (14 Gbps effective) | 1251 MHz (10 Gbps effective)
Memory
Memory Size | 8 GB (8,192 MB) | 4 GB (4,096 MB) | -50.0%
Memory Type | GDDR6 | GDDR5X
Memory Bus | 256 bit | 256 bit
Bandwidth | 448.0 GB/s | 320.3 GB/s
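The bandwidth figures can be reproduced from the bus width and the memory clock. The sketch below assumes the effective data rate is 8 times the listed memory clock for both cards, which is inferred from the listed pairs (1750 MHz with 14 Gbps, 1251 MHz with roughly 10 Gbps) rather than stated in the source; the function name and the `transfers_per_clock` parameter are illustrative.

```python
def memory_bandwidth_gb_s(bus_width_bits: int,
                          memory_clock_mhz: float,
                          transfers_per_clock: int = 8) -> float:
    """Peak memory bandwidth in GB/s.

    transfers_per_clock=8 is an assumption inferred from the listed
    clock / effective-rate pairs, not a value given by the source.
    """
    effective_gbps = memory_clock_mhz * transfers_per_clock / 1000.0  # per pin
    return bus_width_bits / 8 * effective_gbps  # 8 bits per byte


print(f"CMP 40HX: {memory_bandwidth_gb_s(256, 1750):.1f} GB/s")
print(f"P104-100: {memory_bandwidth_gb_s(256, 1251):.1f} GB/s")
```

Under that assumption the two results round to the 448.0 GB/s and 320.3 GB/s listed in the Bandwidth row.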
Cache
L1 Cache | 64 KB (per SM) | 48 KB (per SM)
L2 Cache | 4 MB | 2 MB
Performance
Pixel Rate | 105.6 GPixel/s | 110.9 GPixel/s
Texture Rate | 237.6 GTexel/s | 208.0 GTexel/s
FP32 | 7.603 TFLOPS | 6.655 TFLOPS
FP64 | 237.6 GFLOPS (1:32) | 208.0 GFLOPS (1:32)
FP16 | 15.21 TFLOPS (2:1) | 104.0 GFLOPS (1:64)
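The theoretical rates in this section follow from the unit counts and the boost clock. The sketch below uses the usual peak-rate conventions (one pixel per ROP per clock, one texel per TMU per clock, two FP32 operations per shader per clock) and applies the FP64 and FP16 ratios shown in parentheses in the rows above; the function name `throughput` and its dictionary keys are illustrative.

```python
def throughput(shaders: int, tmus: int, rops: int, boost_mhz: float,
               fp64_ratio: float, fp16_ratio: float) -> dict:
    """Theoretical peak rates derived from unit counts and boost clock."""
    ghz = boost_mhz / 1000.0
    # Two FP32 operations per shader per clock (one fused multiply-add).
    fp32_gflops = shaders * 2 * ghz
    return {
        "pixel_gpixel_s": rops * ghz,
        "texture_gtexel_s": tmus * ghz,
        "fp32_tflops": fp32_gflops / 1000.0,
        "fp64_gflops": fp32_gflops * fp64_ratio,
        "fp16_gflops": fp32_gflops * fp16_ratio,
    }


# Inputs copied from the Core Specs and Clocks rows; ratios from this section.
cmp_40hx = throughput(2304, 144, 64, 1650, fp64_ratio=1 / 32, fp16_ratio=2)
p104_100 = throughput(1920, 120, 64, 1733, fp64_ratio=1 / 32, fp16_ratio=1 / 64)

for name, card in (("CMP 40HX", cmp_40hx), ("P104-100", p104_100)):
    print(name, {key: round(value, 3) for key, value in card.items()})
```

The computed values match the listed figures to the precision shown, including the large FP16 gap that comes from the 2:1 versus 1:64 ratio.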
AI/RT
RT Cores | 36 | —
Tensor Cores | 288 | —
Power
TDP | 185 W | —
Suggested PSU | 450 W | 200 W
Power Connectors | 1x 8-pin | 1x 8-pin
Architecture
Architecture | Turing | Pascal
GPU Name | TU106 | GP104
Generation | Mining GPUs | Mining GPUs
Process Size | 12 nm | 16 nm
Transistors | 10,800 million | 7,200 million
Die Size | 445 mm² | 314 mm²
Foundry | TSMC | TSMC
Density | 24.3M / mm² | 22.9M / mm²
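The density figures are the transistor count divided by the die area. A small sketch of that check, with an illustrative function name:

```python
def transistor_density_m_per_mm2(transistors_million: float,
                                 die_area_mm2: float) -> float:
    """Transistor density in millions of transistors per square millimetre."""
    return transistors_million / die_area_mm2


print(f"TU106: {transistor_density_m_per_mm2(10_800, 445):.1f}M / mm²")
print(f"GP104: {transistor_density_m_per_mm2(7_200, 314):.1f}M / mm²")
```

Both results round to the listed values of 24.3M / mm² and 22.9M / mm².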
API Support
DirectX | 12 Ultimate (12_2) | 12 (12_1)
OpenGL | 4.6 | 4.6
Vulkan | 1.4 | 1.4
OpenCL | 3.0 | 3.0
CUDA Compute Capability | 7.5 | 6.1
Shader Model | 6.8 | 6.8
Physical
Slot Width | Dual-slot | Dual-slot
Length | 229 mm (9 inches) | 267 mm (10.5 inches)
Height | 111 mm (4.4 inches) | —
Outputs | No outputs | No outputs
Bus Interface | PCIe 1.0 x4 | PCIe 1.0 x4
Other
Launch Price | 699 USD | —
Production | End-of-life | End-of-life