GPU Comparison
GEFORCE
NVIDIA CMP 40HX
CORE STATE
TU106
VRAM
8 GB
CLOCK SPEED
1650 MHz
TDP
185 W
BUS WIDTH
256 bit
ARCHITECTURE
Turing
PROCESS
12 nm
LAUNCH DATE
2021
VS
GEFORCE
Quadro 4000M
CORE STATE
GF104
VRAM
2 GB
CLOCK SPEED
—
TDP
100 W
BUS WIDTH
256 bit
ARCHITECTURE
Fermi
PROCESS
40 nm
LAUNCH DATE
2011
PERFORMANCE BENCHMARKS
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
CMP 40HX
Quadro 4000M
Core Specs
Shading Units
2,304
336
-85.4%
Shaders
2,304
336
-85.4%
TMUs
144
56
-61.1%
ROPs
64
32
-50.0%
SM Count
36
7
-80.6%
Clocks
Base Clock
1470 MHz
—
Boost Clock
1650 MHz
—
GPU Clock
—
475 MHz
Shader Clock
—
950 MHz
Memory Clock
1750 MHz
14 Gbps effective
625 MHz
2.5 Gbps effective
Memory
Memory Size
8 GB
2 GB
VRAM (MB)
8,192
2,048
-75.0%
Memory Type
GDDR6
GDDR5
Memory Bus
256 bit
256 bit
Bandwidth
448.0 GB/s
80.00 GB/s
Cache
L1 Cache
64 KB (per SM)
64 KB (per SM)
L2 Cache
4 MB
512 KB
Performance
Pixel Rate
105.6 GPixel/s
6.650 GPixel/s
Texture Rate
237.6 GTexel/s
26.60 GTexel/s
FP32 (TFLOPS)
7.603 TFLOPS
638.4 GFLOPS
FP64 (TFLOPS)
237.6 GFLOPS (1:32)
53.20 GFLOPS (1:12)
FP16 (TFLOPS)
15.21 TFLOPS (2:1)
—
AI/RT
RT Cores
36
—
Tensor Cores
288
—
Power
TDP
185 W
100 W
TDP (W)
185
100
-45.9%
Suggested PSU
450 W
—
Power Connectors
1x 8-pin
None
Architecture
Architecture
Turing
Fermi
GPU Name
TU106
GF104
Generation
Mining GPUs
Quadro Fermi-M
(x000M)
Process Size
12 nm
40 nm
Transistors
10,800 million
1,950 million
Die Size
445 mm²
332 mm²
Foundry
TSMC
TSMC
Density
24.3M / mm²
5.9M / mm²
API Support
DirectX
12 Ultimate (12_2)
12 (11_0)
OpenGL
4.6
4.6
Vulkan
1.4
—
OpenCL
3.0
1.1
CUDA
7.5
2.1
Shader Model
6.8
5.1
Physical
Slot Width
Dual-slot
MXM Module
Length
229 mm
9 inches
—
Height
111 mm
4.4 inches
—
Outputs
No outputs
Portable Device Dependent
Bus Interface
PCIe 1.0 x4
MXM-B (3.0)
Other
Launch Price
699 USD
—
Production
End-of-life
End-of-life
Predecessor
—
Quadro FX Mobile
Successor
—
Quadro Kepler-M