GPU Comparison
GEFORCE
NVIDIA CMP 40HX
CORE STATE
TU106
VRAM
8 GB
CLOCK SPEED
1650 MHz
TDP
185 W
BUS WIDTH
256 bit
ARCHITECTURE
Turing
PROCESS
12 nm
LAUNCH DATE
2021
VS
GEFORCE
Tesla M2090
CORE STATE
GF110
VRAM
6 GB
CLOCK SPEED
—
TDP
250 W
BUS WIDTH
384 bit
ARCHITECTURE
Fermi 2.0
PROCESS
40 nm
LAUNCH DATE
2011
PERFORMANCE BENCHMARKS
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
CMP 40HX
Tesla M2090
Core Specs
Shading Units
2,304
512
-77.8%
Shaders
2,304
512
-77.8%
TMUs
144
64
-55.6%
ROPs
64
48
-25.0%
SM Count
36
16
-55.6%
Clocks
Base Clock
1470 MHz
—
Boost Clock
1650 MHz
—
GPU Clock
—
651 MHz
Shader Clock
—
1301 MHz
Memory Clock
1750 MHz
14 Gbps effective
924 MHz
3.7 Gbps effective
Memory
Memory Size
8 GB
6 GB
VRAM (MB)
8,192
6,144
-25.0%
Memory Type
GDDR6
GDDR5
Memory Bus
256 bit
384 bit
Bandwidth
448.0 GB/s
177.4 GB/s
Cache
L1 Cache
64 KB (per SM)
64 KB (per SM)
L2 Cache
4 MB
768 KB
Performance
Pixel Rate
105.6 GPixel/s
20.83 GPixel/s
Texture Rate
237.6 GTexel/s
41.66 GTexel/s
FP32 (TFLOPS)
7.603 TFLOPS
1,332.2 GFLOPS
FP64 (TFLOPS)
237.6 GFLOPS (1:32)
666.1 GFLOPS (1:2)
FP16 (TFLOPS)
15.21 TFLOPS (2:1)
—
AI/RT
RT Cores
36
—
Tensor Cores
288
—
Power
TDP
185 W
250 W
TDP (W)
185
250
+35.1%
Suggested PSU
450 W
600 W
Power Connectors
1x 8-pin
1x 6-pin + 1x 8-pin
Architecture
Architecture
Turing
Fermi 2.0
GPU Name
TU106
GF110
Generation
Mining GPUs
Tesla Fermi
(x20xx)
Process Size
12 nm
40 nm
Transistors
10,800 million
3,000 million
Die Size
445 mm²
520 mm²
Foundry
TSMC
TSMC
Density
24.3M / mm²
5.8M / mm²
API Support
DirectX
12 Ultimate (12_2)
12 (11_0)
OpenGL
4.6
4.6
Vulkan
1.4
—
OpenCL
3.0
1.1
CUDA
7.5
2.0
Shader Model
6.8
5.1
Physical
Slot Width
Dual-slot
Dual-slot
Length
229 mm
9 inches
248 mm
9.8 inches
Height
111 mm
4.4 inches
—
Outputs
No outputs
No outputs
Bus Interface
PCIe 1.0 x4
PCIe 2.0 x16
Other
Launch Price
699 USD
—
Production
End-of-life
End-of-life
Predecessor
—
Tesla
Successor
—
Tesla Kepler