GPU Comparison
GEFORCE
NVIDIA CMP 40HX
CORE STATE
TU106
VRAM
8 GB
CLOCK SPEED
1650 MHz
TDP
185 W
BUS WIDTH
256 bit
ARCHITECTURE
Turing
PROCESS
12 nm
LAUNCH DATE
2021
VS
GEFORCE
Tesla T4
CORE STATE
TU104
VRAM
16 GB
CLOCK SPEED
1590 MHz
TDP
70 W
BUS WIDTH
256 bit
ARCHITECTURE
Turing
PROCESS
12 nm
LAUNCH DATE
2018
PERFORMANCE BENCHMARKS
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
CMP 40HX
Tesla T4
Core Specs
Shading Units
2,304
2,560
+11.1%
Shaders
2,304
2,560
+11.1%
TMUs
144
160
+11.1%
ROPs
64
64
0.0%
SM Count
36
40
+11.1%
Clocks
Base Clock
1470 MHz
585 MHz
Boost Clock
1650 MHz
1590 MHz
Memory Clock
1750 MHz
14 Gbps effective
1250 MHz
10 Gbps effective
Memory
Memory Size
8 GB
16 GB
VRAM (MB)
8,192
16,384
+100.0%
Memory Type
GDDR6
GDDR6
Memory Bus
256 bit
256 bit
Bandwidth
448.0 GB/s
320.0 GB/s
Cache
L1 Cache
64 KB (per SM)
64 KB (per SM)
L2 Cache
4 MB
4 MB
Performance
Pixel Rate
105.6 GPixel/s
101.8 GPixel/s
Texture Rate
237.6 GTexel/s
254.4 GTexel/s
FP32 (TFLOPS)
7.603 TFLOPS
8.141 TFLOPS
FP64 (TFLOPS)
237.6 GFLOPS (1:32)
254.4 GFLOPS (1:32)
FP16 (TFLOPS)
15.21 TFLOPS (2:1)
65.13 TFLOPS (8:1)
AI/RT
RT Cores
36
40
+11.1%
Tensor Cores
288
320
+11.1%
Power
TDP
185 W
70 W
TDP (W)
185
70
-62.2%
Suggested PSU
450 W
250 W
Power Connectors
1x 8-pin
None
Architecture
Architecture
Turing
Turing
GPU Name
TU106
TU104
Generation
Mining GPUs
Tesla Turing
(Txx)
Process Size
12 nm
12 nm
Transistors
10,800 million
13,600 million
Die Size
445 mm²
545 mm²
Foundry
TSMC
TSMC
Density
24.3M / mm²
25.0M / mm²
API Support
DirectX
12 Ultimate (12_2)
12 Ultimate (12_2)
OpenGL
4.6
4.6
Vulkan
1.4
1.4
OpenCL
3.0
3.0
CUDA
7.5
7.5
Shader Model
6.8
6.8
Physical
Slot Width
Dual-slot
Single-slot
Length
229 mm
9 inches
168 mm
6.6 inches
Height
111 mm
4.4 inches
—
Outputs
No outputs
No outputs
Bus Interface
PCIe 1.0 x4
PCIe 3.0 x16
Other
Launch Price
699 USD
—
Production
End-of-life
End-of-life
Predecessor
—
Tesla Volta
Successor
—
Server Ampere