GPU Comparison
GEFORCE
NVIDIA CMP 70HX
CORE STATE
GA104
VRAM
8 GB
CLOCK SPEED
1395 MHz
TDP
—
BUS WIDTH
256 bit
ARCHITECTURE
Ampere
PROCESS
8 nm
LAUNCH DATE
—
VS
GEFORCE
Tesla T4
CORE STATE
TU104
VRAM
16 GB
CLOCK SPEED
1590 MHz
TDP
70 W
BUS WIDTH
256 bit
ARCHITECTURE
Turing
PROCESS
12 nm
LAUNCH DATE
2018
PERFORMANCE BENCHMARKS
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
CMP 70HX
Tesla T4
Core Specs
Shading Units
3,840
2,560
-33.3%
Shaders
3,840
2,560
-33.3%
TMUs
120
160
+33.3%
ROPs
64
64
0.0%
SM Count
30
40
+33.3%
Clocks
Base Clock
1365 MHz
585 MHz
Boost Clock
1395 MHz
1590 MHz
Memory Clock
1188 MHz
19 Gbps effective
1250 MHz
10 Gbps effective
Memory
Memory Size
8 GB
16 GB
VRAM (MB)
8,192
16,384
+100.0%
Memory Type
GDDR6X
GDDR6
Memory Bus
256 bit
256 bit
Bandwidth
608.3 GB/s
320.0 GB/s
Cache
L1 Cache
128 KB (per SM)
64 KB (per SM)
L2 Cache
4 MB
4 MB
Performance
Pixel Rate
89.28 GPixel/s
101.8 GPixel/s
Texture Rate
167.4 GTexel/s
254.4 GTexel/s
FP32 (TFLOPS)
10.71 TFLOPS
8.141 TFLOPS
FP64 (TFLOPS)
167.4 GFLOPS (1:64)
254.4 GFLOPS (1:32)
FP16 (TFLOPS)
10.71 TFLOPS (1:1)
65.13 TFLOPS (8:1)
AI/RT
RT Cores
30
40
+33.3%
Tensor Cores
120
320
+166.7%
Power
TDP
—
70 W
TDP (W)
—
70
Suggested PSU
200 W
250 W
Power Connectors
1x 12-pin
None
Architecture
Architecture
Ampere
Turing
GPU Name
GA104
TU104
Generation
Mining GPUs
Tesla Turing
(Txx)
Process Size
8 nm
12 nm
Transistors
17,400 million
13,600 million
Die Size
392 mm²
545 mm²
Foundry
Samsung
TSMC
Density
44.4M / mm²
25.0M / mm²
API Support
DirectX
12 Ultimate (12_2)
12 Ultimate (12_2)
OpenGL
4.6
4.6
Vulkan
1.4
1.4
OpenCL
3.0
3.0
CUDA
8.6
7.5
Shader Model
6.8
6.8
Physical
Slot Width
Dual-slot
Single-slot
Length
267 mm
10.5 inches
168 mm
6.6 inches
Height
112 mm
4.4 inches
—
Outputs
No outputs
No outputs
Bus Interface
PCIe 1.0 x4
PCIe 3.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
—
Tesla Volta
Successor
—
Server Ampere