GPU Comparison
NVIDIA Tesla P4 vs NVIDIA Tesla P40

| Specification | Tesla P4 | Tesla P40 |
|---|---|---|
| GPU Name | GP104 | GP102 |
| VRAM | 8 GB | 24 GB |
| Boost Clock | 1114 MHz | 1531 MHz |
| TDP | 75 W | 250 W |
| Bus Width | 256 bit | 384 bit |
| Architecture | Pascal | Pascal |
| Process | 16 nm | 16 nm |
| Launch Date | 2016 | 2016 |
PERFORMANCE BENCHMARKS
Geekbench OpenCL, Geekbench Vulkan (charts not reproduced)
DETAILED SPECIFICATIONS
Core Specs

| Specification | Tesla P4 | Tesla P40 | Difference |
|---|---|---|---|
| Shading Units | 2,560 | 3,840 | +50.0% |
| TMUs | 160 | 240 | +50.0% |
| ROPs | 64 | 96 | +50.0% |
| SM Count | 20 | 30 | +50.0% |
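The percentage column appears to be the relative change of the Tesla P40 value against the Tesla P4 value. A minimal sketch under that assumption (the function name `percent_difference` is hypothetical, chosen for illustration):

```python
def percent_difference(baseline: float, other: float) -> float:
    """Relative change of `other` versus `baseline`, in percent."""
    return (other - baseline) / baseline * 100.0


# (Tesla P4, Tesla P40) pairs copied from the table.
pairs = {
    "Shading Units": (2560, 3840),
    "TMUs": (160, 240),
    "ROPs": (64, 96),
    "SM Count": (20, 30),
}

for name, (p4, p40) in pairs.items():
    print(f"{name}: {percent_difference(p4, p40):+.1f}%")
```

The same function applied to the memory sizes (8,192 MB and 24,576 MB) and the TDP values (75 W and 250 W) gives the other percentages listed on this page.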
Clocks

| Specification | Tesla P4 | Tesla P40 |
|---|---|---|
| Base Clock | 886 MHz | 1303 MHz |
| Boost Clock | 1114 MHz | 1531 MHz |
| Memory Clock | 1502 MHz (6 Gbps effective) | 1808 MHz (7.2 Gbps effective) |
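Memory

| Specification | Tesla P4 | Tesla P40 | Difference |
|---|---|---|---|
| Memory Size | 8 GB (8,192 MB) | 24 GB (24,576 MB) | +200.0% |
| Memory Type | GDDR5 | GDDR5 | |
| Memory Bus | 256 bit | 384 bit | |
| Bandwidth | 192.3 GB/s | 347.1 GB/s | |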
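The bandwidth figures are consistent with the listed memory clock and bus width, assuming the effective GDDR5 data rate is four times the listed memory clock (which matches the "6 Gbps effective" and "7.2 Gbps effective" values above). A small sketch of that arithmetic, with a hypothetical helper name:

```python
def gddr5_bandwidth_gb_s(memory_clock_mhz: float, bus_width_bits: int) -> float:
    """Peak memory bandwidth in GB/s.

    Assumes the effective per-pin data rate is 4x the listed memory clock,
    as the clock and "effective" figures on this page imply.
    """
    effective_gbps_per_pin = memory_clock_mhz * 4 / 1000.0
    return effective_gbps_per_pin * bus_width_bits / 8  # bits -> bytes


print(f"Tesla P4:  {gddr5_bandwidth_gb_s(1502, 256):.1f} GB/s")
print(f"Tesla P40: {gddr5_bandwidth_gb_s(1808, 384):.1f} GB/s")
```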
Cache

| Specification | Tesla P4 | Tesla P40 |
|---|---|---|
| L1 Cache | 48 KB (per SM) | 48 KB (per SM) |
| L2 Cache | 2 MB | 3 MB |
Performance

| Specification | Tesla P4 | Tesla P40 |
|---|---|---|
| Pixel Rate | 71.30 GPixel/s | 147.0 GPixel/s |
| Texture Rate | 178.2 GTexel/s | 367.4 GTexel/s |
| FP32 | 5.704 TFLOPS | 11.76 TFLOPS |
| FP64 | 178.2 GFLOPS (1:32) | 367.4 GFLOPS (1:32) |
| FP16 | 89.12 GFLOPS (1:64) | 183.7 GFLOPS (1:64) |
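These are theoretical peak rates, and they can be reproduced from the unit counts and boost clocks listed earlier. The sketch below assumes the usual conventions: one pixel per ROP per clock, one texel per TMU per clock, and one fused multiply-add (counted as 2 FLOPs) per shading unit per clock. The function name and dictionary keys are hypothetical.

```python
def theoretical_rates(shaders: int, tmus: int, rops: int, boost_mhz: float) -> dict:
    """Peak rates derived from unit counts and the boost clock."""
    boost_ghz = boost_mhz / 1000.0
    # Assumption: one FMA (2 FLOPs) per shading unit per clock.
    fp32_gflops = 2 * shaders * boost_ghz
    return {
        "pixel_gpixel_s": rops * boost_ghz,
        "texture_gtexel_s": tmus * boost_ghz,
        "fp32_tflops": fp32_gflops / 1000.0,
        "fp64_gflops": fp32_gflops / 32,  # 1:32 ratio from the table
        "fp16_gflops": fp32_gflops / 64,  # 1:64 ratio from the table
    }


p4 = theoretical_rates(shaders=2560, tmus=160, rops=64, boost_mhz=1114)
p40 = theoretical_rates(shaders=3840, tmus=240, rops=96, boost_mhz=1531)

for name, rates in (("Tesla P4", p4), ("Tesla P40", p40)):
    print(name, {key: round(value, 3) for key, value in rates.items()})
```

Sustained throughput in real workloads depends on clocks under load, memory bandwidth, and the workload itself, so these values are upper bounds rather than measurements.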
Power

| Specification | Tesla P4 | Tesla P40 | Difference |
|---|---|---|---|
| TDP | 75 W | 250 W | +233.3% |
| Suggested PSU | 250 W | 600 W | |
| Power Connectors | None | 8-pin EPS | |
Architecture

| Specification | Tesla P4 | Tesla P40 |
|---|---|---|
| Architecture | Pascal | Pascal |
| GPU Name | GP104 | GP102 |
| Generation | Tesla Pascal (Pxx) | Tesla Pascal (Pxx) |
| Process Size | 16 nm | 16 nm |
| Transistors | 7,200 million | 11,800 million |
| Die Size | 314 mm² | 471 mm² |
| Foundry | TSMC | TSMC |
| Density | 22.9M / mm² | 25.1M / mm² |
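The density row is simply the transistor count divided by the die area. A short check using the values above (the helper name is hypothetical):

```python
def transistor_density(transistors_millions: float, die_area_mm2: float) -> float:
    """Transistor density in millions of transistors per square millimetre."""
    return transistors_millions / die_area_mm2


print(f"GP104 (Tesla P4):  {transistor_density(7200, 314):.1f}M / mm²")
print(f"GP102 (Tesla P40): {transistor_density(11800, 471):.1f}M / mm²")
```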
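API Support

| Specification | Tesla P4 | Tesla P40 |
|---|---|---|
| DirectX | 12 (12_1) | 12 (12_1) |
| OpenGL | 4.6 | 4.6 |
| Vulkan | 1.4 | 1.4 |
| OpenCL | 3.0 | 3.0 |
| CUDA (compute capability) | 6.1 | 6.1 |
| Shader Model | 6.8 | 6.8 |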
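Physical

| Specification | Tesla P4 | Tesla P40 |
|---|---|---|
| Slot Width | Single-slot | Dual-slot |
| Length | 168 mm (6.6 inches) | 267 mm (10.5 inches) |
| Height | — | 111 mm (4.4 inches) |
| Outputs | No outputs | No outputs |
| Bus Interface | PCIe 3.0 x16 | PCIe 3.0 x16 |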
Other

| Specification | Tesla P4 | Tesla P40 |
|---|---|---|
| Launch Price | — | 5,699 USD |
| Production | End-of-life | End-of-life |
| Predecessor | Tesla Maxwell | Tesla Maxwell |
| Successor | Tesla Volta | Tesla Volta |