GPU Comparison

NVIDIA Tesla K40m

CORE STATE GK110B
VRAM 12 GB
CLOCK SPEED 876 MHz
TDP 245 W
BUS WIDTH 384 bit
ARCHITECTURE Kepler
PROCESS 28 nm
LAUNCH DATE 2013
VS

NVIDIA Tesla P4

CORE STATE GP104
VRAM 8 GB
CLOCK SPEED 1114 MHz
TDP 75 W
BUS WIDTH 256 bit
ARCHITECTURE Pascal
PROCESS 16 nm
LAUNCH DATE 2016

PERFORMANCE BENCHMARKS

Geekbench OpenCL
19,519
37,896
Geekbench Vulkan
N/A
40,476
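
To put the OpenCL scores in context, the short sketch below computes the ratio between the two cards and a rough score per watt of rated TDP. The scores and TDP values are copied from this page; dividing by TDP is an assumption made here for illustration, since TDP is a rated limit and not measured power draw.

```python
# Geekbench OpenCL scores and TDP values copied from the comparison above.
opencl = {"Tesla K40m": 19_519, "Tesla P4": 37_896}
tdp_watts = {"Tesla K40m": 245, "Tesla P4": 75}

# Relative OpenCL score of the P4 against the K40m.
speedup = opencl["Tesla P4"] / opencl["Tesla K40m"]

# Score per watt of rated TDP (a rough proxy, not measured efficiency).
per_watt = {name: opencl[name] / tdp_watts[name] for name in opencl}

print(f"P4 / K40m OpenCL ratio: {speedup:.2f}x")
for name, value in per_watt.items():
    print(f"{name}: {value:.1f} points per watt")
```

The Vulkan score is left out because the K40m has no result listed.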

DETAILED SPECIFICATIONS

SPECIFICATION
Tesla K40m
Tesla P4
Core Specs
Shading Units
2,880
2,560 -11.1%
TMUs
240
160 -33.3%
ROPs
48
64 +33.3%
SM Count
N/A
20
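
The percentage shown next to each Tesla P4 value appears to be the change relative to the Tesla K40m. A minimal sketch, assuming that convention (the helper name `percent_change` is made up for this example):

```python
def percent_change(baseline: float, other: float) -> str:
    """Change of `other` relative to `baseline`, formatted like the page."""
    return f"{(other - baseline) / baseline * 100:+.1f}%"

# (K40m value, P4 value) pairs copied from the Core Specs rows.
core_specs = {
    "Shading Units": (2880, 2560),
    "TMUs": (240, 160),
    "ROPs": (48, 64),
}

changes = {name: percent_change(k40m, p4) for name, (k40m, p4) in core_specs.items()}
for name, change in changes.items():
    print(f"{name}: {change}")
```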
Clocks
Base Clock
745 MHz
886 MHz
Boost Clock
876 MHz
1114 MHz
Memory Clock
1502 MHz 6 Gbps effective
1502 MHz 6 Gbps effective
Memory
Memory Size
12 GB
8 GB
VRAM (MB)
12,288
8,192 -33.3%
Memory Type
GDDR5
GDDR5
Memory Bus
384 bit
256 bit
Bandwidth
288.4 GB/s
192.3 GB/s
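
The bandwidth figures follow from the memory clock and bus width. The sketch below assumes the listed 1502 MHz clock carries four transfers per pin per cycle, which matches the "6 Gbps effective" note; the function name is illustrative only.

```python
def gddr5_bandwidth_gbs(memory_clock_mhz: float, bus_width_bits: int) -> float:
    """Peak bandwidth in GB/s, assuming 4 transfers per listed memory clock."""
    data_rate_gbps = memory_clock_mhz * 4 / 1000  # per-pin rate, about 6 Gbps here
    return data_rate_gbps * bus_width_bits / 8    # bits to bytes across the whole bus

k40m_bw = gddr5_bandwidth_gbs(1502, 384)
p4_bw = gddr5_bandwidth_gbs(1502, 256)

print(f"Tesla K40m: {k40m_bw:.1f} GB/s")
print(f"Tesla P4:   {p4_bw:.1f} GB/s")
```

Both cards run the same memory clock, so the one-third bandwidth gap comes entirely from the narrower 256-bit bus on the P4.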
Cache
L1 Cache
16 KB (per SMX)
48 KB (per SM)
L2 Cache
1536 KB
2 MB
Performance
Pixel Rate
52.56 GPixel/s
71.30 GPixel/s
Texture Rate
210.2 GTexel/s
178.2 GTexel/s
FP32 (TFLOPS)
5.046 TFLOPS
5.704 TFLOPS
FP64 (TFLOPS)
1.682 TFLOPS (1:3)
178.2 GFLOPS (1:32)
FP16 (TFLOPS)
N/A
89.12 GFLOPS (1:64)
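
Most of the peak figures above can be reproduced from the unit counts and the boost clock. The sketch below assumes one texel per TMU per clock and two floating-point operations (one fused multiply-add) per shading unit per clock, then applies the FP64 and FP16 ratios given in parentheses. Pixel rate is left out because the K40m value listed here does not follow from ROPs times clock alone.

```python
def peak_rates(shaders: int, tmus: int, boost_mhz: float, fp64_ratio: int) -> dict:
    """Theoretical peak rates from unit counts and boost clock."""
    fp32_gflops = shaders * 2 * boost_mhz / 1000  # 2 FLOPs per FMA per clock
    return {
        "texture_gtexels": tmus * boost_mhz / 1000,
        "fp32_tflops": fp32_gflops / 1000,
        "fp64_gflops": fp32_gflops / fp64_ratio,
    }

# Unit counts, boost clocks, and FP64 ratios copied from this page.
k40m = peak_rates(2880, 240, 876, fp64_ratio=3)
p4 = peak_rates(2560, 160, 1114, fp64_ratio=32)

# FP16 on the P4 is listed at 1:64 of the FP32 rate.
p4_fp16_gflops = p4["fp32_tflops"] * 1000 / 64

for name, rates in (("Tesla K40m", k40m), ("Tesla P4", p4)):
    print(name, {key: round(value, 3) for key, value in rates.items()})
print(f"Tesla P4 FP16: {p4_fp16_gflops:.2f} GFLOPS")
```

The result shows why the older K40m remains far ahead in double precision (about 1.68 TFLOPS against about 178 GFLOPS) even though the P4 has the higher FP32 peak.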
Power
TDP
245 W
75 W
TDP (W)
245
75 -69.4%
Suggested PSU
550 W
250 W
Power Connectors
N/A
None
Architecture
Architecture
Kepler
Pascal
GPU Name
GK110B
GP104
Generation
Tesla Kepler (Kxx)
Tesla Pascal (Pxx)
Process Size
28 nm
16 nm
Transistors
7,080 million
7,200 million
Die Size
561 mm²
314 mm²
Foundry
TSMC
TSMC
Density
12.6M / mm²
22.9M / mm²
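
The density row is the transistor count divided by the die area. A quick check using the figures listed above (variable names are illustrative):

```python
# (transistors in millions, die size in mm^2) copied from this page.
dies = {
    "Tesla K40m (GK110B, 28 nm)": (7080, 561),
    "Tesla P4 (GP104, 16 nm)": (7200, 314),
}

# Millions of transistors per square millimetre.
density = {name: transistors / area for name, (transistors, area) in dies.items()}

for name, value in density.items():
    print(f"{name}: {value:.1f}M / mm²")
```

The two chips have nearly the same transistor count, so the higher density on the 16 nm part shows up almost entirely as a smaller die.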
API Support
DirectX
12 (11_1)
12 (12_1)
OpenGL
4.6
4.6
Vulkan
1.2.175
1.4
OpenCL
3.0
3.0
CUDA
3.5
6.1
Shader Model
6.5 (5.1)
6.8
Physical
Slot Width
Dual-slot
Single-slot
Length
267 mm 10.5 inches
168 mm 6.6 inches
Outputs
No outputs
No outputs
Bus Interface
PCIe 3.0 x16
PCIe 3.0 x16
Other
Launch Price
7,699 USD
N/A
Production
End-of-life
End-of-life
Predecessor
Tesla Fermi
Tesla Maxwell
Successor
Tesla Maxwell
Tesla Volta