GPU Comparison
TESLA
NVIDIA Tesla K20Xm
GPU NAME: GK110
VRAM: 6 GB
CLOCK SPEED: —
TDP: 235 W
BUS WIDTH: 384 bit
ARCHITECTURE: Kepler
PROCESS: 28 nm
LAUNCH DATE: 2012
VS
TESLA
Tesla P4
GPU NAME: GP104
VRAM: 8 GB
CLOCK SPEED: 1114 MHz
TDP: 75 W
BUS WIDTH: 256 bit
ARCHITECTURE: Pascal
PROCESS: 16 nm
LAUNCH DATE: 2016
DETAILED SPECIFICATIONS
SPECIFICATION | Tesla K20Xm | Tesla P4 | Difference
Core Specs
Shading Units | 2,688 | 2,560 | -4.8%
TMUs | 224 | 160 | -28.6%
ROPs | 48 | 64 | +33.3%
SM Count | — | 20
Clocks
Base Clock | — | 886 MHz
Boost Clock | — | 1114 MHz
GPU Clock | 732 MHz | —
Memory Clock | 1300 MHz (5.2 Gbps effective) | 1502 MHz (6 Gbps effective)
Memory
Memory Size | 6 GB (6,144 MB) | 8 GB (8,192 MB) | +33.3%
Memory Type | GDDR5 | GDDR5
Memory Bus | 384 bit | 256 bit
Bandwidth | 249.6 GB/s | 192.3 GB/s
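The bandwidth figures follow from the memory clock and bus width listed above. The sketch below reproduces them under one assumption not stated on this page: that GDDR5's effective per-pin data rate is four times the listed memory clock (consistent with 1300 MHz giving 5.2 Gbps). The function names are illustrative.

```python
def effective_gbps(memory_clock_mhz: float) -> float:
    """Effective per-pin data rate in Gbps, assuming GDDR5 moves 4 bits per listed clock cycle."""
    return memory_clock_mhz * 4 / 1000


def memory_bandwidth_gbs(data_rate_gbps: float, bus_width_bits: int) -> float:
    """Peak bandwidth in GB/s: per-pin rate times bus width, divided by 8 bits per byte."""
    return data_rate_gbps * bus_width_bits / 8


# Values taken from the Clocks and Memory rows of this comparison.
k20xm = memory_bandwidth_gbs(effective_gbps(1300), 384)
p4 = memory_bandwidth_gbs(effective_gbps(1502), 256)

print(f"Tesla K20Xm: {k20xm:.1f} GB/s")  # Tesla K20Xm: 249.6 GB/s
print(f"Tesla P4:    {p4:.1f} GB/s")     # Tesla P4:    192.3 GB/s
```

The K20Xm's wider 384-bit bus outweighs the P4's faster memory clock, which is why the older card has the higher bandwidth.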
Cache
L1 Cache | 16 KB (per SMX) | 48 KB (per SM)
L2 Cache | 1.5 MB (1536 KB) | 2 MB
Performance
Pixel Rate | 40.99 GPixel/s | 71.30 GPixel/s
Texture Rate | 164.0 GTexel/s | 178.2 GTexel/s
FP32 | 3.935 TFLOPS | 5.704 TFLOPS
FP64 | 1,311.7 GFLOPS (1:3) | 178.2 GFLOPS (1:32)
FP16 | — | 89.12 GFLOPS (1:64)
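The texture-rate and floating-point figures above can be reproduced from the unit counts and clocks listed earlier. This is a rough sketch, assuming the usual theoretical-peak conventions: one texel per TMU per clock, and two FP32 operations (one fused multiply-add) per shading unit per clock. It uses the 732 MHz GPU clock for the K20Xm and the 1114 MHz boost clock for the P4, since those are the clocks that match the listed numbers. Function names are illustrative.

```python
def texture_rate_gtexels(tmus: int, clock_mhz: float) -> float:
    """Peak texture fill rate in GTexel/s, assuming one texel per TMU per clock."""
    return tmus * clock_mhz / 1000


def fp32_tflops(shading_units: int, clock_mhz: float) -> float:
    """Peak FP32 throughput in TFLOPS, assuming 2 FLOPs (one FMA) per unit per clock."""
    return 2 * shading_units * clock_mhz / 1e6


def scaled_gflops(fp32_tflops_value: float, ratio: int) -> float:
    """Throughput in GFLOPS for a precision that runs at 1:ratio of the FP32 rate."""
    return fp32_tflops_value * 1000 / ratio


k20xm_fp32 = fp32_tflops(2688, 732)   # about 3.935 TFLOPS
p4_fp32 = fp32_tflops(2560, 1114)     # about 5.704 TFLOPS

print(f"K20Xm texture rate: {texture_rate_gtexels(224, 732):.1f} GTexel/s")
print(f"P4 texture rate:    {texture_rate_gtexels(160, 1114):.1f} GTexel/s")
print(f"K20Xm FP64 (1:3):   {scaled_gflops(k20xm_fp32, 3):.1f} GFLOPS")
print(f"P4 FP64 (1:32):     {scaled_gflops(p4_fp32, 32):.1f} GFLOPS")
print(f"P4 FP16 (1:64):     {scaled_gflops(p4_fp32, 64):.2f} GFLOPS")
```

Pixel rate is left out on purpose: the P4's 71.30 GPixel/s matches 64 ROPs × 1114 MHz, but the K20Xm's listed 40.99 GPixel/s does not equal 48 ROPs × 732 MHz (about 35.1), so a ROP-based formula would not reproduce both entries.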
Power
TDP | 235 W | 75 W | -68.1%
Suggested PSU | 550 W | 250 W
Power Connectors | — | None
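The percentage figures that accompany some rows (for example -68.1% for TDP) appear to be the relative change of the Tesla P4 value against the Tesla K20Xm value. A minimal sketch under that assumption, with an illustrative function name:

```python
def percent_change(baseline: float, other: float) -> float:
    """Relative change of `other` versus `baseline`, in percent."""
    return (other - baseline) / baseline * 100


# (Tesla K20Xm value, Tesla P4 value) pairs taken from this comparison.
rows = {
    "Shading Units": (2688, 2560),
    "TMUs": (224, 160),
    "ROPs": (48, 64),
    "VRAM (MB)": (6144, 8192),
    "TDP (W)": (235, 75),
}

for name, (k20xm, p4) in rows.items():
    print(f"{name}: {percent_change(k20xm, p4):+.1f}%")
```

Under this reading a negative value means the P4 figure is lower than the K20Xm figure, which is a disadvantage for unit counts but an advantage for TDP.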
Architecture
Architecture | Kepler | Pascal
GPU Name | GK110 | GP104
Generation | Tesla Kepler (Kxx) | Tesla Pascal (Pxx)
Process Size | 28 nm | 16 nm
Transistors | 7,080 million | 7,200 million
Die Size | 561 mm² | 314 mm²
Foundry | TSMC | TSMC
Density | 12.6M / mm² | 22.9M / mm²
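The density row is simply the transistor count divided by the die area. A short sketch that recomputes it from the two rows above (the function name is illustrative):

```python
def transistor_density(transistors_million: float, die_size_mm2: float) -> float:
    """Transistor density in millions of transistors per square millimetre."""
    return transistors_million / die_size_mm2


k20xm = transistor_density(7080, 561)  # GK110, 28 nm
p4 = transistor_density(7200, 314)     # GP104, 16 nm

print(f"GK110: {k20xm:.1f}M / mm²")  # GK110: 12.6M / mm²
print(f"GP104: {p4:.1f}M / mm²")     # GP104: 22.9M / mm²
```

The two chips have nearly the same transistor count, so the density gap comes almost entirely from the smaller die on the 16 nm process.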
API Support
DirectX | 12 (11_0) | 12 (12_1)
OpenGL | 4.6 | 4.6
Vulkan | 1.2.175 | 1.4
OpenCL | 3.0 | 3.0
CUDA Compute Capability | 3.5 | 6.1
Shader Model | 6.5 (5.1) | 6.8
Physical
Slot Width | Dual-slot | Single-slot
Length | 267 mm (10.5 inches) | 168 mm (6.6 inches)
Outputs | No outputs | No outputs
Bus Interface | PCIe 3.0 x16 | PCIe 3.0 x16
Other
Launch Price | 7,699 USD | —
Production | End-of-life | End-of-life
Predecessor | Tesla Fermi | Tesla Maxwell
Successor | Tesla Maxwell | Tesla Volta