GPU Comparison
GEFORCE
NVIDIA GeForce GTX 870M
CORE STATE
GK104
VRAM
3 GB
CLOCK SPEED
967 MHz
TDP
100 W
BUS WIDTH
192 bit
ARCHITECTURE
Kepler
PROCESS
28 nm
LAUNCH DATE
2014
VS
GEFORCE
Tesla P4
CORE STATE
GP104
VRAM
8 GB
CLOCK SPEED
1114 MHz
TDP
75 W
BUS WIDTH
256 bit
ARCHITECTURE
Pascal
PROCESS
16 nm
LAUNCH DATE
2016
PERFORMANCE BENCHMARKS
geekbench_metal
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
GTX 870M
Tesla P4
Core Specs
Shading Units
1,344
2,560
+90.5%
Shaders
1,344
2,560
+90.5%
TMUs
112
160
+42.9%
ROPs
24
64
+166.7%
SM Count
—
20
Clocks
Base Clock
941 MHz
886 MHz
Boost Clock
967 MHz
1114 MHz
Memory Clock
1250 MHz
5 Gbps effective
1502 MHz
6 Gbps effective
Memory
Memory Size
3 GB
8 GB
VRAM (MB)
3,072
8,192
+166.7%
Memory Type
GDDR5
GDDR5
Memory Bus
192 bit
256 bit
Bandwidth
120.0 GB/s
192.3 GB/s
Cache
L1 Cache
16 KB (per SMX)
48 KB (per SM)
L2 Cache
384 KB
2 MB
Performance
Pixel Rate
27.08 GPixel/s
71.30 GPixel/s
Texture Rate
108.3 GTexel/s
178.2 GTexel/s
FP32 (TFLOPS)
2.599 TFLOPS
5.704 TFLOPS
FP64 (TFLOPS)
108.3 GFLOPS (1:24)
178.2 GFLOPS (1:32)
FP16 (TFLOPS)
—
89.12 GFLOPS (1:64)
Power
TDP
100 W
75 W
TDP (W)
100
75
-25.0%
Suggested PSU
—
250 W
Power Connectors
None
None
Architecture
Architecture
Kepler
Pascal
GPU Name
GK104
GP104
Generation
GeForce 800M
Tesla Pascal
(Pxx)
Process Size
28 nm
16 nm
Transistors
3,540 million
7,200 million
Die Size
294 mm²
314 mm²
Foundry
TSMC
TSMC
Density
12.0M / mm²
22.9M / mm²
API Support
DirectX
12 (11_0)
12 (12_1)
OpenGL
4.6
4.6
Vulkan
1.2.175
1.4
OpenCL
3.0
3.0
CUDA
3.0
6.1
Shader Model
6.5 (5.1)
6.8
Physical
Slot Width
MXM Module
Single-slot
Length
—
168 mm
6.6 inches
Outputs
Portable Device Dependent
No outputs
Bus Interface
MXM-B (3.0)
PCIe 3.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
GeForce 700M
Tesla Maxwell
Successor
GeForce 900M
Tesla Volta