GPU Comparison

NVIDIA P102-100

CORE STATE GP102
VRAM 5 GB
CLOCK SPEED 1683 MHz
TDP 250 W
BUS WIDTH 320 bit
ARCHITECTURE Pascal
PROCESS 16 nm
LAUNCH DATE 2018
VS

Tesla M40

CORE STATE GM200
VRAM 12 GB
CLOCK SPEED 1112 MHz
TDP 250 W
BUS WIDTH 384 bit
ARCHITECTURE Maxwell 2.0
PROCESS 28 nm
LAUNCH DATE 2015

PERFORMANCE BENCHMARKS

BENCHMARK           P102-100    Tesla M40
Geekbench OpenCL    49,602      39,192
Geekbench Vulkan    65,600      44,602
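
The gap between the two cards in the Geekbench scores above is easier to read as a percentage. The following is a minimal sketch, not part of the original page: the `percent_diff` helper and the `scores` dictionary are illustrative names, and the first score of each pair is assumed to belong to the P102-100 because that card is listed first throughout.

```python
def percent_diff(value: float, baseline: float) -> float:
    """Relative difference of `value` against `baseline`, in percent."""
    return (value - baseline) / baseline * 100


# (P102-100 score, Tesla M40 score), copied from the benchmark listing.
# Assumption: the first number of each pair is the P102-100.
scores = {
    "Geekbench OpenCL": (49_602, 39_192),
    "Geekbench Vulkan": (65_600, 44_602),
}

for name, (p102, m40) in scores.items():
    lead = percent_diff(p102, m40)  # Tesla M40 is the baseline here
    print(f"{name}: P102-100 scores {lead:.1f}% higher than Tesla M40")
```

Note that the percentages in the specification listing use the opposite baseline: the Tesla M40 value is compared against the P102-100, so `percent_diff(3072, 3200)` gives the -4.0% shown for shading units.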

DETAILED SPECIFICATIONS

SPECIFICATION       P102-100                      Tesla M40 (% vs. P102-100)

Core Specs
Shading Units       3,200                         3,072 (-4.0%)
TMUs                200                           192 (-4.0%)
ROPs                80                            96 (+20.0%)
SM Count            25                            -

Clocks
Base Clock          1582 MHz                      948 MHz
Boost Clock         1683 MHz                      1112 MHz
Memory Clock        1376 MHz (11 Gbps effective)  1502 MHz (6 Gbps effective)

Memory
Memory Size         5 GB                          12 GB (+140.0%)
Memory Type         GDDR5X                        GDDR5
Memory Bus          320 bit                       384 bit
Bandwidth           440.3 GB/s                    288.4 GB/s

Cache
L1 Cache            48 KB (per SM)                48 KB (per SMM)
L2 Cache            2.5 MB                        3 MB

Performance
Pixel Rate          134.6 GPixel/s                106.8 GPixel/s
Texture Rate        336.6 GTexel/s                213.5 GTexel/s
FP32                10.77 TFLOPS                  6.832 TFLOPS
FP64                336.6 GFLOPS (1:32)           213.5 GFLOPS (1:32)
FP16                168.3 GFLOPS (1:64)           -

Power
TDP                 250 W                         250 W (0.0%)
Suggested PSU       600 W                         600 W
Power Connectors    2x 8-pin                      8-pin EPS

Architecture
Architecture        Pascal                        Maxwell 2.0
GPU Name            GP102                         GM200
Generation          Mining GPUs                   Tesla Maxwell (Mxx)
Process Size        16 nm                         28 nm
Transistors         11,800 million                8,000 million
Die Size            471 mm²                       601 mm²
Foundry             TSMC                          TSMC
Density             25.1M / mm²                   13.3M / mm²

API Support
DirectX             12 (12_1)                     12 (12_1)
OpenGL              4.6                           4.6
Vulkan              1.4                           1.4
OpenCL              3.0                           3.0
CUDA                6.1                           5.2
Shader Model        6.8                           6.8

Physical
Slot Width          Dual-slot                     Dual-slot
Length              267 mm (10.5 inches)          267 mm (10.5 inches)
Outputs             No outputs                    No outputs
Bus Interface       PCIe 1.0 x4                   PCIe 3.0 x16

Other
Production          End-of-life                   End-of-life
Predecessor         -                             Tesla Kepler
Successor           -                             Tesla Pascal
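
Several of the listed figures (bandwidth, pixel and texture rate, FP32/FP64/FP16 throughput, transistor density) can be reproduced from the other rows. The sketch below shows that arithmetic under stated assumptions: the `Gpu` class and its field names are illustrative rather than taken from the page, the formulas are the usual theoretical-peak ones, and the transfers-per-clock values (8 for GDDR5X, 4 for GDDR5) are inferred from the listed memory clocks and effective data rates.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gpu:
    name: str
    shaders: int
    tmus: int
    rops: int
    boost_mhz: float
    mem_clock_mhz: float
    transfers_per_clock: int  # assumption: 8 for GDDR5X, 4 for GDDR5
    bus_bits: int
    transistors_millions: float
    die_mm2: float

    def bandwidth_gbs(self) -> float:
        # Bytes per transfer (bus width / 8) times effective data rate in Gbps.
        gbps = self.mem_clock_mhz * self.transfers_per_clock / 1000
        return self.bus_bits / 8 * gbps

    def pixel_rate(self) -> float:
        # GPixel/s: one pixel per ROP per boost clock.
        return self.rops * self.boost_mhz / 1000

    def texture_rate(self) -> float:
        # GTexel/s: one texel per TMU per boost clock.
        return self.tmus * self.boost_mhz / 1000

    def fp32_gflops(self) -> float:
        # Two FLOPs per shader per clock (one fused multiply-add).
        return 2 * self.shaders * self.boost_mhz / 1000

    def density(self) -> float:
        # Millions of transistors per square millimetre.
        return self.transistors_millions / self.die_mm2


P102 = Gpu("P102-100", 3200, 200, 80, 1683, 1376, 8, 320, 11_800, 471)
M40 = Gpu("Tesla M40", 3072, 192, 96, 1112, 1502, 4, 384, 8_000, 601)

for gpu in (P102, M40):
    fp32 = gpu.fp32_gflops()
    print(
        f"{gpu.name}: {gpu.bandwidth_gbs():.1f} GB/s, "
        f"{gpu.pixel_rate():.1f} GPixel/s, {gpu.texture_rate():.1f} GTexel/s, "
        f"{fp32 / 1000:.3f} TFLOPS FP32, {fp32 / 32:.1f} GFLOPS FP64 (1:32), "
        f"{gpu.density():.1f}M / mm²"
    )

# The listing gives an FP16 figure only for the P102-100, at a 1:64 ratio.
print(f"P102-100 FP16 (1:64): {P102.fp32_gflops() / 64:.1f} GFLOPS")
```

These are theoretical peaks at the boost clock, which is why they match the listing to its stated precision; they say nothing about sustained performance.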