GPU Comparison
GEFORCE
NVIDIA Tesla M4
CORE STATE
GM206
VRAM
4 GB
CLOCK SPEED
1072 MHz
TDP
50 W
BUS WIDTH
128 bit
ARCHITECTURE
Maxwell 2.0
PROCESS
28 nm
LAUNCH DATE
2015
VS
GEFORCE
Tesla P4
CORE STATE
GP104
VRAM
8 GB
CLOCK SPEED
1114 MHz
TDP
75 W
BUS WIDTH
256 bit
ARCHITECTURE
Pascal
PROCESS
16 nm
LAUNCH DATE
2016
PERFORMANCE BENCHMARKS
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
Tesla M4
Tesla P4
Core Specs
Shading Units
1,024
2,560
+150.0%
Shaders
1,024
2,560
+150.0%
TMUs
64
160
+150.0%
ROPs
32
64
+100.0%
SM Count
—
20
Clocks
Base Clock
872 MHz
886 MHz
Boost Clock
1072 MHz
1114 MHz
Memory Clock
1375 MHz
5.5 Gbps effective
1502 MHz
6 Gbps effective
Memory
Memory Size
4 GB
8 GB
VRAM (MB)
4,096
8,192
+100.0%
Memory Type
GDDR5
GDDR5
Memory Bus
128 bit
256 bit
Bandwidth
88.00 GB/s
192.3 GB/s
Cache
L1 Cache
48 KB (per SMM)
48 KB (per SM)
L2 Cache
1024 KB
2 MB
Performance
Pixel Rate
34.30 GPixel/s
71.30 GPixel/s
Texture Rate
68.61 GTexel/s
178.2 GTexel/s
FP32 (TFLOPS)
2.195 TFLOPS
5.704 TFLOPS
FP64 (TFLOPS)
68.61 GFLOPS (1:32)
178.2 GFLOPS (1:32)
FP16 (TFLOPS)
—
89.12 GFLOPS (1:64)
Power
TDP
50 W
75 W
TDP (W)
50
75
+50.0%
Suggested PSU
250 W
250 W
Power Connectors
—
None
Architecture
Architecture
Maxwell 2.0
Pascal
GPU Name
GM206
GP104
Generation
Tesla Maxwell
(Mxx)
Tesla Pascal
(Pxx)
Process Size
28 nm
16 nm
Transistors
2,940 million
7,200 million
Die Size
228 mm²
314 mm²
Foundry
TSMC
TSMC
Density
12.9M / mm²
22.9M / mm²
API Support
DirectX
12 (12_1)
12 (12_1)
OpenGL
4.6
4.6
Vulkan
1.4
1.4
OpenCL
3.0
3.0
CUDA
5.2
6.1
Shader Model
6.8
6.8
Physical
Slot Width
Single-slot
Single-slot
Length
—
168 mm
6.6 inches
Outputs
No outputs
No outputs
Bus Interface
PCIe 3.0 x16
PCIe 3.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
Tesla Kepler
Tesla Maxwell
Successor
Tesla Pascal
Tesla Volta