GPU Comparison
NVIDIA Quadro GP100
GPU NAME: GP100
VRAM: 16 GB
CLOCK SPEED (BOOST): 1443 MHz
TDP: 235 W
BUS WIDTH: 4096 bit
ARCHITECTURE: Pascal
PROCESS: 16 nm
LAUNCH DATE: 2016
VS
Tesla M4
GPU NAME: GM206
VRAM: 4 GB
CLOCK SPEED (BOOST): 1072 MHz
TDP: 50 W
BUS WIDTH: 128 bit
ARCHITECTURE: Maxwell 2.0
PROCESS: 28 nm
LAUNCH DATE: 2015
PERFORMANCE BENCHMARKS
Geekbench OpenCL (chart not reproduced)
DETAILED SPECIFICATIONS
SPECIFICATION
Quadro GP100
Tesla M4
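| Core Specs | Quadro GP100 | Tesla M4 | Tesla M4 vs. Quadro GP100 |
| --- | --- | --- | --- |
| Shading Units (Shaders) | 3,584 | 1,024 | -71.4% |
| TMUs | 224 | 64 | -71.4% |
| ROPs | 96 | 32 | -66.7% |
| SM Count | 56 | N/A | |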
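The percentage column appears to express the Tesla M4 value relative to the Quadro GP100 value. The sketch below reproduces the listed figures under that assumption; the helper name `percent_difference` is illustrative and does not come from the source.

```python
def percent_difference(reference: float, other: float) -> float:
    """Relative change of `other` versus `reference`, in percent."""
    return (other - reference) / reference * 100.0


# Values from the core specs listed above: (Quadro GP100, Tesla M4).
core_specs = {
    "Shading Units": (3584, 1024),
    "TMUs": (224, 64),
    "ROPs": (96, 32),
}

for name, (gp100, m4) in core_specs.items():
    # Assumption: the Quadro GP100 is the reference card.
    print(f"{name}: {percent_difference(gp100, m4):+.1f}%")
```

The same calculation also matches the VRAM (-75.0%) and TDP (-78.7%) rows.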
| Clocks | Quadro GP100 | Tesla M4 |
| --- | --- | --- |
| Base Clock | 1304 MHz | 872 MHz |
| Boost Clock | 1443 MHz | 1072 MHz |
| Memory Clock | 715 MHz (1430 Mbps effective) | 1375 MHz (5.5 Gbps effective) |
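| Memory | Quadro GP100 | Tesla M4 | Tesla M4 vs. Quadro GP100 |
| --- | --- | --- | --- |
| Memory Size | 16 GB (16,384 MB) | 4 GB (4,096 MB) | -75.0% |
| Memory Type | HBM2 | GDDR5 | |
| Memory Bus | 4096 bit | 128 bit | |
| Bandwidth | 732.2 GB/s | 88.00 GB/s | |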
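The bandwidth figures follow from the bus width and the effective per-pin data rate listed under Memory Clock. A minimal sketch, assuming the usual theoretical-peak formula (bus width in bits times data rate in Gbps, divided by 8 bits per byte); the function name is illustrative.

```python
def memory_bandwidth_gbs(bus_width_bits: int, data_rate_gbps: float) -> float:
    """Theoretical peak memory bandwidth in GB/s.

    bus_width_bits: memory bus width in bits
    data_rate_gbps: effective data rate per pin in Gbit/s
    """
    return bus_width_bits * data_rate_gbps / 8.0


# Quadro GP100: 4096-bit HBM2 at 1430 Mbps (1.430 Gbps) effective.
gp100_bw = memory_bandwidth_gbs(4096, 1.430)

# Tesla M4: 128-bit GDDR5 at 5.5 Gbps effective.
m4_bw = memory_bandwidth_gbs(128, 5.5)

print(f"Quadro GP100: {gp100_bw:.1f} GB/s")
print(f"Tesla M4:     {m4_bw:.2f} GB/s")
```

The wide HBM2 bus gives the Quadro GP100 roughly eight times the bandwidth despite its much lower per-pin data rate.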
| Cache | Quadro GP100 | Tesla M4 |
| --- | --- | --- |
| L1 Cache | 24 KB (per SM) | 48 KB (per SMM) |
| L2 Cache | 4 MB | 1024 KB |
| Performance | Quadro GP100 | Tesla M4 |
| --- | --- | --- |
| Pixel Rate | 138.5 GPixel/s | 34.30 GPixel/s |
| Texture Rate | 323.2 GTexel/s | 68.61 GTexel/s |
| FP32 | 10.34 TFLOPS | 2.195 TFLOPS |
| FP64 | 5.172 TFLOPS (1:2) | 68.61 GFLOPS (1:32) |
| FP16 | 20.69 TFLOPS (2:1) | N/A |
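These throughput numbers are theoretical peaks that can be derived from the unit counts and the boost clock. The sketch below assumes the common conventions (one pixel per ROP per clock, one texel per TMU per clock, and two FP32 operations per shader per clock for a fused multiply-add); the `Gpu` class and its method names are illustrative, not taken from the source.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gpu:
    name: str
    shaders: int
    tmus: int
    rops: int
    boost_mhz: float

    def pixel_rate_gpixels(self) -> float:
        # Assumption: one pixel per ROP per clock.
        return self.rops * self.boost_mhz / 1000.0

    def texture_rate_gtexels(self) -> float:
        # Assumption: one texel per TMU per clock.
        return self.tmus * self.boost_mhz / 1000.0

    def fp32_tflops(self) -> float:
        # Assumption: a fused multiply-add counts as two operations.
        return 2 * self.shaders * self.boost_mhz / 1e6


gp100 = Gpu("Quadro GP100", shaders=3584, tmus=224, rops=96, boost_mhz=1443)
m4 = Gpu("Tesla M4", shaders=1024, tmus=64, rops=32, boost_mhz=1072)

for gpu in (gp100, m4):
    print(
        f"{gpu.name}: {gpu.pixel_rate_gpixels():.1f} GPixel/s, "
        f"{gpu.texture_rate_gtexels():.1f} GTexel/s, "
        f"{gpu.fp32_tflops():.3f} TFLOPS FP32"
    )

# FP64 follows from the listed ratios: 1:2 on GP100, 1:32 on GM206.
gp100_fp64_tflops = gp100.fp32_tflops() / 2
m4_fp64_gflops = m4.fp32_tflops() * 1000 / 32
```

Real workloads rarely reach these peaks, so the figures are best read as upper bounds for comparing the two cards.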
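| Power | Quadro GP100 | Tesla M4 | Tesla M4 vs. Quadro GP100 |
| --- | --- | --- | --- |
| TDP | 235 W | 50 W | -78.7% |
| Suggested PSU | 550 W | 250 W | |
| Power Connectors | 1x 8-pin | N/A | |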
| Architecture | Quadro GP100 | Tesla M4 |
| --- | --- | --- |
| Architecture | Pascal | Maxwell 2.0 |
| GPU Name | GP100 | GM206 |
| Generation | Quadro Pascal (Px000) | Tesla Maxwell (Mxx) |
| Process Size | 16 nm | 28 nm |
| Transistors | 15,300 million | 2,940 million |
| Die Size | 610 mm² | 228 mm² |
| Foundry | TSMC | TSMC |
| Density | 25.1M / mm² | 12.9M / mm² |
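The density row is the transistor count divided by the die area. A short sketch that recomputes it from the figures listed above; the function name is illustrative.

```python
def transistor_density(transistors_millions: float, die_area_mm2: float) -> float:
    """Transistor density in millions of transistors per mm²."""
    return transistors_millions / die_area_mm2


# Figures from the architecture specs above.
gp100_density = transistor_density(15_300, 610)  # GP100, 16 nm
gm206_density = transistor_density(2_940, 228)   # GM206, 28 nm

print(f"GP100: {gp100_density:.1f}M / mm²")
print(f"GM206: {gm206_density:.1f}M / mm²")
print(f"Ratio: {gp100_density / gm206_density:.2f}x")
```

The move from 28 nm to 16 nm roughly doubles the density between these two chips.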
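| API Support | Quadro GP100 | Tesla M4 |
| --- | --- | --- |
| DirectX | 12 (12_1) | 12 (12_1) |
| OpenGL | 4.6 | 4.6 |
| Vulkan | 1.3 | 1.4 |
| OpenCL | 3.0 | 3.0 |
| CUDA (compute capability) | 6.0 | 5.2 |
| Shader Model | 6.0 | 6.8 |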
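| Physical | Quadro GP100 | Tesla M4 |
| --- | --- | --- |
| Slot Width | Dual-slot | Single-slot |
| Length | 267 mm (10.5 inches) | N/A |
| Height | 111 mm (4.4 inches) | N/A |
| Outputs | 1x DVI, 4x DisplayPort 1.4a | No outputs |
| Bus Interface | PCIe 3.0 x16 | PCIe 3.0 x16 |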
| Other | Quadro GP100 | Tesla M4 |
| --- | --- | --- |
| Production | End-of-life | End-of-life |
| Predecessor | Quadro Maxwell | Tesla Kepler |
| Successor | Quadro Volta | Tesla Pascal |