GPU Comparison
GEFORCE
NVIDIA Quadro 4000M
CORE STATE
GF104
VRAM
2 GB
CLOCK SPEED
—
TDP
100 W
BUS WIDTH
256 bit
ARCHITECTURE
Fermi
PROCESS
40 nm
LAUNCH DATE
2011
VS
GEFORCE
Tesla P4
CORE STATE
GP104
VRAM
8 GB
CLOCK SPEED
1114 MHz
TDP
75 W
BUS WIDTH
256 bit
ARCHITECTURE
Pascal
PROCESS
16 nm
LAUNCH DATE
2016
PERFORMANCE BENCHMARKS
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
Quadro 4000M
Tesla P4
Core Specs
Shading Units
336
2,560
+661.9%
Shaders
336
2,560
+661.9%
TMUs
56
160
+185.7%
ROPs
32
64
+100.0%
SM Count
7
20
+185.7%
Clocks
Base Clock
—
886 MHz
Boost Clock
—
1114 MHz
GPU Clock
475 MHz
—
Shader Clock
950 MHz
—
Memory Clock
625 MHz
2.5 Gbps effective
1502 MHz
6 Gbps effective
Memory
Memory Size
2 GB
8 GB
VRAM (MB)
2,048
8,192
+300.0%
Memory Type
GDDR5
GDDR5
Memory Bus
256 bit
256 bit
Bandwidth
80.00 GB/s
192.3 GB/s
Cache
L1 Cache
64 KB (per SM)
48 KB (per SM)
L2 Cache
512 KB
2 MB
Performance
Pixel Rate
6.650 GPixel/s
71.30 GPixel/s
Texture Rate
26.60 GTexel/s
178.2 GTexel/s
FP32 (TFLOPS)
638.4 GFLOPS
5.704 TFLOPS
FP64 (TFLOPS)
53.20 GFLOPS (1:12)
178.2 GFLOPS (1:32)
FP16 (TFLOPS)
—
89.12 GFLOPS (1:64)
Power
TDP
100 W
75 W
TDP (W)
100
75
-25.0%
Suggested PSU
—
250 W
Power Connectors
None
None
Architecture
Architecture
Fermi
Pascal
GPU Name
GF104
GP104
Generation
Quadro Fermi-M
(x000M)
Tesla Pascal
(Pxx)
Process Size
40 nm
16 nm
Transistors
1,950 million
7,200 million
Die Size
332 mm²
314 mm²
Foundry
TSMC
TSMC
Density
5.9M / mm²
22.9M / mm²
API Support
DirectX
12 (11_0)
12 (12_1)
OpenGL
4.6
4.6
Vulkan
—
1.4
OpenCL
1.1
3.0
CUDA
2.1
6.1
Shader Model
5.1
6.8
Physical
Slot Width
MXM Module
Single-slot
Length
—
168 mm
6.6 inches
Outputs
Portable Device Dependent
No outputs
Bus Interface
MXM-B (3.0)
PCIe 3.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
Quadro FX Mobile
Tesla Maxwell
Successor
Quadro Kepler-M
Tesla Volta