GPU Comparison
QUADRO
NVIDIA Quadro 4000M
GPU NAME
GF104
VRAM
2 GB
CLOCK SPEED
475 MHz
TDP
100 W
BUS WIDTH
256 bit
ARCHITECTURE
Fermi
PROCESS
40 nm
LAUNCH DATE
2011
VS
TESLA
Tesla T4
GPU NAME
TU104
VRAM
16 GB
CLOCK SPEED
1590 MHz
TDP
70 W
BUS WIDTH
256 bit
ARCHITECTURE
Turing
PROCESS
12 nm
LAUNCH DATE
2018
PERFORMANCE BENCHMARKS
[Charts: Geekbench OpenCL, Geekbench Vulkan]
DETAILED SPECIFICATIONS
SPECIFICATION
Quadro 4000M
Tesla T4
Core Specs
Shading Units
336
2,560
+661.9%
TMUs
56
160
+185.7%
ROPs
32
64
+100.0%
SM Count
7
40
+471.4%
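The percentage column appears to express each Tesla T4 value relative to the Quadro 4000M value. A minimal Python sketch, assuming that convention (the helper name `percent_diff` is hypothetical and not taken from the source):

```python
def percent_diff(baseline: float, other: float) -> float:
    """Relative change of `other` versus `baseline`, in percent."""
    return (other - baseline) / baseline * 100.0


# Core specs from the rows above: (Quadro 4000M, Tesla T4)
core_specs = {
    "Shading Units": (336, 2560),
    "TMUs": (56, 160),
    "ROPs": (32, 64),
    "SM Count": (7, 40),
}

for name, (quadro, tesla) in core_specs.items():
    # Format with an explicit sign and one decimal, matching the table style
    print(f"{name}: {percent_diff(quadro, tesla):+.1f}%")
```

Under this assumption the printed values agree with the percentages listed for each row.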
Clocks
Base Clock
—
585 MHz
Boost Clock
—
1590 MHz
GPU Clock
475 MHz
—
Shader Clock
950 MHz
—
Memory Clock
625 MHz (2.5 Gbps effective)
1250 MHz (10 Gbps effective)
Memory
Memory Size
2 GB (2,048 MB)
16 GB (16,384 MB)
+700.0%
Memory Type
GDDR5
GDDR6
Memory Bus
256 bit
256 bit
Bandwidth
80.00 GB/s
320.0 GB/s
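The bandwidth figures are consistent with the usual relation: bus width in bits times the effective per-pin data rate, divided by 8 bits per byte. A short Python sketch of that arithmetic, assuming this is how the listed values were derived (the function name is hypothetical):

```python
def memory_bandwidth_gb_s(bus_width_bits: int, effective_gbps: float) -> float:
    """Peak memory bandwidth in GB/s from bus width and per-pin data rate."""
    return bus_width_bits * effective_gbps / 8.0


# Bus width and effective data rate taken from the Memory rows above
quadro_4000m = memory_bandwidth_gb_s(256, 2.5)   # GDDR5
tesla_t4 = memory_bandwidth_gb_s(256, 10.0)      # GDDR6

print(f"Quadro 4000M: {quadro_4000m:.2f} GB/s")
print(f"Tesla T4: {tesla_t4:.1f} GB/s")
```

Both cards share a 256-bit bus, so the fourfold bandwidth gap comes entirely from the higher effective data rate.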
Cache
L1 Cache
64 KB (per SM)
64 KB (per SM)
L2 Cache
512 KB
4 MB
Performance
Pixel Rate
6.650 GPixel/s
101.8 GPixel/s
Texture Rate
26.60 GTexel/s
254.4 GTexel/s
FP32
638.4 GFLOPS
8.141 TFLOPS
FP64
53.20 GFLOPS (1:12)
254.4 GFLOPS (1:32)
FP16
—
65.13 TFLOPS (8:1)
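The texture and floating-point rates above can be reproduced from the core counts and clocks listed earlier. The sketch below is an illustration under stated assumptions, not the site's own method: it assumes one texel per TMU per clock, two FP32 operations per shading unit per clock (one fused multiply-add), the 475 MHz GPU clock and 950 MHz shader clock for the Quadro 4000M, and the 1590 MHz boost clock for the Tesla T4. The function names are hypothetical.

```python
def texture_rate_gtexel_s(tmus: int, clock_mhz: float) -> float:
    """Texture fill rate in GTexel/s, assuming one texel per TMU per clock."""
    return tmus * clock_mhz / 1000.0


def fp32_gflops(shading_units: int, shader_clock_mhz: float) -> float:
    """Peak FP32 rate in GFLOPS, assuming 2 FLOPs per unit per clock (FMA)."""
    return shading_units * 2 * shader_clock_mhz / 1000.0


# Quadro 4000M: texturing at the GPU clock, arithmetic at the shader clock
quadro_tex = texture_rate_gtexel_s(56, 475)
quadro_fp32 = fp32_gflops(336, 950)

# Tesla T4: both rates at the boost clock
tesla_tex = texture_rate_gtexel_s(160, 1590)
tesla_fp32 = fp32_gflops(2560, 1590)

# FP64 and FP16 follow from the ratios listed in the table
quadro_fp64 = quadro_fp32 / 12   # 1:12
tesla_fp64 = tesla_fp32 / 32     # 1:32
tesla_fp16 = tesla_fp32 * 8      # 8:1

print(f"Quadro 4000M: {quadro_tex:.2f} GTexel/s, {quadro_fp32:.1f} GFLOPS FP32")
print(f"Tesla T4: {tesla_tex:.1f} GTexel/s, {tesla_fp32 / 1000:.3f} TFLOPS FP32")
```

These are theoretical peaks; real workloads depend on memory bandwidth, occupancy, and sustained clocks.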
AI/RT
RT Cores
—
40
Tensor Cores
—
320
Power
TDP
100 W
70 W
-30.0%
Suggested PSU
—
250 W
Power Connectors
None
None
Architecture
Architecture
Fermi
Turing
GPU Name
GF104
TU104
Generation
Quadro Fermi-M (x000M)
Tesla Turing (Txx)
Process Size
40 nm
12 nm
Transistors
1,950 million
13,600 million
Die Size
332 mm²
545 mm²
Foundry
TSMC
TSMC
Density
5.9M / mm²
25.0M / mm²
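The density row follows from dividing the transistor count by the die size. A brief Python check, assuming the listed figures are rounded to one decimal place (the function name is hypothetical):

```python
def density_m_per_mm2(transistors_millions: float, die_size_mm2: float) -> float:
    """Transistor density in millions of transistors per square millimetre."""
    return transistors_millions / die_size_mm2


# Transistor counts (millions) and die sizes (mm^2) from the rows above
gf104 = density_m_per_mm2(1950, 332)
tu104 = density_m_per_mm2(13600, 545)

print(f"GF104: {gf104:.1f}M / mm^2")
print(f"TU104: {tu104:.1f}M / mm^2")
```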
API Support
DirectX
12 (11_0)
12 Ultimate (12_2)
OpenGL
4.6
4.6
Vulkan
—
1.4
OpenCL
1.1
3.0
CUDA
2.1
7.5
Shader Model
5.1
6.8
Physical
Slot Width
MXM Module
Single-slot
Length
—
168 mm (6.6 inches)
Outputs
Portable Device Dependent
No outputs
Bus Interface
MXM-B (3.0)
PCIe 3.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
Quadro FX Mobile
Tesla Volta
Successor
Quadro Kepler-M
Server Ampere