GPU Comparison
GEFORCE
NVIDIA Quadro K4000M
CORE STATE
GK104
VRAM
4 GB
CLOCK SPEED
601 MHz
TDP
100 W
BUS WIDTH
256 bit
ARCHITECTURE
Kepler
PROCESS
28 nm
LAUNCH DATE
2012
VS
GEFORCE
Tesla M2090
CORE STATE
GF110
VRAM
6 GB
CLOCK SPEED
—
TDP
250 W
BUS WIDTH
384 bit
ARCHITECTURE
Fermi 2.0
PROCESS
40 nm
LAUNCH DATE
2011
PERFORMANCE BENCHMARKS
geekbench_opencl
DETAILED SPECIFICATIONS
SPECIFICATION
Quadro K4000M
Tesla M2090
Core Specs
Shading Units
960
512
-46.7%
Shaders
960
512
-46.7%
TMUs
80
64
-20.0%
ROPs
32
48
+50.0%
SM Count
—
16
Clocks
Base Clock
601 MHz
—
Boost Clock
601 MHz
—
GPU Clock
—
651 MHz
Shader Clock
—
1301 MHz
Memory Clock
700 MHz
2.8 Gbps effective
924 MHz
3.7 Gbps effective
Memory
Memory Size
4 GB
6 GB
VRAM (MB)
4,096
6,144
+50.0%
Memory Type
GDDR5
GDDR5
Memory Bus
256 bit
384 bit
Bandwidth
89.60 GB/s
177.4 GB/s
Cache
L1 Cache
16 KB (per SMX)
64 KB (per SM)
L2 Cache
512 KB
768 KB
Performance
Pixel Rate
12.02 GPixel/s
20.83 GPixel/s
Texture Rate
48.08 GTexel/s
41.66 GTexel/s
FP32 (TFLOPS)
1,153.9 GFLOPS
1,332.2 GFLOPS
FP64 (TFLOPS)
48.08 GFLOPS (1:24)
666.1 GFLOPS (1:2)
Power
TDP
100 W
250 W
TDP (W)
100
250
+150.0%
Suggested PSU
—
600 W
Power Connectors
None
1x 6-pin + 1x 8-pin
Architecture
Architecture
Kepler
Fermi 2.0
GPU Name
GK104
GF110
Generation
Quadro Kepler-M
(Kx000M)
Tesla Fermi
(x20xx)
Process Size
28 nm
40 nm
Transistors
3,540 million
3,000 million
Die Size
294 mm²
520 mm²
Foundry
TSMC
TSMC
Density
12.0M / mm²
5.8M / mm²
API Support
DirectX
12 (11_0)
12 (11_0)
OpenGL
4.6
4.6
Vulkan
1.2.175
—
OpenCL
3.0
1.1
CUDA
3.0
2.0
Shader Model
6.5 (5.1)
5.1
Physical
Slot Width
MXM Module
Dual-slot
Length
—
248 mm
9.8 inches
Outputs
Portable Device Dependent
No outputs
Bus Interface
MXM-B (3.0)
PCIe 2.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
Quadro Fermi-M
Tesla
Successor
Quadro Maxwell-M
Tesla Kepler