GPU Comparison
GEFORCE
NVIDIA P104-100
CORE STATE
GP104
VRAM
4 GB
CLOCK SPEED
1733 MHz
TDP
—
BUS WIDTH
256 bit
ARCHITECTURE
Pascal
PROCESS
16 nm
LAUNCH DATE
2017
VS
GEFORCE
Tesla M2090
CORE STATE
GF110
VRAM
6 GB
CLOCK SPEED
—
TDP
250 W
BUS WIDTH
384 bit
ARCHITECTURE
Fermi 2.0
PROCESS
40 nm
LAUNCH DATE
2011
PERFORMANCE BENCHMARKS
3dmark_3dmark_steel_nomad_dx12
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
P104-100
Tesla M2090
Core Specs
Shading Units
1,920
512
-73.3%
Shaders
1,920
512
-73.3%
TMUs
120
64
-46.7%
ROPs
64
48
-25.0%
SM Count
15
16
+6.7%
Clocks
Base Clock
1607 MHz
—
Boost Clock
1733 MHz
—
GPU Clock
—
651 MHz
Shader Clock
—
1301 MHz
Memory Clock
1251 MHz
10 Gbps effective
924 MHz
3.7 Gbps effective
Memory
Memory Size
4 GB
6 GB
VRAM (MB)
4,096
6,144
+50.0%
Memory Type
GDDR5X
GDDR5
Memory Bus
256 bit
384 bit
Bandwidth
320.3 GB/s
177.4 GB/s
Cache
L1 Cache
48 KB (per SM)
64 KB (per SM)
L2 Cache
2 MB
768 KB
Performance
Pixel Rate
110.9 GPixel/s
20.83 GPixel/s
Texture Rate
208.0 GTexel/s
41.66 GTexel/s
FP32 (TFLOPS)
6.655 TFLOPS
1,332.2 GFLOPS
FP64 (TFLOPS)
208.0 GFLOPS (1:32)
666.1 GFLOPS (1:2)
FP16 (TFLOPS)
104.0 GFLOPS (1:64)
—
Power
TDP
—
250 W
TDP (W)
—
250
Suggested PSU
200 W
600 W
Power Connectors
1x 8-pin
1x 6-pin + 1x 8-pin
Architecture
Architecture
Pascal
Fermi 2.0
GPU Name
GP104
GF110
Generation
Mining GPUs
Tesla Fermi
(x20xx)
Process Size
16 nm
40 nm
Transistors
7,200 million
3,000 million
Die Size
314 mm²
520 mm²
Foundry
TSMC
TSMC
Density
22.9M / mm²
5.8M / mm²
API Support
DirectX
12 (12_1)
12 (11_0)
OpenGL
4.6
4.6
Vulkan
1.4
—
OpenCL
3.0
1.1
CUDA
6.1
2.0
Shader Model
6.8
5.1
Physical
Slot Width
Dual-slot
Dual-slot
Length
267 mm
10.5 inches
248 mm
9.8 inches
Outputs
No outputs
No outputs
Bus Interface
PCIe 1.0 x4
PCIe 2.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
—
Tesla
Successor
—
Tesla Kepler