GPU Comparison
GEFORCE
NVIDIA P104-100
CORE STATE
GP104
VRAM
4 GB
CLOCK SPEED
1733 MHz
TDP
—
BUS WIDTH
256 bit
ARCHITECTURE
Pascal
PROCESS
16 nm
LAUNCH DATE
2017
VS
GEFORCE
Tesla C2070
CORE STATE
GF100
VRAM
6 GB
CLOCK SPEED
—
TDP
238 W
BUS WIDTH
384 bit
ARCHITECTURE
Fermi
PROCESS
40 nm
LAUNCH DATE
2011
PERFORMANCE BENCHMARKS
3dmark_3dmark_steel_nomad_dx12
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
P104-100
Tesla C2070
Core Specs
Shading Units
1,920
448
-76.7%
Shaders
1,920
448
-76.7%
TMUs
120
56
-53.3%
ROPs
64
48
-25.0%
SM Count
15
14
-6.7%
Clocks
Base Clock
1607 MHz
—
Boost Clock
1733 MHz
—
GPU Clock
—
574 MHz
Shader Clock
—
1147 MHz
Memory Clock
1251 MHz
10 Gbps effective
747 MHz
3 Gbps effective
Memory
Memory Size
4 GB
6 GB
VRAM (MB)
4,096
6,144
+50.0%
Memory Type
GDDR5X
GDDR5
Memory Bus
256 bit
384 bit
Bandwidth
320.3 GB/s
143.4 GB/s
Cache
L1 Cache
48 KB (per SM)
64 KB (per SM)
L2 Cache
2 MB
768 KB
Performance
Pixel Rate
110.9 GPixel/s
16.07 GPixel/s
Texture Rate
208.0 GTexel/s
32.14 GTexel/s
FP32 (TFLOPS)
6.655 TFLOPS
1,027.7 GFLOPS
FP64 (TFLOPS)
208.0 GFLOPS (1:32)
513.9 GFLOPS (1:2)
FP16 (TFLOPS)
104.0 GFLOPS (1:64)
—
Power
TDP
—
238 W
TDP (W)
—
238
Suggested PSU
200 W
550 W
Power Connectors
1x 8-pin
1x 6-pin + 1x 8-pin
Architecture
Architecture
Pascal
Fermi
GPU Name
GP104
GF100
Generation
Mining GPUs
Tesla Fermi
(x20xx)
Process Size
16 nm
40 nm
Transistors
7,200 million
3,100 million
Die Size
314 mm²
529 mm²
Foundry
TSMC
TSMC
Density
22.9M / mm²
5.9M / mm²
API Support
DirectX
12 (12_1)
12 (11_0)
OpenGL
4.6
4.6
Vulkan
1.4
—
OpenCL
3.0
1.1
CUDA
6.1
2.0
Shader Model
6.8
5.1
Physical
Slot Width
Dual-slot
Dual-slot
Length
267 mm
10.5 inches
248 mm
9.8 inches
Outputs
No outputs
1x DVI
Bus Interface
PCIe 1.0 x4
PCIe 2.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
—
Tesla
Successor
—
Tesla Kepler