GPU Comparison
RADEON
AMD Radeon Pro Duo
CORE STATE
Capsaicin
VRAM
4 GB
CLOCK SPEED
—
TDP
350 W
BUS WIDTH
4096 bit
ARCHITECTURE
GCN 3.0
PROCESS
28 nm
LAUNCH DATE
2016
VS
GEFORCE
Tesla P40
CORE STATE
GP102
VRAM
24 GB
CLOCK SPEED
1531 MHz
TDP
250 W
BUS WIDTH
384 bit
ARCHITECTURE
Pascal
PROCESS
16 nm
LAUNCH DATE
2016
PERFORMANCE BENCHMARKS
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
Pro Duo
Tesla P40
Core Specs
Shading Units
4,096
3,840
-6.3%
Shaders
4,096
3,840
-6.3%
TMUs
256
240
-6.3%
ROPs
64
96
+50.0%
Compute Units
64
—
SM Count
—
30
Clocks
Base Clock
—
1303 MHz
Boost Clock
—
1531 MHz
GPU Clock
1000 MHz
—
Memory Clock
500 MHz
1000 Mbps effective
1808 MHz
7.2 Gbps effective
Memory
Memory Size
4 GB
24 GB
VRAM (MB)
4,096
24,576
+500.0%
Memory Type
HBM
GDDR5
Memory Bus
4096 bit
384 bit
Bandwidth
512.0 GB/s
347.1 GB/s
Cache
L1 Cache
16 KB (per CU)
48 KB (per SM)
L2 Cache
2 MB
3 MB
Performance
Pixel Rate
64.00 GPixel/s
147.0 GPixel/s
Texture Rate
256.0 GTexel/s
367.4 GTexel/s
FP32 (TFLOPS)
8.192 TFLOPS
11.76 TFLOPS
FP64 (TFLOPS)
512.0 GFLOPS (1:16)
367.4 GFLOPS (1:32)
FP16 (TFLOPS)
8.192 TFLOPS (1:1)
183.7 GFLOPS (1:64)
Power
TDP
350 W
250 W
TDP (W)
350
250
-28.6%
Suggested PSU
750 W
600 W
Power Connectors
3x 8-pin
8-pin EPS
Architecture
Architecture
GCN 3.0
Pascal
GPU Name
Capsaicin
GP102
Generation
Radeon Pro GCN
Tesla Pascal
(Pxx)
Process Size
28 nm
16 nm
Transistors
8,900 million
11,800 million
Die Size
596 mm²
471 mm²
Foundry
TSMC
TSMC
Density
14.9M / mm²
25.1M / mm²
API Support
DirectX
12 (12_0)
12 (12_1)
OpenGL
4.6
4.6
Vulkan
1.2.170
1.4
OpenCL
2.1
3.0
CUDA
—
6.1
Shader Model
6.5
6.8
Physical
Slot Width
Dual-slot
Dual-slot
Length
277 mm
10.9 inches
267 mm
10.5 inches
Height
111 mm
4.4 inches
111 mm
4.4 inches
Outputs
1x HDMI 1.4a3x DisplayPort 1.2
No outputs
Bus Interface
PCIe 3.0 x16
PCIe 3.0 x16
Other
Launch Price
1,499 USD
5,699 USD
Production
End-of-life
End-of-life
Predecessor
FirePro GCN
Tesla Maxwell
Successor
Radeon Pro Polaris
Tesla Volta