GPU Comparison
GEFORCE
NVIDIA P104-100
GPU NAME: GP104
VRAM: 4 GB
BOOST CLOCK: 1733 MHz
TDP: n/a
BUS WIDTH: 256 bit
ARCHITECTURE: Pascal
PROCESS: 16 nm
LAUNCH DATE: 2017
VS
GEFORCE
RTX A400
GPU NAME: GA107
VRAM: 4 GB
BOOST CLOCK: 1762 MHz
TDP: 50 W
BUS WIDTH: 64 bit
ARCHITECTURE: Ampere
PROCESS: 8 nm
LAUNCH DATE: 2024
PERFORMANCE BENCHMARKS
Benchmarks charted: 3DMark Steel Nomad (DX12), Geekbench OpenCL, Geekbench Vulkan
DETAILED SPECIFICATIONS
SPECIFICATION | P104-100 | RTX A400 | DIFFERENCE
Core Specs
Shading Units | 1,920 | 768 | -60.0%
TMUs | 120 | 24 | -80.0%
ROPs | 64 | 16 | -75.0%
SM Count | 15 | 6 | -60.0%
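The percentage column appears to be the relative change of the RTX A400 value against the P104-100 value. A minimal sketch under that assumption (the helper name `percent_diff` is hypothetical, not something the page defines):

```python
def percent_diff(baseline: float, other: float) -> float:
    """Relative change of `other` versus `baseline`, in percent."""
    return (other - baseline) / baseline * 100.0


# (P104-100, RTX A400) pairs taken from the Core Specs rows.
core_specs = {
    "Shading Units": (1920, 768),
    "TMUs": (120, 24),
    "ROPs": (64, 16),
    "SM Count": (15, 6),
}

# Round to one decimal place, matching the precision shown in the table.
diffs = {name: round(percent_diff(a, b), 1) for name, (a, b) in core_specs.items()}

for name, d in diffs.items():
    print(f"{name}: {d:+.1f}%")
```

Treating the P104-100 as the baseline is inferred from the signs in the table; swapping the arguments would give the A400-relative view instead.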
Clocks
Base Clock | 1607 MHz | 1417 MHz
Boost Clock | 1733 MHz | 1762 MHz
Memory Clock | 1251 MHz (10 Gbps effective) | 1500 MHz (12 Gbps effective)
Memory
Memory Size | 4 GB (4,096 MB) | 4 GB (4,096 MB) | 0.0%
Memory Type | GDDR5X | GDDR6
Memory Bus | 256 bit | 64 bit
Bandwidth | 320.3 GB/s | 96.00 GB/s
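The Bandwidth figures are consistent with the usual peak formula: per-pin data rate times bus width, divided by 8 bits per byte. The sketch below assumes 8 transfers per listed memory clock for both cards, which is inferred from the table's clock-to-effective-rate ratio (1251 MHz to about 10 Gbps, 1500 MHz to 12 Gbps); the function name is hypothetical.

```python
def bandwidth_gb_s(memory_clock_mhz: float, bus_width_bits: int,
                   transfers_per_clock: int = 8) -> float:
    """Theoretical peak memory bandwidth in GB/s.

    transfers_per_clock=8 is an assumption inferred from the table,
    not a value stated on the page.
    """
    data_rate_mbps = memory_clock_mhz * transfers_per_clock  # per pin, Mbit/s
    return data_rate_mbps * bus_width_bits / 8 / 1000        # bits -> bytes, MB -> GB


p104_bw = bandwidth_gb_s(1251, 256)  # P104-100: GDDR5X on a 256-bit bus
a400_bw = bandwidth_gb_s(1500, 64)   # RTX A400: GDDR6 on a 64-bit bus

print(f"P104-100: {p104_bw:.1f} GB/s")
print(f"RTX A400: {a400_bw:.2f} GB/s")
```

The A400's faster memory does not make up for a bus one quarter as wide, which is why its bandwidth comes out at less than a third of the older card's.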
Cache
L1 Cache | 48 KB (per SM) | 128 KB (per SM)
L2 Cache | 2 MB | 2 MB
Performance
Pixel Rate | 110.9 GPixel/s | 28.19 GPixel/s
Texture Rate | 208.0 GTexel/s | 42.29 GTexel/s
FP32 | 6.655 TFLOPS | 2.706 TFLOPS
FP64 | 208.0 GFLOPS (1:32) | 42.29 GFLOPS (1:64)
FP16 | 104.0 GFLOPS (1:64) | 2.706 TFLOPS (1:1)
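These Performance values look like theoretical peaks derived from unit counts and the boost clock rather than measured results. A rough sketch, assuming pixel rate = ROPs x boost clock, texture rate = TMUs x boost clock, and FP32 = shading units x 2 FLOPs per clock (one fused multiply-add); the function and key names are hypothetical:

```python
def peak_rates(shaders: int, tmus: int, rops: int, boost_mhz: float,
               flops_per_shader_per_clock: int = 2) -> dict:
    """Theoretical peak throughput from unit counts and boost clock.

    flops_per_shader_per_clock=2 assumes one FMA per shader per clock.
    """
    return {
        "pixel_gpixel_s": rops * boost_mhz / 1000,
        "texture_gtexel_s": tmus * boost_mhz / 1000,
        "fp32_tflops": shaders * flops_per_shader_per_clock * boost_mhz / 1e6,
    }


p104 = peak_rates(shaders=1920, tmus=120, rops=64, boost_mhz=1733)
a400 = peak_rates(shaders=768, tmus=24, rops=16, boost_mhz=1762)

# FP64 follows from the FP32 peak and the ratios listed in the table.
p104_fp64_gflops = p104["fp32_tflops"] * 1000 / 32  # 1:32
a400_fp64_gflops = a400["fp32_tflops"] * 1000 / 64  # 1:64

for name, rates in (("P104-100", p104), ("RTX A400", a400)):
    print(name, {k: round(v, 3) for k, v in rates.items()})
```

Because these are peak figures, they say nothing about sustained performance, which also depends on memory bandwidth, power limits, and the workload.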
AI/RT
RT Cores | n/a | 6
Tensor Cores | n/a | 24
Power
TDP | n/a | 50 W
Suggested PSU | 200 W | 250 W
Power Connectors | 1x 8-pin | None
Architecture
Architecture | Pascal | Ampere
GPU Name | GP104 | GA107
Generation | Mining GPUs | Workstation Ampere (Ax000)
Process Size | 16 nm | 8 nm
Transistors | 7,200 million | 8,700 million
Die Size | 314 mm² | 200 mm²
Foundry | TSMC | Samsung
Density | 22.9M / mm² | 43.5M / mm²
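The Density row follows directly from the Transistors and Die Size rows. A short check, with a hypothetical helper name:

```python
def density_m_per_mm2(transistors_millions: float, die_size_mm2: float) -> float:
    """Transistor density in millions of transistors per square millimetre."""
    return transistors_millions / die_size_mm2


gp104_density = density_m_per_mm2(7200, 314)  # P104-100 (GP104)
ga107_density = density_m_per_mm2(8700, 200)  # RTX A400 (GA107)

print(f"GP104: {gp104_density:.1f}M / mm²")
print(f"GA107: {ga107_density:.1f}M / mm²")
```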
API Support
DirectX | 12 (12_1) | 12 Ultimate (12_2)
OpenGL | 4.6 | 4.6
Vulkan | 1.4 | 1.4
OpenCL | 3.0 | 3.0
CUDA Compute Capability | 6.1 | 8.6
Shader Model | 6.8 | 6.8
Physical
Slot Width | Dual-slot | Single-slot
Length | 267 mm (10.5 inches) | 163 mm (6.4 inches)
Height | n/a | 69 mm (2.7 inches)
Outputs | No outputs | 4x mini-DisplayPort 1.4a
Bus Interface | PCIe 1.0 x4 | PCIe 4.0 x8
Other
Production | End-of-life | Active
Predecessor | n/a | Quadro Turing
Successor | n/a | Workstation Ada