GPU Comparison
GEFORCE
NVIDIA A2
CORE STATE
GA107
VRAM
16 GB
CLOCK SPEED
1770 MHz
TDP
60 W
BUS WIDTH
128 bit
ARCHITECTURE
Ampere
PROCESS
8 nm
LAUNCH DATE
2021
VS
GEFORCE
Tesla P4
CORE STATE
GP104
VRAM
8 GB
CLOCK SPEED
1114 MHz
TDP
75 W
BUS WIDTH
256 bit
ARCHITECTURE
Pascal
PROCESS
16 nm
LAUNCH DATE
2016
PERFORMANCE BENCHMARKS
[Benchmark charts: Geekbench OpenCL, Geekbench Vulkan]
DETAILED SPECIFICATIONS
SPECIFICATION
A2
Tesla P4
Difference (Tesla P4 relative to A2)
Core Specs
Shading Units
1,280
2,560
+100.0%
TMUs
40
160
+300.0%
ROPs
32
64
+100.0%
SM Count
10
20
+100.0%
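The percentage shown on some rows appears to be the Tesla P4 value expressed relative to the A2 value. That reading is inferred from the listed numbers rather than stated by the page. A minimal sketch under that assumption, where `percent_difference` is a hypothetical helper name:

```python
def percent_difference(a2_value: float, p4_value: float) -> float:
    """Relative change of the Tesla P4 value versus the A2 value, in percent.

    Assumption: the A2 column is the baseline (inferred from the table).
    """
    return (p4_value - a2_value) / a2_value * 100.0


# Core-spec rows as listed above: (A2, Tesla P4)
core_specs = {
    "Shading Units": (1280, 2560),
    "TMUs": (40, 160),
    "ROPs": (32, 64),
    "SM Count": (10, 20),
}

for name, (a2, p4) in core_specs.items():
    print(f"{name}: {percent_difference(a2, p4):+.1f}%")
```

The same helper reproduces the memory and power rows: 16,384 MB versus 8,192 MB gives -50.0%, and 60 W versus 75 W gives +25.0%.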
Clocks
Base Clock
1440 MHz
886 MHz
Boost Clock
1770 MHz
1114 MHz
Memory Clock
1563 MHz (12.5 Gbps effective)
1502 MHz (6 Gbps effective)
Memory
Memory Size
16 GB (16,384 MB)
8 GB (8,192 MB)
-50.0%
Memory Type
GDDR6
GDDR5
Memory Bus
128 bit
256 bit
Bandwidth
200.1 GB/s
192.3 GB/s
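The bandwidth figures follow from the memory clock and bus width listed above. The sketch below assumes 8 transfers per listed clock for GDDR6 and 4 for GDDR5. Those multipliers are inferred from the table (1563 MHz to 12.5 Gbps, 1502 MHz to 6 Gbps), not taken from a vendor specification, and `bandwidth_gb_s` is a hypothetical helper name.

```python
def bandwidth_gb_s(memory_clock_mhz: float,
                   transfers_per_clock: int,
                   bus_width_bits: int) -> float:
    """Theoretical memory bandwidth in GB/s.

    effective rate per pin (Gbps) = clock (MHz) * transfers per clock / 1000
    bandwidth (GB/s)              = effective rate * bus width (bits) / 8
    """
    effective_gbps = memory_clock_mhz * transfers_per_clock / 1000.0
    return effective_gbps * bus_width_bits / 8.0


# Assumed multipliers: 8 for GDDR6 (A2), 4 for GDDR5 (Tesla P4)
a2_bandwidth = bandwidth_gb_s(1563, 8, 128)
p4_bandwidth = bandwidth_gb_s(1502, 4, 256)

print(f"A2:       {a2_bandwidth:.1f} GB/s")
print(f"Tesla P4: {p4_bandwidth:.1f} GB/s")
```

Despite having half the bus width, the A2 ends up slightly ahead because its per-pin data rate is roughly double.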
Cache
L1 Cache
128 KB (per SM)
48 KB (per SM)
L2 Cache
2 MB
2 MB
Performance
Pixel Rate
56.64 GPixel/s
71.30 GPixel/s
Texture Rate
70.80 GTexel/s
178.2 GTexel/s
FP32
4.531 TFLOPS
5.704 TFLOPS
FP64
70.80 GFLOPS (1:64)
178.2 GFLOPS (1:32)
FP16
4.531 TFLOPS (1:1)
89.12 GFLOPS (1:64)
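The performance rows are theoretical peaks that can be rederived from the unit counts and boost clocks listed earlier. The sketch below assumes the usual conventions: one pixel per ROP per clock, one texel per TMU per clock, and two FP32 operations per shader per clock (a fused multiply-add). The `Gpu` class and its method names are illustrative only.

```python
from dataclasses import dataclass


@dataclass
class Gpu:
    name: str
    shaders: int
    tmus: int
    rops: int
    boost_mhz: float
    fp64_divisor: int  # FP64 rate is FP32 rate / divisor (from the ratios above)
    fp16_divisor: int  # FP16 rate is FP32 rate / divisor (from the ratios above)

    def pixel_rate(self) -> float:
        """GPixel/s, assuming one pixel per ROP per clock."""
        return self.rops * self.boost_mhz / 1000.0

    def texture_rate(self) -> float:
        """GTexel/s, assuming one texel per TMU per clock."""
        return self.tmus * self.boost_mhz / 1000.0

    def fp32_gflops(self) -> float:
        """GFLOPS, assuming one FMA (2 operations) per shader per clock."""
        return 2 * self.shaders * self.boost_mhz / 1000.0

    def fp64_gflops(self) -> float:
        return self.fp32_gflops() / self.fp64_divisor

    def fp16_gflops(self) -> float:
        return self.fp32_gflops() / self.fp16_divisor


a2 = Gpu("A2", 1280, 40, 32, 1770, fp64_divisor=64, fp16_divisor=1)
p4 = Gpu("Tesla P4", 2560, 160, 64, 1114, fp64_divisor=32, fp16_divisor=64)

for gpu in (a2, p4):
    print(f"{gpu.name}: {gpu.pixel_rate():.2f} GPixel/s, "
          f"{gpu.texture_rate():.1f} GTexel/s, "
          f"FP32 {gpu.fp32_gflops() / 1000:.3f} TFLOPS, "
          f"FP64 {gpu.fp64_gflops():.1f} GFLOPS, "
          f"FP16 {gpu.fp16_gflops():.2f} GFLOPS")
```

These are upper bounds computed at boost clock; sustained throughput under the 60 W and 75 W power limits may be lower.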
AI/RT
RT Cores
10
—
Tensor Cores
40
—
Power
TDP
60 W
75 W
+25.0%
Suggested PSU
250 W
250 W
Power Connectors
None
None
Architecture
Architecture
Ampere
Pascal
GPU Name
GA107
GP104
Generation
Workstation Ampere (Ax000)
Tesla Pascal (Pxx)
Process Size
8 nm
16 nm
Transistors
8,700 million
7,200 million
Die Size
200 mm²
314 mm²
Foundry
Samsung
TSMC
Density
43.5M / mm²
22.9M / mm²
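The density row is simply the transistor count divided by the die area. A short sketch using the figures above, with `density_m_per_mm2` as a hypothetical helper name:

```python
def density_m_per_mm2(transistors_millions: float, die_size_mm2: float) -> float:
    """Transistor density in millions of transistors per square millimetre."""
    return transistors_millions / die_size_mm2


a2_density = density_m_per_mm2(8700, 200)   # GA107, Samsung 8 nm
p4_density = density_m_per_mm2(7200, 314)   # GP104, TSMC 16 nm

print(f"A2:       {a2_density:.1f}M / mm²")
print(f"Tesla P4: {p4_density:.1f}M / mm²")
print(f"Ratio:    {a2_density / p4_density:.2f}x")
```

On these numbers the GA107 packs roughly 1.9 times as many transistors per unit area as the GP104.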
API Support
DirectX
12 Ultimate (12_2)
12 (12_1)
OpenGL
4.6
4.6
Vulkan
1.4
1.4
OpenCL
3.0
3.0
CUDA Compute Capability
8.6
6.1
Shader Model
6.8
6.8
Physical
Slot Width
Single-slot
Single-slot
Length
—
168 mm (6.6 inches)
Outputs
No outputs
No outputs
Bus Interface
PCIe 4.0 x8
PCIe 3.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
Quadro Turing
Tesla Maxwell
Successor
Workstation Ada
Tesla Volta