GPU Comparison

NVIDIA
GEFORCE

NVIDIA A10M

CORE STATE GA102
VRAM 20 GB
CLOCK SPEED 1635 MHz
TDP 150 W
BUS WIDTH 320 bit
ARCHITECTURE Ampere
nm
PROCESS 8 nm
LAUNCH DATE
VS
NVIDIA
GEFORCE

P104-100

CORE STATE GP104
VRAM 4 GB
CLOCK SPEED 1733 MHz
TDP
BUS WIDTH 256 bit
ARCHITECTURE Pascal
nm
PROCESS 16 nm
LAUNCH DATE 2017

PERFORMANCE BENCHMARKS

geekbench_opencl
135,230
51,663
3dmark_3dmark_steel_nomad_dx12
N/A
1,413
geekbench_vulkan
N/A
45,165

DETAILED SPECIFICATIONS

SPECIFICATION
A10M
P104-100
Core Specs
Shading Units
7,168
1,920 -73.2%
Shaders
7,168
1,920 -73.2%
TMUs
224
120 -46.4%
ROPs
80
64 -20.0%
SM Count
56
15 -73.2%
Clocks
Base Clock
975 MHz
1607 MHz
Boost Clock
1635 MHz
1733 MHz
Memory Clock
1563 MHz 12.5 Gbps effective
1251 MHz 10 Gbps effective
Memory
Memory Size
20 GB
4 GB
VRAM (MB)
20,480
4,096 -80.0%
Memory Type
GDDR6
GDDR5X
Memory Bus
320 bit
256 bit
Bandwidth
500.2 GB/s
320.3 GB/s
Cache
L1 Cache
128 KB (per SM)
48 KB (per SM)
L2 Cache
6 MB
2 MB
Performance
Pixel Rate
130.8 GPixel/s
110.9 GPixel/s
Texture Rate
366.2 GTexel/s
208.0 GTexel/s
FP32 (TFLOPS)
23.44 TFLOPS
6.655 TFLOPS
FP64 (TFLOPS)
732.5 GFLOPS (1:32)
208.0 GFLOPS (1:32)
FP16 (TFLOPS)
23.44 TFLOPS (1:1)
104.0 GFLOPS (1:64)
AI/RT
RT Cores
56
Tensor Cores
224
Power
TDP
150 W
TDP (W)
150
Suggested PSU
450 W
200 W
Power Connectors
8-pin EPS
1x 8-pin
Architecture
Architecture
Ampere
Pascal
GPU Name
GA102
GP104
Generation
Server Ampere (Axx)
Mining GPUs
Process Size
8 nm
16 nm
Transistors
28,300 million
7,200 million
Die Size
628 mm²
314 mm²
Foundry
Samsung
TSMC
Density
45.1M / mm²
22.9M / mm²
API Support
DirectX
12 Ultimate (12_2)
12 (12_1)
OpenGL
4.6
4.6
Vulkan
1.4
1.4
OpenCL
3.0
3.0
CUDA
8.6
6.1
Shader Model
6.8
6.8
Physical
Slot Width
Single-slot
Dual-slot
Length
267 mm 10.5 inches
267 mm 10.5 inches
Height
112 mm 4.4 inches
Outputs
No outputs
No outputs
Bus Interface
PCIe 4.0 x16
PCIe 1.0 x4
Other
Production
End-of-life
End-of-life
Predecessor
Tesla Turing
Successor
Server Ada
View A10M Details View P104-100 Details