GPU Comparison
| Summary | NVIDIA A10M | Quadro 4000 |
|---|---|---|
| GPU | GA102 | GF100 |
| VRAM | 20 GB | 2 GB |
| Clock speed | 1635 MHz | — |
| TDP | 150 W | 142 W |
| Bus width | 320 bit | 256 bit |
| Architecture | Ampere | Fermi |
| Process | 8 nm | 40 nm |
| Launch date | — | 2010 |
PERFORMANCE BENCHMARKS
Geekbench OpenCL (chart; no scores listed)
DETAILED SPECIFICATIONS
Core Specs
| Specification | A10M | Quadro 4000 | Difference (Quadro 4000 vs A10M) |
|---|---|---|---|
| Shading Units | 7,168 | 256 | -96.4% |
| TMUs | 224 | 32 | -85.7% |
| ROPs | 80 | 32 | -60.0% |
| SM Count | 56 | 8 | -85.7% |
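The percentages shown for some rows (for example -96.4% for shading units) are consistent with the Quadro 4000 value expressed relative to the A10M value. A minimal sketch of that calculation follows; the function name `pct_diff` is hypothetical and the formula is inferred from the listed figures, not taken from the page.

```python
def pct_diff(baseline, other):
    """Relative difference of `other` versus `baseline`, in percent.

    Assumption: the A10M is the baseline and the Quadro 4000 is `other`.
    """
    return (other - baseline) / baseline * 100


# (A10M, Quadro 4000) pairs copied from the Core Specs rows
core_specs = {
    "Shading Units": (7168, 256),
    "TMUs": (224, 32),
    "ROPs": (80, 32),
    "SM Count": (56, 8),
}

for name, (a10m, quadro) in core_specs.items():
    print(f"{name}: {pct_diff(a10m, quadro):.1f}%")
```

The same formula also reproduces the -90.0% memory and -5.3% TDP figures listed further down.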
Clocks
| Specification | A10M | Quadro 4000 |
|---|---|---|
| Base Clock | 975 MHz | — |
| Boost Clock | 1635 MHz | — |
| GPU Clock | — | 475 MHz |
| Shader Clock | — | 950 MHz |
| Memory Clock | 1563 MHz (12.5 Gbps effective) | 702 MHz (2.8 Gbps effective) |
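Memory
| Specification | A10M | Quadro 4000 | Difference (Quadro 4000 vs A10M) |
|---|---|---|---|
| Memory Size | 20 GB (20,480 MB) | 2 GB (2,048 MB) | -90.0% |
| Memory Type | GDDR6 | GDDR5 | |
| Memory Bus | 320 bit | 256 bit | |
| Bandwidth | 500.2 GB/s | 89.86 GB/s | |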
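The bandwidth figures can be cross-checked from the memory clock and bus width. The sketch below assumes 8 transfers per clock for GDDR6 and 4 for GDDR5; these multipliers are inferred from the listed clocks and effective rates (1563 MHz to 12.5 Gbps, 702 MHz to 2.8 Gbps), and the function name is hypothetical.

```python
def bandwidth_gb_s(memory_clock_mhz, transfers_per_clock, bus_width_bits):
    """Peak memory bandwidth in GB/s.

    effective rate per pin (Mbps) = clock (MHz) * transfers per clock
    bandwidth = effective rate * bus width (bits) / 8 bits per byte
    """
    effective_mbps = memory_clock_mhz * transfers_per_clock
    return effective_mbps * bus_width_bits / 8 / 1000


# Assumed multipliers: 8 for GDDR6, 4 for GDDR5 (inferred, see above)
a10m = bandwidth_gb_s(1563, 8, 320)        # about 500.2 GB/s
quadro_4000 = bandwidth_gb_s(702, 4, 256)  # about 89.86 GB/s

print(f"A10M: {a10m:.1f} GB/s")
print(f"Quadro 4000: {quadro_4000:.2f} GB/s")
```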
Cache
| Specification | A10M | Quadro 4000 |
|---|---|---|
| L1 Cache | 128 KB (per SM) | 64 KB (per SM) |
| L2 Cache | 6 MB | 512 KB |
Performance
| Specification | A10M | Quadro 4000 |
|---|---|---|
| Pixel Rate | 130.8 GPixel/s | 7.600 GPixel/s |
| Texture Rate | 366.2 GTexel/s | 15.20 GTexel/s |
| FP32 | 23.44 TFLOPS | 486.4 GFLOPS |
| FP64 | 732.5 GFLOPS (1:32) | 243.2 GFLOPS (1:2) |
| FP16 | 23.44 TFLOPS (1:1) | — |
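Most of the throughput figures can be reproduced from the unit counts and clocks listed earlier. The sketch below assumes 2 FP32 operations per shading unit per clock (one fused multiply-add), the boost clock for the A10M, and the shader clock (FP32) or GPU clock (texture rate) for the Quadro 4000. These choices are inferred from the numbers, and the function names are hypothetical. The Quadro 4000 pixel rate of 7.600 GPixel/s does not equal ROPs times GPU clock, so it is not reproduced here.

```python
def fp32_gflops(shading_units, clock_mhz, flops_per_clock=2):
    """Peak FP32 throughput in GFLOPS (assumes one FMA = 2 FLOPs per clock)."""
    return shading_units * flops_per_clock * clock_mhz / 1000


def fill_rate(units, clock_mhz):
    """Peak fill rate in G(pixels or texels)/s: units * clock."""
    return units * clock_mhz / 1000


# A10M: 7,168 shading units, 224 TMUs, 80 ROPs, 1635 MHz boost clock
a10m_fp32 = fp32_gflops(7168, 1635)   # 23439.36 GFLOPS, about 23.44 TFLOPS
a10m_fp64 = a10m_fp32 / 32            # 1:32 ratio, about 732.5 GFLOPS
a10m_texture = fill_rate(224, 1635)   # about 366.2 GTexel/s
a10m_pixel = fill_rate(80, 1635)      # about 130.8 GPixel/s

# Quadro 4000: 256 shading units at 950 MHz shader clock, 32 TMUs at 475 MHz
quadro_fp32 = fp32_gflops(256, 950)   # 486.4 GFLOPS
quadro_fp64 = quadro_fp32 / 2         # 1:2 ratio, 243.2 GFLOPS
quadro_texture = fill_rate(32, 475)   # 15.2 GTexel/s

print(f"A10M FP32: {a10m_fp32 / 1000:.2f} TFLOPS")
print(f"Quadro 4000 FP32: {quadro_fp32:.1f} GFLOPS")
```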
AI/RT
| Specification | A10M | Quadro 4000 |
|---|---|---|
| RT Cores | 56 | — |
| Tensor Cores | 224 | — |
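Power
| Specification | A10M | Quadro 4000 | Difference (Quadro 4000 vs A10M) |
|---|---|---|---|
| TDP | 150 W | 142 W | -5.3% |
| Suggested PSU | 450 W | 300 W | |
| Power Connectors | 8-pin EPS | 1x 6-pin | |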
Architecture
| Specification | A10M | Quadro 4000 |
|---|---|---|
| Architecture | Ampere | Fermi |
| GPU Name | GA102 | GF100 |
| Generation | Server Ampere (Axx) | Quadro Fermi (x000) |
| Process Size | 8 nm | 40 nm |
| Transistors | 28,300 million | 3,100 million |
| Die Size | 628 mm² | 529 mm² |
| Foundry | Samsung | TSMC |
| Density | 45.1M / mm² | 5.9M / mm² |
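The density row follows directly from the transistor count and die size. A short sketch, with a hypothetical function name:

```python
def density_m_per_mm2(transistors_million, die_size_mm2):
    """Transistor density in millions of transistors per square millimetre."""
    return transistors_million / die_size_mm2


ga102 = density_m_per_mm2(28_300, 628)  # about 45.1M / mm²
gf100 = density_m_per_mm2(3_100, 529)   # about 5.9M / mm²

print(f"GA102: {ga102:.1f}M / mm²")
print(f"GF100: {gf100:.1f}M / mm²")
```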
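API Support
| Specification | A10M | Quadro 4000 |
|---|---|---|
| DirectX | 12 Ultimate (12_2) | 12 (11_0) |
| OpenGL | 4.6 | 4.6 |
| Vulkan | 1.4 | — |
| OpenCL | 3.0 | 1.1 |
| CUDA | 8.6 | 2.0 |
| Shader Model | 6.8 | 5.1 |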
Physical
| Specification | A10M | Quadro 4000 |
|---|---|---|
| Slot Width | Single-slot | Single-slot |
| Length | 267 mm (10.5 inches) | 241 mm (9.5 inches) |
| Height | 112 mm (4.4 inches) | 111 mm (4.4 inches) |
| Outputs | No outputs | 1x DVI, 2x DisplayPort |
| Bus Interface | PCIe 4.0 x16 | PCIe 2.0 x16 |
Other
| Specification | A10M | Quadro 4000 |
|---|---|---|
| Launch Price | — | 1,199 USD |
| Production | End-of-life | End-of-life |
| Predecessor | Tesla Turing | Quadro FX Tesla |
| Successor | Server Ada | Quadro Kepler |