GPU Comparison

NVIDIA A10M

CORE STATE GA102
VRAM 20 GB
CLOCK SPEED 1635 MHz
TDP 150 W
BUS WIDTH 320 bit
ARCHITECTURE Ampere
PROCESS 8 nm
LAUNCH DATE
VS
NVIDIA Quadro 4000M

CORE STATE GF104
VRAM 2 GB
CLOCK SPEED
TDP 100 W
BUS WIDTH 256 bit
ARCHITECTURE Fermi
PROCESS 40 nm
LAUNCH DATE 2011

PERFORMANCE BENCHMARKS

Geekbench OpenCL
A10M 135,230
Quadro 4000M 5,212
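
To put the two Geekbench OpenCL scores in proportion, the short Python sketch below computes the ratio between them and the relative difference. The scores are copied from the listing above; the helper names `speedup` and `relative_delta_percent` are illustrative and not part of any benchmark tool.

```python
# Geekbench OpenCL scores as listed above.
SCORES = {"A10M": 135_230, "Quadro 4000M": 5_212}


def speedup(fast: float, slow: float) -> float:
    """Return how many times larger `fast` is than `slow`."""
    return fast / slow


def relative_delta_percent(value: float, baseline: float) -> float:
    """Return the difference of `value` from `baseline`, in percent of baseline."""
    return (value - baseline) / baseline * 100.0


ratio = speedup(SCORES["A10M"], SCORES["Quadro 4000M"])
delta = relative_delta_percent(SCORES["Quadro 4000M"], SCORES["A10M"])

print(f"A10M / Quadro 4000M score ratio: {ratio:.2f}x")
print(f"Quadro 4000M relative to A10M: {delta:.1f}%")
```

A single synthetic benchmark is only a rough indicator, so the ratio should be read as an order-of-magnitude comparison rather than a prediction for a specific workload.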

DETAILED SPECIFICATIONS

Percentages give the Quadro 4000M value relative to the A10M. A dash marks a value that is not listed for that card.

SPECIFICATION            A10M                              Quadro 4000M

Core Specs
Shading Units (Shaders)  7,168                             336 (-95.3%)
TMUs                     224                               56 (-75.0%)
ROPs                     80                                32 (-60.0%)
SM Count                 56                                7 (-87.5%)

Clocks
Base Clock               975 MHz                           -
Boost Clock              1635 MHz                          -
GPU Clock                -                                 475 MHz
Shader Clock             -                                 950 MHz
Memory Clock             1563 MHz (12.5 Gbps effective)    625 MHz (2.5 Gbps effective)

Memory
Memory Size              20 GB (20,480 MB)                 2 GB (2,048 MB, -90.0%)
Memory Type              GDDR6                             GDDR5
Memory Bus               320 bit                           256 bit
Bandwidth                500.2 GB/s                        80.00 GB/s

Cache
L1 Cache                 128 KB (per SM)                   64 KB (per SM)
L2 Cache                 6 MB                              512 KB

Performance
Pixel Rate               130.8 GPixel/s                    6.650 GPixel/s
Texture Rate             366.2 GTexel/s                    26.60 GTexel/s
FP32                     23.44 TFLOPS                      638.4 GFLOPS
FP64                     732.5 GFLOPS (1:32)               53.20 GFLOPS (1:12)
FP16                     23.44 TFLOPS (1:1)                -

AI/RT
RT Cores                 56                                -
Tensor Cores             224                               -

Power
TDP                      150 W                             100 W (-33.3%)
Suggested PSU            450 W                             -
Power Connectors         8-pin EPS                         None

Architecture
Architecture             Ampere                            Fermi
GPU Name                 GA102                             GF104
Generation               Server Ampere (Axx)               Quadro Fermi-M (x000M)
Process Size             8 nm                              40 nm
Transistors              28,300 million                    1,950 million
Die Size                 628 mm²                           332 mm²
Foundry                  Samsung                           TSMC
Density                  45.1M / mm²                       5.9M / mm²

API Support
DirectX                  12 Ultimate (12_2)                12 (11_0)
OpenGL                   4.6                               4.6
Vulkan                   1.4                               -
OpenCL                   3.0                               1.1
CUDA (compute capability) 8.6                              2.1
Shader Model             6.8                               5.1

Physical
Slot Width               Single-slot                       MXM Module
Length                   267 mm (10.5 inches)              -
Height                   112 mm (4.4 inches)               -
Outputs                  No outputs                        Portable Device Dependent
Bus Interface            PCIe 4.0 x16                      MXM-B (3.0)

Other
Production               End-of-life                       End-of-life
Predecessor              Tesla Turing                      Quadro FX Mobile
Successor                Server Ada                        Quadro Kepler-M
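
Several of the figures in the Memory, Performance and Architecture rows can be reproduced from the more basic ones. The Python sketch below is a rough cross-check, not the method the listing itself uses. It assumes that effective memory data rate is the listed memory clock times 8 for GDDR6 and times 4 for GDDR5 (multipliers inferred from the listed "Gbps effective" values), that FP32 throughput is 2 operations per shader per clock, and that the relevant clock is the A10M boost clock and the Quadro 4000M GPU or shader clock. All function names are illustrative.

```python
def bandwidth_gb_s(mem_clock_mhz: float, transfers_per_clock: int, bus_bits: int) -> float:
    """Memory bandwidth in GB/s: per-pin data rate (Mbps) times bus width in bytes."""
    data_rate_mbps = mem_clock_mhz * transfers_per_clock
    return data_rate_mbps * bus_bits / 8 / 1000


def fp32_gflops(shaders: int, clock_mhz: float) -> float:
    """Peak FP32 in GFLOPS, assuming one fused multiply-add (2 ops) per shader per clock."""
    return 2 * shaders * clock_mhz / 1000


def texture_rate_gtexel_s(tmus: int, clock_mhz: float) -> float:
    """Peak texture fill rate in GTexel/s: one texel per TMU per clock."""
    return tmus * clock_mhz / 1000


def density_m_per_mm2(transistors_million: float, die_mm2: float) -> float:
    """Transistor density in millions of transistors per square millimetre."""
    return transistors_million / die_mm2


def relative_delta_percent(value: float, baseline: float) -> float:
    """Difference of `value` from `baseline`, in percent of baseline."""
    return (value - baseline) / baseline * 100.0


# A10M: GDDR6 (assumed 8 transfers per listed clock), boost clock 1635 MHz.
a10m_bw = bandwidth_gb_s(1563, 8, 320)
a10m_fp32 = fp32_gflops(7168, 1635)
a10m_tex = texture_rate_gtexel_s(224, 1635)
a10m_density = density_m_per_mm2(28_300, 628)

# Quadro 4000M: GDDR5 (assumed 4 transfers per listed clock),
# shader clock 950 MHz for FP32, GPU clock 475 MHz for texturing.
q4000m_bw = bandwidth_gb_s(625, 4, 256)
q4000m_fp32 = fp32_gflops(336, 950)
q4000m_tex = texture_rate_gtexel_s(56, 475)
q4000m_density = density_m_per_mm2(1_950, 332)

shader_delta = relative_delta_percent(336, 7168)

print(f"A10M:         {a10m_bw:.1f} GB/s, {a10m_fp32 / 1000:.2f} TFLOPS, {a10m_tex:.1f} GTexel/s")
print(f"Quadro 4000M: {q4000m_bw:.1f} GB/s, {q4000m_fp32:.1f} GFLOPS, {q4000m_tex:.1f} GTexel/s")
print(f"Shading units, Quadro 4000M vs A10M: {shader_delta:.1f}%")
```

Pixel rate is left out on purpose: ROPs times clock reproduces the A10M figure, but the listed Quadro 4000M value is lower than that product, so a single formula does not cover both cards.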