GPU Comparison
GEFORCE
NVIDIA A10M
CORE STATE
GA102
VRAM
20 GB
CLOCK SPEED
1635 MHz
TDP
150 W
BUS WIDTH
320 bit
ARCHITECTURE
Ampere
PROCESS
8 nm
LAUNCH DATE
—
VS
GEFORCE
L40
CORE STATE
AD102
VRAM
48 GB
CLOCK SPEED
2490 MHz
TDP
300 W
BUS WIDTH
384 bit
ARCHITECTURE
Ada Lovelace
PROCESS
5 nm
LAUNCH DATE
2022
PERFORMANCE BENCHMARKS
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
A10M
L40
Core Specs
Shading Units
7,168
18,176
+153.6%
Shaders
7,168
18,176
+153.6%
TMUs
224
568
+153.6%
ROPs
80
192
+140.0%
SM Count
56
142
+153.6%
Clocks
Base Clock
975 MHz
735 MHz
Boost Clock
1635 MHz
2490 MHz
Memory Clock
1563 MHz
12.5 Gbps effective
2250 MHz
18 Gbps effective
Memory
Memory Size
20 GB
48 GB
VRAM (MB)
20,480
49,152
+140.0%
Memory Type
GDDR6
GDDR6
Memory Bus
320 bit
384 bit
Bandwidth
500.2 GB/s
864.0 GB/s
Cache
L1 Cache
128 KB (per SM)
128 KB (per SM)
L2 Cache
6 MB
96 MB
Performance
Pixel Rate
130.8 GPixel/s
478.1 GPixel/s
Texture Rate
366.2 GTexel/s
1,414.3 GTexel/s
FP32 (TFLOPS)
23.44 TFLOPS
90.52 TFLOPS
FP64 (TFLOPS)
732.5 GFLOPS (1:32)
1,414.3 GFLOPS (1:64)
FP16 (TFLOPS)
23.44 TFLOPS (1:1)
90.52 TFLOPS (1:1)
AI/RT
RT Cores
56
142
+153.6%
Tensor Cores
224
568
+153.6%
Power
TDP
150 W
300 W
TDP (W)
150
300
+100.0%
Suggested PSU
450 W
700 W
Power Connectors
8-pin EPS
1x 16-pin
Architecture
Architecture
Ampere
Ada Lovelace
GPU Name
GA102
AD102
Generation
Server Ampere
(Axx)
Server Ada
(Lxx)
Process Size
8 nm
5 nm
Transistors
28,300 million
76,300 million
Die Size
628 mm²
609 mm²
Foundry
Samsung
TSMC
Density
45.1M / mm²
125.3M / mm²
API Support
DirectX
12 Ultimate (12_2)
12 Ultimate (12_2)
OpenGL
4.6
4.6
Vulkan
1.4
1.4
OpenCL
3.0
3.0
CUDA
8.6
8.9
Shader Model
6.8
6.8
Physical
Slot Width
Single-slot
Dual-slot
Length
267 mm
10.5 inches
267 mm
10.5 inches
Height
112 mm
4.4 inches
111 mm
4.4 inches
Outputs
No outputs
4x DisplayPort 1.4a
Bus Interface
PCIe 4.0 x16
PCIe 4.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
Tesla Turing
Server Ampere
Successor
Server Ada
Server Hopper