GPU Comparison

NVIDIA A10M

GPU NAME GA102

VRAM 20 GB

BOOST CLOCK 1635 MHz

TDP 150 W

BUS WIDTH 320 bit

ARCHITECTURE Ampere

PROCESS 8 nm

LAUNCH DATE N/A

VS

NVIDIA L40

GPU NAME AD102

VRAM 48 GB

BOOST CLOCK 2490 MHz

TDP 300 W

BUS WIDTH 384 bit

ARCHITECTURE Ada Lovelace

PROCESS 5 nm

LAUNCH DATE 2022

PERFORMANCE BENCHMARKS

BENCHMARK | A10M | L40
Geekbench OpenCL | 135,230 | 330,683
Geekbench Vulkan | N/A | 232,627

DETAILED SPECIFICATIONS

SPECIFICATION | A10M | L40

Core Specs

Shading Units | 7,168 | 18,176 (+153.6%)
TMUs | 224 | 568 (+153.6%)
ROPs | 80 | 192 (+140.0%)
SM Count | 56 | 142 (+153.6%)
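The percentage beside each L40 value appears to be the relative increase over the A10M figure. The following is a minimal Python sketch under that assumption; the helper name `percent_delta` is hypothetical and not part of the source page.

```python
def percent_delta(baseline: float, other: float) -> float:
    """Relative change of `other` versus `baseline`, in percent."""
    return (other - baseline) / baseline * 100.0


# Values copied from the Core Specs rows (A10M first, L40 second).
core_specs = {
    "Shading Units": (7168, 18176),
    "TMUs": (224, 568),
    "ROPs": (80, 192),
    "SM Count": (56, 142),
}

for name, (a10m, l40) in core_specs.items():
    # Assumption: the page rounds the delta to one decimal place.
    print(f"{name}: {percent_delta(a10m, l40):+.1f}%")
```

Under this reading, the ROP count grows by a slightly smaller factor (2.4x) than the shader, TMU, and SM counts (about 2.54x), which matches the deltas listed.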

Clocks

Base Clock | 975 MHz | 735 MHz
Boost Clock | 1635 MHz | 2490 MHz
Memory Clock | 1563 MHz (12.5 Gbps effective) | 2250 MHz (18 Gbps effective)

Memory

Memory Size | 20 GB (20,480 MB) | 48 GB (49,152 MB, +140.0%)
Memory Type | GDDR6 | GDDR6
Memory Bus | 320 bit | 384 bit
Bandwidth | 500.2 GB/s | 864.0 GB/s
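The bandwidth figures are consistent with bus width multiplied by the effective data rate. The sketch below assumes the effective rate is 8 transfers per listed memory clock cycle, which reproduces the 12.5 Gbps and 18 Gbps values on this page; the function name and the `transfers_per_clock` parameter are hypothetical.

```python
def memory_bandwidth_gbs(bus_width_bits: int,
                         memory_clock_mhz: float,
                         transfers_per_clock: int = 8) -> float:
    """Theoretical peak memory bandwidth in GB/s.

    Assumption: effective per-pin rate = listed memory clock * 8,
    matching the "Gbps effective" values shown for both cards.
    """
    effective_gbps = memory_clock_mhz * transfers_per_clock / 1000.0
    bytes_per_transfer = bus_width_bits / 8  # bus width in bytes
    return bytes_per_transfer * effective_gbps


print(f"A10M: {memory_bandwidth_gbs(320, 1563):.1f} GB/s")
print(f"L40:  {memory_bandwidth_gbs(384, 2250):.1f} GB/s")
```

These are theoretical peaks; sustained bandwidth in real workloads is typically lower.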

Cache

L1 Cache | 128 KB (per SM) | 128 KB (per SM)
L2 Cache | 6 MB | 96 MB

Performance

Pixel Rate | 130.8 GPixel/s | 478.1 GPixel/s
Texture Rate | 366.2 GTexel/s | 1,414.3 GTexel/s
FP32 | 23.44 TFLOPS | 90.52 TFLOPS
FP64 | 732.5 GFLOPS (1:32) | 1,414.3 GFLOPS (1:64)
FP16 | 23.44 TFLOPS (1:1) | 90.52 TFLOPS (1:1)
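The Performance figures look like theoretical peaks derived from the unit counts and the boost clock. The sketch below assumes one pixel per ROP per clock, one texel per TMU per clock, and two FP32 operations per shading unit per clock (a fused multiply-add); the function `peak_rates` and its dictionary keys are hypothetical names.

```python
def peak_rates(shaders: int, tmus: int, rops: int,
               boost_mhz: int, fp64_ratio: int) -> dict:
    """Theoretical peak rates at boost clock (assumed formulas, see lead-in)."""
    pixel = rops * boost_mhz / 1000                # GPixel/s
    texture = tmus * boost_mhz / 1000              # GTexel/s
    fp32_gflops = shaders * 2 * boost_mhz / 1000   # 2 FLOPs per clock (FMA)
    return {
        "pixel_gpixel_s": round(pixel, 1),
        "texture_gtexel_s": round(texture, 1),
        "fp32_tflops": round(fp32_gflops / 1000, 2),
        # FP64 runs at 1/fp64_ratio of the FP32 rate (1:32 or 1:64 here).
        "fp64_gflops": round(fp32_gflops / fp64_ratio, 1),
    }


a10m = peak_rates(shaders=7168, tmus=224, rops=80, boost_mhz=1635, fp64_ratio=32)
l40 = peak_rates(shaders=18176, tmus=568, rops=192, boost_mhz=2490, fp64_ratio=64)
print("A10M:", a10m)
print("L40: ", l40)
```

The listed FP16 rate equals the FP32 rate (1:1), so it follows from the same calculation. Measured throughput depends on sustained clocks and workload, so these values are upper bounds.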

AI/RT

RT Cores | 56 | 142 (+153.6%)
Tensor Cores | 224 | 568 (+153.6%)

Power

TDP | 150 W | 300 W (+100.0%)
Suggested PSU | 450 W | 700 W
Power Connectors | 8-pin EPS | 1x 16-pin

Architecture

Architecture | Ampere | Ada Lovelace
GPU Name | GA102 | AD102
Generation | Server Ampere (Axx) | Server Ada (Lxx)
Process Size | 8 nm | 5 nm
Transistors | 28,300 million | 76,300 million
Die Size | 628 mm² | 609 mm²
Foundry | Samsung | TSMC
Density | 45.1M / mm² | 125.3M / mm²
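The Density row follows from the transistor count divided by the die area. A minimal Python sketch of that arithmetic, with `transistor_density` as a hypothetical helper name:

```python
def transistor_density(transistors_millions: float, die_area_mm2: float) -> float:
    """Transistor density in millions of transistors per square millimetre."""
    return transistors_millions / die_area_mm2


# Transistor counts (in millions) and die sizes from the rows above.
ga102 = transistor_density(28_300, 628)
ad102 = transistor_density(76_300, 609)

print(f"GA102: {ga102:.1f}M / mm²")
print(f"AD102: {ad102:.1f}M / mm²")
print(f"AD102 vs GA102: {ad102 / ga102:.2f}x")
```

The AD102 die is slightly smaller than GA102 yet holds roughly 2.7 times as many transistors, which is where most of the unit-count increase comes from.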

API Support

DirectX | 12 Ultimate (12_2) | 12 Ultimate (12_2)
OpenGL | 4.6 | 4.6
Vulkan | 1.4 | 1.4
OpenCL | 3.0 | 3.0
CUDA Compute Capability | 8.6 | 8.9
Shader Model | 6.8 | 6.8

Physical

Slot Width | Single-slot | Dual-slot
Length | 267 mm (10.5 inches) | 267 mm (10.5 inches)
Height | 112 mm (4.4 inches) | 111 mm (4.4 inches)
Outputs | No outputs | 4x DisplayPort 1.4a
Bus Interface | PCIe 4.0 x16 | PCIe 4.0 x16

Other

Production | End-of-life | End-of-life
Predecessor | Tesla Turing | Server Ampere
Successor | Server Ada | Server Hopper
