GPU Comparison

NVIDIA

GEFORCE

NVIDIA A10M

CORE STATE GA102

VRAM 20 GB

CLOCK SPEED 1635 MHz

TDP 150 W

BUS WIDTH 320 bit

ARCHITECTURE Ampere

nm

PROCESS 8 nm

LAUNCH DATE —

VS

NVIDIA

GEFORCE

Tesla P4

CORE STATE GP104

VRAM 8 GB

CLOCK SPEED 1114 MHz

TDP 75 W

BUS WIDTH 256 bit

ARCHITECTURE Pascal

nm

PROCESS 16 nm

LAUNCH DATE 2016

PERFORMANCE BENCHMARKS

geekbench_opencl

135,230

37,896

geekbench_vulkan

N/A

40,476

DETAILED SPECIFICATIONS

SPECIFICATION

A10M

Tesla P4

Core Specs

Shading Units

7,168

2,560 -64.3%

Shaders

7,168

2,560 -64.3%

TMUs

224

160 -28.6%

ROPs

80

64 -20.0%

SM Count

56

20 -64.3%

Clocks

Base Clock

975 MHz

886 MHz

Boost Clock

1635 MHz

1114 MHz

Memory Clock

1563 MHz 12.5 Gbps effective

1502 MHz 6 Gbps effective

Memory

Memory Size

20 GB

8 GB

VRAM (MB)

20,480

8,192 -60.0%

Memory Type

GDDR6

GDDR5

Memory Bus

320 bit

256 bit

Bandwidth

500.2 GB/s

192.3 GB/s

Cache

L1 Cache

128 KB (per SM)

48 KB (per SM)

L2 Cache

6 MB

2 MB

Performance

Pixel Rate

130.8 GPixel/s

71.30 GPixel/s

Texture Rate

366.2 GTexel/s

178.2 GTexel/s

FP32 (TFLOPS)

23.44 TFLOPS

5.704 TFLOPS

FP64 (TFLOPS)

732.5 GFLOPS (1:32)

178.2 GFLOPS (1:32)

FP16 (TFLOPS)

23.44 TFLOPS (1:1)

89.12 GFLOPS (1:64)

AI/RT

RT Cores

56

—

Tensor Cores

224

—

Power

TDP

150 W

75 W

TDP (W)

150

75 -50.0%

Suggested PSU

450 W

250 W

Power Connectors

8-pin EPS

None

Architecture

Architecture

Ampere

Pascal

GPU Name

GA102

GP104

Generation

Server Ampere (Axx)

Tesla Pascal (Pxx)

Process Size

8 nm

16 nm

Transistors

28,300 million

7,200 million

Die Size

628 mm²

314 mm²

Foundry

Samsung

TSMC

Density

45.1M / mm²

22.9M / mm²

API Support

DirectX

12 Ultimate (12_2)

12 (12_1)

OpenGL

4.6

4.6

Vulkan

1.4

1.4

OpenCL

3.0

3.0

CUDA

8.6

6.1

Shader Model

6.8

6.8

Physical

Slot Width

Single-slot

Single-slot

Length

267 mm 10.5 inches

168 mm 6.6 inches

Height

112 mm 4.4 inches

—

Outputs

No outputs

No outputs

Bus Interface

PCIe 4.0 x16

PCIe 3.0 x16

Other

Production

End-of-life

End-of-life

Predecessor

Tesla Turing

Tesla Maxwell

Successor

Server Ada

Tesla Volta

View A10M Details View Tesla P4 Details