GPU Comparison

NVIDIA

GEFORCE

NVIDIA A10G

CORE STATE GA102

VRAM 24 GB

CLOCK SPEED 1710 MHz

TDP 150 W

BUS WIDTH 384 bit

ARCHITECTURE Ampere

nm

PROCESS 8 nm

LAUNCH DATE 2021

VS

NVIDIA

GEFORCE

Tesla M40

CORE STATE GM200

VRAM 12 GB

CLOCK SPEED 1112 MHz

TDP 250 W

BUS WIDTH 384 bit

ARCHITECTURE Maxwell 2.0

nm

PROCESS 28 nm

LAUNCH DATE 2015

PERFORMANCE BENCHMARKS

geekbench_opencl

158,063

39,192

geekbench_vulkan

145,863

44,602

DETAILED SPECIFICATIONS

SPECIFICATION

A10G

Tesla M40

Core Specs

Shading Units

9,216

3,072 -66.7%

Shaders

9,216

3,072 -66.7%

TMUs

288

192 -33.3%

ROPs

96

96 0.0%

SM Count

72

—

Clocks

Base Clock

1320 MHz

948 MHz

Boost Clock

1710 MHz

1112 MHz

Memory Clock

1563 MHz 12.5 Gbps effective

1502 MHz 6 Gbps effective

Memory

Memory Size

24 GB

12 GB

VRAM (MB)

24,576

12,288 -50.0%

Memory Type

GDDR6

GDDR5

Memory Bus

384 bit

384 bit

Bandwidth

600.2 GB/s

288.4 GB/s

Cache

L1 Cache

128 KB (per SM)

48 KB (per SMM)

L2 Cache

6 MB

3 MB

Performance

Pixel Rate

164.2 GPixel/s

106.8 GPixel/s

Texture Rate

492.5 GTexel/s

213.5 GTexel/s

FP32 (TFLOPS)

31.52 TFLOPS

6.832 TFLOPS

FP64 (TFLOPS)

985.0 GFLOPS (1:32)

213.5 GFLOPS (1:32)

FP16 (TFLOPS)

31.52 TFLOPS (1:1)

—

AI/RT

RT Cores

72

—

Tensor Cores

288

—

Power

TDP

150 W

250 W

TDP (W)

150

250 +66.7%

Suggested PSU

450 W

600 W

Power Connectors

8-pin EPS

8-pin EPS

Architecture

Architecture

Ampere

Maxwell 2.0

GPU Name

GA102

GM200

Generation

Server Ampere (Axx)

Tesla Maxwell (Mxx)

Process Size

8 nm

28 nm

Transistors

28,300 million

8,000 million

Die Size

628 mm²

601 mm²

Foundry

Samsung

TSMC

Density

45.1M / mm²

13.3M / mm²

API Support

DirectX

12 Ultimate (12_2)

12 (12_1)

OpenGL

4.6

4.6

Vulkan

1.4

1.4

OpenCL

3.0

3.0

CUDA

8.6

5.2

Shader Model

6.8

6.8

Physical

Slot Width

Single-slot

Dual-slot

Length

267 mm 10.5 inches

267 mm 10.5 inches

Height

112 mm 4.4 inches

—

Outputs

No outputs

No outputs

Bus Interface

PCIe 4.0 x16

PCIe 3.0 x16

Other

Production

End-of-life

End-of-life

Predecessor

Tesla Turing

Tesla Kepler

Successor

Server Ada

Tesla Pascal

View A10G Details View Tesla M40 Details