GPU Comparison

NVIDIA

GEFORCE

NVIDIA A10M

CORE STATE GA102

VRAM 20 GB

CLOCK SPEED 1635 MHz

TDP 150 W

BUS WIDTH 320 bit

ARCHITECTURE Ampere

nm

PROCESS 8 nm

LAUNCH DATE —

VS

NVIDIA

GEFORCE

Tesla M40

CORE STATE GM200

VRAM 12 GB

CLOCK SPEED 1112 MHz

TDP 250 W

BUS WIDTH 384 bit

ARCHITECTURE Maxwell 2.0

nm

PROCESS 28 nm

LAUNCH DATE 2015

PERFORMANCE BENCHMARKS

geekbench_opencl

135,230

39,192

geekbench_vulkan

N/A

44,602

DETAILED SPECIFICATIONS

SPECIFICATION

A10M

Tesla M40

Core Specs

Shading Units

7,168

3,072 -57.1%

Shaders

7,168

3,072 -57.1%

TMUs

224

192 -14.3%

ROPs

80

96 +20.0%

SM Count

56

—

Clocks

Base Clock

975 MHz

948 MHz

Boost Clock

1635 MHz

1112 MHz

Memory Clock

1563 MHz 12.5 Gbps effective

1502 MHz 6 Gbps effective

Memory

Memory Size

20 GB

12 GB

VRAM (MB)

20,480

12,288 -40.0%

Memory Type

GDDR6

GDDR5

Memory Bus

320 bit

384 bit

Bandwidth

500.2 GB/s

288.4 GB/s

Cache

L1 Cache

128 KB (per SM)

48 KB (per SMM)

L2 Cache

6 MB

3 MB

Performance

Pixel Rate

130.8 GPixel/s

106.8 GPixel/s

Texture Rate

366.2 GTexel/s

213.5 GTexel/s

FP32 (TFLOPS)

23.44 TFLOPS

6.832 TFLOPS

FP64 (TFLOPS)

732.5 GFLOPS (1:32)

213.5 GFLOPS (1:32)

FP16 (TFLOPS)

23.44 TFLOPS (1:1)

—

AI/RT

RT Cores

56

—

Tensor Cores

224

—

Power

TDP

150 W

250 W

TDP (W)

150

250 +66.7%

Suggested PSU

450 W

600 W

Power Connectors

8-pin EPS

8-pin EPS

Architecture

Architecture

Ampere

Maxwell 2.0

GPU Name

GA102

GM200

Generation

Server Ampere (Axx)

Tesla Maxwell (Mxx)

Process Size

8 nm

28 nm

Transistors

28,300 million

8,000 million

Die Size

628 mm²

601 mm²

Foundry

Samsung

TSMC

Density

45.1M / mm²

13.3M / mm²

API Support

DirectX

12 Ultimate (12_2)

12 (12_1)

OpenGL

4.6

4.6

Vulkan

1.4

1.4

OpenCL

3.0

3.0

CUDA

8.6

5.2

Shader Model

6.8

6.8

Physical

Slot Width

Single-slot

Dual-slot

Length

267 mm 10.5 inches

267 mm 10.5 inches

Height

112 mm 4.4 inches

—

Outputs

No outputs

No outputs

Bus Interface

PCIe 4.0 x16

PCIe 3.0 x16

Other

Production

End-of-life

End-of-life

Predecessor

Tesla Turing

Tesla Kepler

Successor

Server Ada

Tesla Pascal

View A10M Details View Tesla M40 Details