GPU Comparison

NVIDIA
GEFORCE

NVIDIA A10M

CORE STATE GA102
VRAM 20 GB
CLOCK SPEED 1635 MHz
TDP 150 W
BUS WIDTH 320 bit
ARCHITECTURE Ampere
nm
PROCESS 8 nm
LAUNCH DATE
VS
NVIDIA
GEFORCE

Tesla M40

CORE STATE GM200
VRAM 12 GB
CLOCK SPEED 1112 MHz
TDP 250 W
BUS WIDTH 384 bit
ARCHITECTURE Maxwell 2.0
nm
PROCESS 28 nm
LAUNCH DATE 2015

PERFORMANCE BENCHMARKS

geekbench_opencl
135,230
39,192
geekbench_vulkan
N/A
44,602

DETAILED SPECIFICATIONS

SPECIFICATION
A10M
Tesla M40
Core Specs
Shading Units
7,168
3,072 -57.1%
Shaders
7,168
3,072 -57.1%
TMUs
224
192 -14.3%
ROPs
80
96 +20.0%
SM Count
56
Clocks
Base Clock
975 MHz
948 MHz
Boost Clock
1635 MHz
1112 MHz
Memory Clock
1563 MHz 12.5 Gbps effective
1502 MHz 6 Gbps effective
Memory
Memory Size
20 GB
12 GB
VRAM (MB)
20,480
12,288 -40.0%
Memory Type
GDDR6
GDDR5
Memory Bus
320 bit
384 bit
Bandwidth
500.2 GB/s
288.4 GB/s
Cache
L1 Cache
128 KB (per SM)
48 KB (per SMM)
L2 Cache
6 MB
3 MB
Performance
Pixel Rate
130.8 GPixel/s
106.8 GPixel/s
Texture Rate
366.2 GTexel/s
213.5 GTexel/s
FP32 (TFLOPS)
23.44 TFLOPS
6.832 TFLOPS
FP64 (TFLOPS)
732.5 GFLOPS (1:32)
213.5 GFLOPS (1:32)
FP16 (TFLOPS)
23.44 TFLOPS (1:1)
AI/RT
RT Cores
56
Tensor Cores
224
Power
TDP
150 W
250 W
TDP (W)
150
250 +66.7%
Suggested PSU
450 W
600 W
Power Connectors
8-pin EPS
8-pin EPS
Architecture
Architecture
Ampere
Maxwell 2.0
GPU Name
GA102
GM200
Generation
Server Ampere (Axx)
Tesla Maxwell (Mxx)
Process Size
8 nm
28 nm
Transistors
28,300 million
8,000 million
Die Size
628 mm²
601 mm²
Foundry
Samsung
TSMC
Density
45.1M / mm²
13.3M / mm²
API Support
DirectX
12 Ultimate (12_2)
12 (12_1)
OpenGL
4.6
4.6
Vulkan
1.4
1.4
OpenCL
3.0
3.0
CUDA
8.6
5.2
Shader Model
6.8
6.8
Physical
Slot Width
Single-slot
Dual-slot
Length
267 mm 10.5 inches
267 mm 10.5 inches
Height
112 mm 4.4 inches
Outputs
No outputs
No outputs
Bus Interface
PCIe 4.0 x16
PCIe 3.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
Tesla Turing
Tesla Kepler
Successor
Server Ada
Tesla Pascal
View A10M Details View Tesla M40 Details