GPU Comparison
GEFORCE
NVIDIA A10G
CORE STATE
GA102
VRAM
24 GB
CLOCK SPEED
1710 MHz
TDP
150 W
BUS WIDTH
384 bit
ARCHITECTURE
Ampere
PROCESS
8 nm
LAUNCH DATE
2021
VS
GEFORCE
Tesla M40
CORE STATE
GM200
VRAM
12 GB
CLOCK SPEED
1112 MHz
TDP
250 W
BUS WIDTH
384 bit
ARCHITECTURE
Maxwell 2.0
PROCESS
28 nm
LAUNCH DATE
2015
PERFORMANCE BENCHMARKS
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
A10G
Tesla M40
Core Specs
Shading Units
9,216
3,072
-66.7%
Shaders
9,216
3,072
-66.7%
TMUs
288
192
-33.3%
ROPs
96
96
0.0%
SM Count
72
—
Clocks
Base Clock
1320 MHz
948 MHz
Boost Clock
1710 MHz
1112 MHz
Memory Clock
1563 MHz
12.5 Gbps effective
1502 MHz
6 Gbps effective
Memory
Memory Size
24 GB
12 GB
VRAM (MB)
24,576
12,288
-50.0%
Memory Type
GDDR6
GDDR5
Memory Bus
384 bit
384 bit
Bandwidth
600.2 GB/s
288.4 GB/s
Cache
L1 Cache
128 KB (per SM)
48 KB (per SMM)
L2 Cache
6 MB
3 MB
Performance
Pixel Rate
164.2 GPixel/s
106.8 GPixel/s
Texture Rate
492.5 GTexel/s
213.5 GTexel/s
FP32 (TFLOPS)
31.52 TFLOPS
6.832 TFLOPS
FP64 (TFLOPS)
985.0 GFLOPS (1:32)
213.5 GFLOPS (1:32)
FP16 (TFLOPS)
31.52 TFLOPS (1:1)
—
AI/RT
RT Cores
72
—
Tensor Cores
288
—
Power
TDP
150 W
250 W
TDP (W)
150
250
+66.7%
Suggested PSU
450 W
600 W
Power Connectors
8-pin EPS
8-pin EPS
Architecture
Architecture
Ampere
Maxwell 2.0
GPU Name
GA102
GM200
Generation
Server Ampere
(Axx)
Tesla Maxwell
(Mxx)
Process Size
8 nm
28 nm
Transistors
28,300 million
8,000 million
Die Size
628 mm²
601 mm²
Foundry
Samsung
TSMC
Density
45.1M / mm²
13.3M / mm²
API Support
DirectX
12 Ultimate (12_2)
12 (12_1)
OpenGL
4.6
4.6
Vulkan
1.4
1.4
OpenCL
3.0
3.0
CUDA
8.6
5.2
Shader Model
6.8
6.8
Physical
Slot Width
Single-slot
Dual-slot
Length
267 mm
10.5 inches
267 mm
10.5 inches
Height
112 mm
4.4 inches
—
Outputs
No outputs
No outputs
Bus Interface
PCIe 4.0 x16
PCIe 3.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
Tesla Turing
Tesla Kepler
Successor
Server Ada
Tesla Pascal