GPU Comparison
NVIDIA A10M
GPU NAME: GA102
VRAM: 20 GB
BOOST CLOCK: 1635 MHz
TDP: 150 W
BUS WIDTH: 320 bit
ARCHITECTURE: Ampere
PROCESS: 8 nm
LAUNCH DATE: n/a
VS
Tesla P40
GPU NAME: GP102
VRAM: 24 GB
BOOST CLOCK: 1531 MHz
TDP: 250 W
BUS WIDTH: 384 bit
ARCHITECTURE: Pascal
PROCESS: 16 nm
LAUNCH DATE: 2016
PERFORMANCE BENCHMARKS
[Charts: Geekbench OpenCL, Geekbench Vulkan]
DETAILED SPECIFICATIONS
SPECIFICATION | A10M | Tesla P40 | DIFFERENCE (P40 vs. A10M)
Core Specs
Shading Units | 7,168 | 3,840 | -46.4%
TMUs | 224 | 240 | +7.1%
ROPs | 80 | 96 | +20.0%
SM Count | 56 | 30 | -46.4%
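The percentage column appears to express the Tesla P40 value relative to the A10M value. The following is a minimal Python sketch under that assumption; the helper name `pct_diff` is hypothetical and not part of the source page. It reproduces the Core Specs percentages from the listed unit counts.

```python
def pct_diff(a10m: float, p40: float) -> float:
    """Difference of the P40 value relative to the A10M baseline, in percent."""
    return (p40 - a10m) * 100 / a10m


# Core Specs rows from the table: (A10M, Tesla P40)
core_specs = {
    "Shading Units": (7168, 3840),
    "TMUs": (224, 240),
    "ROPs": (80, 96),
    "SM Count": (56, 30),
}

for name, (a10m, p40) in core_specs.items():
    # "+" in the format spec forces an explicit sign, matching the table style
    print(f"{name}: {pct_diff(a10m, p40):+.1f}%")
```

A negative result means the Tesla P40 has fewer units than the A10M; a positive result means it has more.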
Clocks
Base Clock | 975 MHz | 1303 MHz
Boost Clock | 1635 MHz | 1531 MHz
Memory Clock | 1563 MHz (12.5 Gbps effective) | 1808 MHz (7.2 Gbps effective)
Memory
Memory Size | 20 GB (20,480 MB) | 24 GB (24,576 MB) | +20.0%
Memory Type | GDDR6 | GDDR5
Memory Bus | 320 bit | 384 bit
Bandwidth | 500.2 GB/s | 347.1 GB/s
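The bandwidth figures follow from the memory clock, the effective data rate, and the bus width. The sketch below assumes 8 transfers per listed clock for GDDR6 and 4 for GDDR5; those multipliers are inferred from the listed clock and effective-rate pairs rather than stated on the page, and the function name `bandwidth_gb_s` is hypothetical.

```python
def bandwidth_gb_s(memory_clock_mhz: float, transfers_per_clock: int,
                   bus_width_bits: int) -> float:
    """Peak memory bandwidth in GB/s."""
    # Per-pin effective data rate in Gbps (assumed multiplier, see lead-in)
    effective_gbps = memory_clock_mhz * transfers_per_clock / 1000
    # Multiply by the number of pins, then convert bits to bytes
    return effective_gbps * bus_width_bits / 8


# A10M: GDDR6, 1563 MHz, 320-bit bus
a10m_bw = bandwidth_gb_s(1563, 8, 320)
# Tesla P40: GDDR5, 1808 MHz, 384-bit bus
p40_bw = bandwidth_gb_s(1808, 4, 384)

print(f"A10M:      {a10m_bw:.1f} GB/s")
print(f"Tesla P40: {p40_bw:.1f} GB/s")
```

Despite the narrower bus, the A10M comes out ahead because its per-pin data rate is much higher.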
Cache
L1 Cache | 128 KB (per SM) | 48 KB (per SM)
L2 Cache | 6 MB | 3 MB
Performance
Pixel Rate | 130.8 GPixel/s | 147.0 GPixel/s
Texture Rate | 366.2 GTexel/s | 367.4 GTexel/s
FP32 | 23.44 TFLOPS | 11.76 TFLOPS
FP64 | 732.5 GFLOPS (1:32) | 367.4 GFLOPS (1:32)
FP16 | 23.44 TFLOPS (1:1) | 183.7 GFLOPS (1:64)
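The theoretical rates above can be derived from the unit counts and the boost clock. The sketch below assumes the usual peak-throughput convention: one pixel per ROP per clock, one texel per TMU per clock, and two FP32 operations (one fused multiply-add) per shading unit per clock. The FP64 and FP16 ratios are taken from the table; the `Gpu` class and its method names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Gpu:
    name: str
    shading_units: int
    tmus: int
    rops: int
    boost_mhz: int

    def pixel_rate(self) -> float:
        """GPixel/s: one pixel per ROP per clock."""
        return self.rops * self.boost_mhz / 1000

    def texture_rate(self) -> float:
        """GTexel/s: one texel per TMU per clock."""
        return self.tmus * self.boost_mhz / 1000

    def fp32_gflops(self) -> float:
        """GFLOPS: two operations (one FMA) per shading unit per clock."""
        return self.shading_units * 2 * self.boost_mhz / 1000


a10m = Gpu("A10M", 7168, 224, 80, 1635)
p40 = Gpu("Tesla P40", 3840, 240, 96, 1531)

for gpu in (a10m, p40):
    print(f"{gpu.name}: {gpu.pixel_rate():.1f} GPixel/s, "
          f"{gpu.texture_rate():.1f} GTexel/s, "
          f"{gpu.fp32_gflops() / 1000:.2f} TFLOPS FP32, "
          f"{gpu.fp32_gflops() / 32:.1f} GFLOPS FP64 (1:32)")

# FP16 on the Tesla P40 runs at 1:64 of the FP32 rate per the table
print(f"Tesla P40 FP16: {p40.fp32_gflops() / 64:.1f} GFLOPS")
```

These are peak figures; the Tesla P40's higher pixel rate comes from its larger ROP count, while the A10M's FP32 lead comes from having nearly twice the shading units.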
AI/RT
RT Cores | 56 | n/a
Tensor Cores | 224 | n/a
Power
TDP | 150 W | 250 W | +66.7%
Suggested PSU | 450 W | 600 W
Power Connectors | 8-pin EPS | 8-pin EPS
Architecture
Architecture | Ampere | Pascal
GPU Name | GA102 | GP102
Generation | Server Ampere (Axx) | Tesla Pascal (Pxx)
Process Size | 8 nm | 16 nm
Transistors | 28,300 million | 11,800 million
Die Size | 628 mm² | 471 mm²
Foundry | Samsung | TSMC
Density | 45.1M / mm² | 25.1M / mm²
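The density row is the transistor count divided by the die area. A short sketch of that arithmetic, using a hypothetical helper name:

```python
def density_m_per_mm2(transistors_million: float, die_size_mm2: float) -> float:
    """Transistor density in millions of transistors per square millimetre."""
    return transistors_million / die_size_mm2


# (transistors in millions, die size in mm²) from the table
ga102 = density_m_per_mm2(28_300, 628)
gp102 = density_m_per_mm2(11_800, 471)

print(f"GA102: {ga102:.1f}M / mm²")
print(f"GP102: {gp102:.1f}M / mm²")
```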
API Support
DirectX | 12 Ultimate (12_2) | 12 (12_1)
OpenGL | 4.6 | 4.6
Vulkan | 1.4 | 1.4
OpenCL | 3.0 | 3.0
CUDA Compute Capability | 8.6 | 6.1
Shader Model | 6.8 | 6.8
Physical
Slot Width | Single-slot | Dual-slot
Length | 267 mm (10.5 inches) | 267 mm (10.5 inches)
Height | 112 mm (4.4 inches) | 111 mm (4.4 inches)
Outputs | No outputs | No outputs
Bus Interface | PCIe 4.0 x16 | PCIe 3.0 x16
Other
Launch Price | n/a | 5,699 USD
Production | End-of-life | End-of-life
Predecessor | Tesla Turing | Tesla Maxwell
Successor | Server Ada | Tesla Volta